• About Us
  • Disclaimer
  • Contact Us
  • Privacy Policy
Friday, June 5, 2026
mGrowTech
No Result
View All Result
  • Technology And Software
    • Account Based Marketing
    • Channel Marketing
    • Marketing Automation
      • Al, Analytics and Automation
      • Ad Management
  • Digital Marketing
    • Social Media Management
    • Google Marketing
  • Direct Marketing
    • Brand Management
    • Marketing Attribution and Consulting
  • Mobile Marketing
  • Event Management
  • PR Solutions
  • Technology And Software
    • Account Based Marketing
    • Channel Marketing
    • Marketing Automation
      • Al, Analytics and Automation
      • Ad Management
  • Digital Marketing
    • Social Media Management
    • Google Marketing
  • Direct Marketing
    • Brand Management
    • Marketing Attribution and Consulting
  • Mobile Marketing
  • Event Management
  • PR Solutions
No Result
View All Result
mGrowTech
No Result
View All Result
Home Google Marketing

Gemma 4 with quantization-aware training

Josh by Josh
June 5, 2026
in Google Marketing
0
Gemma 4 with quantization-aware training


Since releasing Gemma 4 two months ago, we’ve been continuously working to expand its capabilities. First, we introduced Multi-Token Prediction (MTP) to accelerate inference, and just a couple of days ago, we released a 12B model to bridge the gap between our E4B and 26B MOE models.

Today, we are releasing new checkpoints optimized with Quantization-Aware Training (QAT) to make Gemma 4 even more efficient, so you can run models locally on everyday edge devices and consumer GPUs.

By simulating quantization during training, QAT minimizes quality loss when the model is compressed. This release includes QAT checkpoints for the popular Q4_0 quantization format as well as a novel quantization format specialized for mobile use cases. Using this mobile format, we’ve reduced the memory footprint of Gemma 4 E2B to 1GB. Together, these dramatically reduce memory requirements while preserving the capabilities and quality you expect from Gemma 4.

Keeping model quality while making them smaller

Quantization is a key technology to run models on consumer hardware by reducing their memory footprint while also accelerating decode speed. However, standard Post-Training Quantization (PTQ) often leads to performance degradation. Instead of simply quantizing the model after training, QAT integrates the quantization process directly into training. While PTQ is already effective at preserving quality, our QAT results yield even higher overall quality compared to standard PTQ baselines.

We applied this QAT recipe to the popular Q4_0 format to maximize performance for all the models. For the edge models (E2B and E4B), we rethought how we approach quantization with a special mobile-specialized quantization schema.

Saving on VRAM and Storage

Below are the approximate memory requirements indicating how much VRAM is required to load the models:



Source_link

READ ALSO

Google tests sending Chrome users straight into AI Mode

Google must let publishers opt out of AI Search features, rules UK

Related Posts

Google tests sending Chrome users straight into AI Mode
Google Marketing

Google tests sending Chrome users straight into AI Mode

June 5, 2026
Google must let publishers opt out of AI Search features, rules UK
Google Marketing

Google must let publishers opt out of AI Search features, rules UK

June 5, 2026
New perspectives on generative media for startups
Google Marketing

New perspectives on generative media for startups

June 5, 2026
This Google Photos update has saved your digital photo frame
Google Marketing

This Google Photos update has saved your digital photo frame

June 5, 2026
Google and Utah State Board of Education partner on Gemini tools
Google Marketing

Google and Utah State Board of Education partner on Gemini tools

June 5, 2026
Google is letting social media stars customize their search result page
Google Marketing

Google is letting social media stars customize their search result page

June 4, 2026
Next Post
Data Mining vs Data Warehousing: Understanding the Key Differences

Data Mining vs Data Warehousing: Understanding the Key Differences

POPULAR NEWS

Trump ends trade talks with Canada over a digital services tax

Trump ends trade talks with Canada over a digital services tax

June 28, 2025
15 Trending Songs on TikTok in 2025 (+ How to Use Them)

15 Trending Songs on TikTok in 2025 (+ How to Use Them)

June 18, 2025
Communication Effectiveness Skills For Business Leaders

Communication Effectiveness Skills For Business Leaders

June 10, 2025
App Development Cost in Singapore: Pricing Breakdown & Insights

App Development Cost in Singapore: Pricing Breakdown & Insights

June 22, 2025
Comparing the Top 7 Large Language Models LLMs/Systems for Coding in 2025

Comparing the Top 7 Large Language Models LLMs/Systems for Coding in 2025

November 4, 2025

EDITOR'S PICK

Maytag Promo Codes and Deals: Appliances Under $300

Maytag Promo Codes and Deals: Appliances Under $300

April 4, 2026
Google’s Gemini is now on Android Auto

Google’s Gemini is now on Android Auto

November 24, 2025

How to Get Marketing Help You Can Trust (After Getting Burned Before)

November 20, 2025
How to watch Jensen Huang’s Nvidia GTC 2026 keynote

How to watch Jensen Huang’s Nvidia GTC 2026 keynote

March 13, 2026

About

We bring you the best Premium WordPress Themes that perfect for news, magazine, personal blog, etc. Check our landing page for details.

Follow us

Categories

  • Account Based Marketing
  • Ad Management
  • Al, Analytics and Automation
  • Brand Management
  • Channel Marketing
  • Digital Marketing
  • Direct Marketing
  • Event Management
  • Google Marketing
  • Marketing Attribution and Consulting
  • Marketing Automation
  • Mobile Marketing
  • PR Solutions
  • Social Media Management
  • Technology And Software
  • Uncategorized

Recent Posts

  • A viral space moment fell in Nutella’s lap. Here’s how they capitalized.
  • LinkedIn Crossclimb Answer Today for June 5, 2026 (Puzzle #766)
  • OpenAI Rolls Out A Lockdown Mode For Extra Protection Against Prompt Injection Attacks
  • Insider One vs Bloomreach: Pricing & TCO Analysis
  • About Us
  • Disclaimer
  • Contact Us
  • Privacy Policy
No Result
View All Result
  • Technology And Software
    • Account Based Marketing
    • Channel Marketing
    • Marketing Automation
      • Al, Analytics and Automation
      • Ad Management
  • Digital Marketing
    • Social Media Management
    • Google Marketing
  • Direct Marketing
    • Brand Management
    • Marketing Attribution and Consulting
  • Mobile Marketing
  • Event Management
  • PR Solutions