Saturday, June 6, 2026
mGrowTech

Gemma 4 with quantization-aware training

by Josh
June 5, 2026


Since releasing Gemma 4 two months ago, we’ve been working continuously to expand its capabilities. First, we introduced Multi-Token Prediction (MTP) to accelerate inference, and just a couple of days ago, we released a 12B model to bridge the gap between our E4B and 26B MoE models.

Today, we are releasing new checkpoints optimized with Quantization-Aware Training (QAT) to make Gemma 4 even more efficient, so you can run models locally on everyday edge devices and consumer GPUs.

By simulating quantization during training, QAT minimizes quality loss when the model is compressed. This release includes QAT checkpoints for the popular Q4_0 quantization format as well as a novel quantization format specialized for mobile use cases. Using this mobile format, we’ve reduced the memory footprint of Gemma 4 E2B to 1 GB. Together, these checkpoints dramatically reduce memory requirements while preserving the capabilities and quality you expect from Gemma 4.

Preserving model quality while making models smaller

Quantization is a key technique for running models on consumer hardware: it reduces their memory footprint and also accelerates decode speed. However, standard Post-Training Quantization (PTQ), which quantizes the model only after training is finished, can still cost some quality. QAT instead integrates the quantization process directly into training, so the model learns weights that hold up under compression. PTQ is already effective at preserving quality, but our QAT results yield even higher overall quality than standard PTQ baselines.
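To make the idea of simulating quantization during training concrete, here is a minimal sketch in plain Python. It is not the Gemma 4 training recipe, which this post does not detail. It assumes symmetric per-tensor 4-bit rounding and a straight-through estimator, and the names `fake_quantize` and `qat_step` are hypothetical. The forward pass sees quantized weights, while updates are applied to the full-precision copy.

```python
def fake_quantize(weights, bits=4):
    """Round weights to a signed integer grid, then map them straight back to floats."""
    qmax = 2 ** (bits - 1) - 1  # 7 for signed 4-bit values
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return list(weights)
    scale = max_abs / qmax  # one scale for the whole tensor (an assumption of this sketch)
    return [max(-qmax, min(qmax, round(w / scale))) * scale for w in weights]


def qat_step(weights, x, y, lr=0.01, bits=4):
    """One SGD step on squared error, with quantization simulated in the forward pass."""
    quantized = fake_quantize(weights, bits)
    prediction = sum(wq * xi for wq, xi in zip(quantized, x))
    error = prediction - y
    # Straight-through estimator: rounding has zero gradient almost everywhere,
    # so the backward pass treats fake_quantize as the identity function.
    updated = [w - lr * 2.0 * error * xi for w, xi in zip(weights, x)]
    return updated, error ** 2


# Toy problem: fit two weights so that the quantized model maps x to y.
weights = [0.5, -0.2]
losses = []
for _ in range(200):
    weights, loss = qat_step(weights, x=[1.0, 2.0], y=1.0)
    losses.append(loss)
```

Because the loss is always computed on the quantized weights, the optimizer settles on full-precision values whose rounded versions still fit the data. PTQ, by contrast, rounds only once at the end.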

We applied this QAT recipe to the popular Q4_0 format to maximize performance across all the models. For the edge models (E2B and E4B), we rethought our approach to quantization and designed a mobile-specialized quantization scheme.
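For readers unfamiliar with Q4_0, the sketch below shows roughly how one block is encoded. It assumes the layout commonly associated with the format in llama.cpp: blocks of 32 weights that share one scale, with each weight stored as a 4-bit code offset by 8. The function names are hypothetical, the scale is kept as a Python float rather than a 16-bit one, and no 4-bit packing is done. The mobile-specialized format is not described in this post, so it is not sketched here.

```python
BLOCK_SIZE = 32  # weights per block (assumed Q4_0 layout)


def quantize_q4_0_block(block):
    """Encode 32 floats as one shared scale plus 32 integer codes in the range 0..15."""
    assert len(block) == BLOCK_SIZE
    # Use the largest-magnitude value, sign included, so it lands exactly on code 0.
    extreme = max(block, key=abs)
    scale = extreme / -8.0
    if scale == 0.0:
        return 0.0, [8] * BLOCK_SIZE  # an all-zero block; code 8 decodes to 0
    # Adding 8.5 and truncating rounds to the nearest code; 16 is clamped to 15.
    codes = [min(15, max(0, int(x / scale + 8.5))) for x in block]
    return scale, codes


def dequantize_q4_0_block(scale, codes):
    """Recover approximate floats: each code is an offset from 8, times the scale."""
    return [(q - 8) * scale for q in codes]


block = [i / 10 - 1.6 for i in range(BLOCK_SIZE)]  # toy weights from -1.6 to 1.5
scale, codes = quantize_q4_0_block(block)
restored = dequantize_q4_0_block(scale, codes)
max_error = max(abs(a - b) for a, b in zip(block, restored))
```

With one 16-bit scale per 32 four-bit codes, this layout costs (32 × 4 + 16) / 32 = 4.5 bits per weight.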

Saving on VRAM and Storage

Below are the approximate memory requirements indicating how much VRAM is required to load the models:
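For a rough sense of where figures like these come from, weight memory can be estimated as parameter count times bits per weight. The sketch below is back-of-the-envelope arithmetic, not official numbers. It assumes Q4_0 costs 4.5 bits per weight (32 four-bit values plus one 16-bit scale per block), treats "12B" as exactly 12 billion parameters, and ignores the KV cache, activations, and any tensors kept at higher precision. `weight_memory_gb` is a hypothetical helper.

```python
def weight_memory_gb(num_params, bits_per_weight):
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


Q4_0_BITS_PER_WEIGHT = (32 * 4 + 16) / 32  # assumed block layout: 4.5 bits per weight
BF16_BITS_PER_WEIGHT = 16

NOMINAL_PARAMS = 12e9  # assumption: the nominal size of the 12B model

bf16_gb = weight_memory_gb(NOMINAL_PARAMS, BF16_BITS_PER_WEIGHT)
q4_0_gb = weight_memory_gb(NOMINAL_PARAMS, Q4_0_BITS_PER_WEIGHT)
print(f"bf16: {bf16_gb:.2f} GB, Q4_0: {q4_0_gb:.2f} GB")
```

Real requirements will be somewhat higher, since loading a model also needs room for the context and runtime buffers.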


