Our most capable open models to date

April 3, 2026

79

At the edge, our E2B and E4B models redefine on-device utility, prioritizing multimodal capabilities, low-latency processing and seamless ecosystem integration over raw parameter count.

Powerful, accessible, open

To power the next generation of pioneering research and products, we’ve sized the Gemma 4 models specifically to run and fine-tune efficiently on hardware — from billions of Android devices worldwide, to laptop GPUs, all the way up to developer workstations and accelerators.

By using these highly optimized models, you can fine-tune Gemma 4 to achieve state-of-the-art performance on your specific tasks. We’ve already seen incredible success with this approach; for instance, INSAIT created a pioneering Bulgarian-first language model (BgGPT), and we worked with Yale University on Cell2Sentence-Scale to discover new pathways for cancer therapy, among many others.

Here is what makes Gemma 4 our most capable open model family yet:

Advanced reasoning: Capable of multi-step planning and deep logic, Gemma 4 demonstrates significant improvements in math and instruction-following benchmarks that require it.
Agentic workflows: Native support for function-calling, structured JSON output, and native system instructions enables you to build autonomous agents that can interact with different tools and APIs and execute workflows reliably.
Code generation: Gemma 4 supports high-quality offline code, turning your workstation into a local-first AI code assistant.
Vision and audio: All models natively process video and images, supporting variable resolutions, and excelling at visual tasks like OCR and chart understanding. Additionally, the E2B and E4B models feature native audio input for speech recognition and understanding.
Longer context: Process long-form content seamlessly. The edge models feature a 128K context window, while the larger models offer up to 256K, allowing you to pass repositories or long documents in a single prompt.
140+ languages: Natively trained on over 140 languages, Gemma 4 helps developers build inclusive, high-performance applications for a global audience.

Versatile models for diverse hardware

We are releasing the Gemma 4 model weights in sizes tailored for specific hardware and use cases, ensuring you get frontier-class reasoning wherever you need it:

26B and 31B models: Frontier intelligence, offline on your personal computers

Optimized to provide researchers and developers with state-of-the-art reasoning on accessible hardware, our unquantized bfloat16 weights fit efficiently on a single 80GB NVIDIA H100 GPU. For local setups, quantized versions run natively on consumer GPUs to power your IDEs, coding assistants and agentic workflows. Our 26B Mixture of Experts (MoE) focus on latency, activating only 3.8 billion of its total parameters during inference to deliver exceptionally fast tokens-per-second, while our 31B Dense is maximizing raw quality and provides a powerful foundation for fine-tuning.

Source link

Previous articleWhy ‘curate first, annotate smarter’ is reshaping computer vision development

Next articleCleveland’s open data overhaul

Our most capable open models to date

Powerful, accessible, open

Versatile models for diverse hardware

26B and 31B models: Frontier intelligence, offline on your personal computers

Related Articles

Are social platforms going to charge all users for access?

Microsoft’s open-source toolkit for controlling out-of-control AI agents

TikTok Shop grows to new EU merchant countries

LEAVE A REPLY Cancel reply

CATEGORIES & TAGS

LATEST COMMENTS

Most Popular

Understanding Plex UDP Amplification DDoS Attack

Major Tech Layoffs in 2024: An Updated Tracker

Addressing the Skills Gap to Keep Up with the Evolution of the Cloud

How Automotive Radars Are Advancing Safety Features

What Can IT Executives Do to Improve Mental Health for Themselves and Their Teams?

Our most capable open models to date

Powerful, accessible, open

Versatile models for diverse hardware

26B and 31B models: Frontier intelligence, offline on your personal computers

Related Articles

LEAVE A REPLY Cancel reply

Stay Connected

CATEGORIES & TAGS

LATEST COMMENTS

Most Popular