Models · Apr 18, 2026

Google DeepMind releases Gemma 4, claiming state-of-the-art performance across four model sizes

The new open-source model family, available under an Apache 2.0 license, ranges from 2B to 31B parameters and includes native multimodal capabilities for vision, audio, and video processing.

TL;DR

Google DeepMind introduced Gemma 4, a family of four open-source models (2B, 4B, 26B, and 31B parameters) claimed to deliver advanced reasoning and agentic workflow capabilities.
The 31B model ranks #3 on Arena.ai's chat leaderboard as of April 1, 2026, with the 26B model at #6, according to the company's announcement.
All models offer context windows of 128K to 256K tokens, process video and images natively, and were trained on more than 140 languages.
The models are optimized for edge deployment across Android devices, laptops, and workstations, with smaller variants designed for on-device inference.
Gemma 4 includes support for function-calling, structured JSON output, and native system instructions for building autonomous agents.

Google DeepMind announced Gemma 4, a family of open-source language models released under an Apache 2.0 license. The lineup comprises four sizes: an effective 2B-parameter model, an effective 4B-parameter model, a 26B-parameter Mixture of Experts variant, and a 31B-parameter dense model. The company positions these as purpose-built for advanced reasoning and autonomous agent workflows, emphasizing efficiency across hardware targets from mobile devices to workstation accelerators.

According to the announcement, Gemma 4's 31B model ranked third on Arena.ai's chat leaderboard as of April 1, 2026, with the 26B variant ranking sixth. The company claims these models outperform certain larger competitors on benchmarks. Context windows range from 128,000 tokens in the smaller edge models to 256,000 tokens in the larger versions, and all variants natively process video, image, and audio input, with support for more than 140 languages.

The models support function calling, structured JSON output, and system-level instruction handling, capabilities intended for building autonomous agents. The announcement highlights prior community adoption, citing over 400 million downloads of earlier Gemma models and more than 100,000 community variants built from them. Earlier use cases it mentions include a collaboration with Yale University on molecular research and the development of a Bulgarian-language model.

Sources

01 Google DeepMind, Blog: "Gemma 4: Byte for byte, the most capable open models"
