The Open-Source AI Stack

Gemini 2.0 Flash

Google DeepMind · 2024-12-11 · Proprietary

Google's workhorse Gemini 2.0 checkpoint, released initially as an experimental developer preview. Google positioned it as outperforming Gemini 1.5 Pro on key benchmarks at roughly twice the speed, with native multimodal output (image generation, text-to-speech) and native tool use including Google Search and code execution. The Multimodal Live API added real-time audio and video streaming.

Cost

$0.15 / Mtok input

$0.60 / Mtok output

Google API · as of 2026-05-21
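
As a rough illustration of how the per-Mtok rates above translate into a per-request bill, here is a minimal sketch. It assumes flat pricing at the listed rates with no caching discounts, tiering, or free quota; the function name `estimate_cost_usd` and the token counts are hypothetical and not part of any Google API.

```python
from decimal import Decimal

# Rates copied from the listing above (USD per million tokens).
INPUT_USD_PER_MTOK = Decimal("0.15")
OUTPUT_USD_PER_MTOK = Decimal("0.60")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request at flat per-Mtok rates.

    Hypothetical helper: assumes no caching discount, tiering, or free quota.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_USD_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Illustrative request: 200k input tokens, 10k output tokens.
    print(f"${estimate_cost_usd(200_000, 10_000)}")
```

Decimal is used here only to keep the cent-level arithmetic exact; real invoices may round or bill differently.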

Speed

Output speed (tok/sec): not reported

TTFT (ms): not reported

as of 2026-05-21

Architecture

Schema-generated from data/models.yaml. Every label is auditable against the model's sources.

Specs

Architecture: unknown
Total params: not disclosed
Active params: not disclosed
Context window: 1.0M tokens
Attention: unknown
Position encoding: unknown
Post-training: RLHF
OSI-approved: no
Data released: no
Training code: not released

Benchmarks

Each score carries the date it was published; we never infer or interpolate missing scores.

General reasoning

MMLU-Pro	77.9	as of 2026-05-21	source ↗
GPQA-Diamond	62.3	as of 2026-05-21	source ↗

Code

LiveCodeBench	33.4	as of 2026-05-21

Math

MATH	93.0	as of 2026-05-21	source ↗
AIME 2024	33.0	as of 2026-05-21	source ↗
AIME 2025	21.7	as of 2026-05-21	source ↗

Available quantizations

None. The weights are not distributed, so there are no public quantizations.

Notable innovations

· Native multimodal output (image, audio)
· Native tool use with Google Search and code execution
· Multimodal Live API for realtime audio and video

Sources

Introducing Gemini 2.0: our new AI model for the agentic era (Google, Dec 11 2024) ↗