The Open-Source AI Stack
RSS
All models

Models · gemini

Gemini 2.0 Flash

Proprietary Google DeepMind · 2024-12-11 · Proprietary

Google's workhorse Gemini 2.0 checkpoint, released initially as an experimental developer preview. Google positioned it as outperforming Gemini 1.5 Pro on key benchmarks at roughly twice the speed, with native multimodal output (image generation, text-to-speech) and native tool use including Google Search and code execution. The Multimodal Live API added real-time audio and video streaming.

Cost

$0.15 / Mtok input
$0.60 / Mtok output

Google API · as of 2026-05-21

source ↗

Speed

0 tok/sec output
0 ms TTFT

· as of 2026-05-21

source ↗

Architecture

tokens in Embedding vocab not disclosed × N layers Architecture not disclosed (proprietary or undocumented) Output projection tokens out
Schema-generated from data/models.yaml. Every label is auditable against the model's sources.

Specs

Architecture
unknown
Total params
not disclosed
Active params
not disclosed
Context window
1.0M tokens
Attention
unknown
Position encoding
unknown
Post-training
rlhf
OSI-approved
no
Data released
no
Training code
not released

Benchmarks

Each score carries the date it was published; we never infer or interpolate missing scores.

General reasoning

MMLU-Pro 77.9 as of 2026-05-21 source ↗
GPQA-Diamond 62.3 as of 2026-05-21 source ↗

Code

LiveCodeBench 33.4 as of 2026-05-21 source ↗

Math

MATH 93.0 as of 2026-05-21 source ↗
AIME 2024 33.0 as of 2026-05-21 source ↗
AIME 2025 21.7 as of 2026-05-21 source ↗

Available quantizations

None. The weights are not distributed, so there are no public quantizations.

Notable innovations

  • · Native multimodal output (image, audio)
  • · Native tool use with Google Search and code execution
  • · Multimodal Live API for realtime audio and video

Sources