Models · Compare

Mistral 7B v0.1 vs Gemini 1.5 Pro

Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.

Specs

Field	A: Mistral 7B v0.1	B: Gemini 1.5 Pro
Released	2023-09-27	2024-02-15
Developer	Mistral AI	Google DeepMind
Openness	Open	Proprietary
License	Apache-2.0	Proprietary
OSI-approved	yes	no
Data released	no	no
Training code	no	no
Architecture	dense	unknown
Total params	7B	—
Active params	—	—
Experts	—	—
Context window	—	2.1M
Attention	hybrid-gqa-sliding	unknown
Position enc.	rope	unknown
Pretraining tokens	—	—
Post-training	—	rlhf
Training hardware	—	—
$/M input	—	$0.00
$/M output	—	$0.00
Output tok/sec	—	0

Benchmarks

Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.

General reasoning

MMLU-Pro	—	75.0 2026-05-21
GPQA-Diamond	—	58.9 2026-05-21

Code

LiveCodeBench

—

31.6 2026-05-21

Math

MATH	—	87.6 2026-05-21
AIME 2024	—	23.0 2026-05-21

Context · A

The first open-weights model to beat Llama 2 13B at 7B scale using sliding-window attention. Mistral's Apache-2.0 release reset what "open" meant after Llama 2's community license.

Context · B

Google's first long-context Gemini checkpoint, introduced with a 128K standard window and a 1M token preview tier. Google described the design as a mixture-of-experts that activates a subset of expert networks per input, and demonstrated 99% recall on needle-in-a-haystack across 1M tokens at launch. The context window was later extended to 2M tokens in private preview, announced May 14 2024.

Mistral 7B v0.1 detail → · Gemini 1.5 Pro detail →