The Open-Source AI Stack
RSS
All models

Models · mistral-medium

Mistral Medium 3

Proprietary Mistral AI · 2025-05-07 · Proprietary

Mid-tier flagship released May 7 2025 at $0.40 / $2.00 per Mtok with a 128K context window. Mistral positioned it as roughly 90 percent of Claude Sonnet 3.7 performance at a fraction of the cost, with deployment supported on self-hosted setups starting at four GPUs.

Cost

$0.40 / Mtok input
$2.00 / Mtok output

Mistral La Plateforme · as of 2026-05-21

source ↗

Speed

29 tok/sec output
552 ms TTFT

· as of 2026-05-21

source ↗

Architecture

tokens in Embedding vocab not disclosed × N layers Architecture not disclosed (proprietary or undocumented) Output projection tokens out
Schema-generated from data/models.yaml. Every label is auditable against the model's sources.

Specs

Architecture
unknown
Total params
not disclosed
Active params
not disclosed
Context window
128K tokens
Attention
unknown
Position encoding
unknown
Post-training
sft, rlhf
OSI-approved
no
Data released
no
Training code
not released

Benchmarks

Each score carries the date it was published; we never infer or interpolate missing scores.

General reasoning

MMLU-Pro 76.0 as of 2026-05-21 source ↗
GPQA-Diamond 57.8 as of 2026-05-21 source ↗

Code

LiveCodeBench 40.0 as of 2026-05-21 source ↗

Math

MATH 90.7 as of 2026-05-21 source ↗
AIME 2024 44.0 as of 2026-05-21 source ↗
AIME 2025 30.3 as of 2026-05-21 source ↗

Available quantizations

None. The weights are not distributed, so there are no public quantizations.

Notable innovations

  • · Hybrid, on-premises, and in-VPC deployment options
  • · Coding and STEM focus

Lineage

New mid-tier slot between Mistral Small and Large; proprietary, not weights-released.

Sources