The Open-Source AI Stack
RSS
All models

Models · claude

Claude 3.5 Haiku

Proprietary Anthropic · 2024-11-04 · Proprietary

Announced October 22 2024 alongside the upgraded Claude 3.5 Sonnet and the computer-use beta; generally available in early November. Anthropic positioned it as matching Claude 3 Opus on many intelligence benchmarks at Haiku-tier speeds, with a SWE-Bench Verified score of 40.6% making it briefly the best small-tier coding model from a major lab.

Cost

$0.80 / Mtok input
$4.00 / Mtok output

Anthropic API · as of 2026-05-21

source ↗

Speed

0 tok/sec output
0 ms TTFT

· as of 2026-05-21

source ↗

Architecture

tokens in Embedding vocab not disclosed × N layers Architecture not disclosed (proprietary or undocumented) Output projection tokens out
Schema-generated from data/models.yaml. Every label is auditable against the model's sources.

Specs

Architecture
unknown
Total params
not disclosed
Active params
not disclosed
Context window
200K tokens
Attention
unknown
Position encoding
unknown
Post-training
rlhf, constitutional
OSI-approved
no
Data released
no
Training code
not released

Benchmarks

Each score carries the date it was published; we never infer or interpolate missing scores.

General reasoning

MMLU-Pro 63.4 as of 2026-05-21 source ↗
GPQA-Diamond 40.8 as of 2026-05-21 source ↗

Code

SWE-Bench Verified 40.6 as of 2024-10-22 source ↗
LiveCodeBench 31.4 as of 2026-05-21 source ↗

Math

MATH 72.1 as of 2026-05-21 source ↗
AIME 2024 3.3 as of 2026-05-21 source ↗

Available quantizations

None. The weights are not distributed, so there are no public quantizations.

Notable innovations

  • · Small-tier model matching prior flagship on many benchmarks
  • · SWE-Bench Verified 40.6% at Haiku price tier

Lineage

Successor to Claude 3 Haiku at the small-fast tier of the Claude lineup.

Sources