The Open-Source AI Stack
RSS
All models

Models · Compare

Claude 3.5 Haiku vs Qwen 2.5 Coder 32B Instruct

Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.

Specs

Field A: Claude 3.5 Haiku B: Qwen 2.5 Coder 32B Instruct
Released 2024-11-042024-11-12
Developer AnthropicAlibaba
Openness ProprietaryOpen
License ProprietaryApache-2.0
OSI-approved noyes
Data released nono
Training code nono
Architecture unknowndense
Total params 32.5B
Active params
Experts
Context window 200K131K
Attention unknowngqa
Position enc. unknownrope
Pretraining tokens
Post-training rlhf, constitutionalsft, dpo
Training hardware
$/M input $0.80$0.00
$/M output $4.00$0.00
Output tok/sec 00

Benchmarks

Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.

General reasoning

MMLU-Pro 63.4 2026-05-21 63.5 2026-05-21
GPQA-Diamond 40.8 2026-05-21 41.7 2026-05-21

Code

SWE-Bench Verified 40.6 2024-10-22
LiveCodeBench 31.4 2026-05-21 29.5 2026-05-21

Math

MATH 72.1 2026-05-21 76.7 2026-05-21
AIME 2024 3.3 2026-05-21 12.0 2026-05-21

Context · A

Announced October 22 2024 alongside the upgraded Claude 3.5 Sonnet and the computer-use beta; generally available in early November. Anthropic positioned it as matching Claude 3 Opus on many intelligence benchmarks at Haiku-tier speeds, with a SWE-Bench Verified score of 40.6% making it briefly the best small-tier coding model from a major lab.

Context · B

Code-specialized 32B sibling to Qwen 2.5, released alongside 0.5B / 1.5B / 7B / 14B / 32B coder variants all under Apache 2.0. Alibaba positioned it as comparable to GPT-4o on Aider code-repair and first among open models on multi-language repair.

Claude 3.5 Haiku detail → · Qwen 2.5 Coder 32B Instruct detail →