Models · Compare
Claude 3.5 Haiku vs Qwen 2.5 Coder 32B Instruct
Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.
Specs
| Field | A: Claude 3.5 Haiku | B: Qwen 2.5 Coder 32B Instruct |
|---|---|---|
| Released | 2024-11-04 | 2024-11-12 |
| Developer | Anthropic | Alibaba |
| Openness | Proprietary | Open |
| License | Proprietary | Apache-2.0 |
| OSI-approved | no | yes |
| Data released | no | no |
| Training code | no | no |
| Architecture | unknown | dense |
| Total params | — | 32.5B |
| Active params | — | — |
| Experts | — | — |
| Context window | 200K | 131K |
| Attention | unknown | gqa |
| Position enc. | unknown | rope |
| Pretraining tokens | — | — |
| Post-training | rlhf, constitutional | sft, dpo |
| Training hardware | — | — |
| $/M input | $0.80 | $0.00 |
| $/M output | $4.00 | $0.00 |
| Output tok/sec | 0 | 0 |
Benchmarks
Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.
General reasoning
| MMLU-Pro | 63.4 2026-05-21 | 63.5 2026-05-21 |
| GPQA-Diamond | 40.8 2026-05-21 | 41.7 2026-05-21 |
Code
| SWE-Bench Verified | 40.6 2024-10-22 | — |
| LiveCodeBench | 31.4 2026-05-21 | 29.5 2026-05-21 |
Math
| MATH | 72.1 2026-05-21 | 76.7 2026-05-21 |
| AIME 2024 | 3.3 2026-05-21 | 12.0 2026-05-21 |
Context · A
Announced October 22 2024 alongside the upgraded Claude 3.5 Sonnet and the computer-use beta; generally available in early November. Anthropic positioned it as matching Claude 3 Opus on many intelligence benchmarks at Haiku-tier speeds, with a SWE-Bench Verified score of 40.6% making it briefly the best small-tier coding model from a major lab.
Context · B
Code-specialized 32B sibling to Qwen 2.5, released alongside 0.5B / 1.5B / 7B / 14B / 32B coder variants all under Apache 2.0. Alibaba positioned it as comparable to GPT-4o on Aider code-repair and first among open models on multi-language repair.
Claude 3.5 Haiku detail → · Qwen 2.5 Coder 32B Instruct detail →