Models · Compare

Gemma 2 27B IT vs Claude 3.5 Sonnet

Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.

Specs

Field	A: Gemma 2 27B IT	B: Claude 3.5 Sonnet
Released	2024-06-27	2024-06-20
Developer	Google DeepMind	Anthropic
Openness	Source-available	Proprietary
License	Gemma Terms of Use	Proprietary
OSI-approved	no	no
Data released	no	no
Training code	no	no
Architecture	dense	unknown
Total params	27.2B	—
Active params	—	—
Experts	—	—
Context window	8K	200K
Attention	hybrid-gqa-sliding	unknown
Position enc.	rope	unknown
Pretraining tokens	—	—
Post-training	sft, rlhf, knowledge-distillation	rlhf, constitutional
Training hardware	TPU v5p	—
$/M input	—	—
$/M output	—	—
Output tok/sec	—	—

Benchmarks

Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.

General reasoning

Code

Context · A

Distilled from a larger Gemini teacher, demonstrating knowledge-distillation as a path to mid-scale open-weights quality. Source-available license restricts some uses.

Context · B

The first Claude release to beat its own larger sibling (Claude 3 Opus) on most benchmarks. Established Artifacts, driving a wave of code-and-canvas product copies.

Gemma 2 27B IT detail → · Claude 3.5 Sonnet detail →