Models · Compare

Command A+ vs Grok 4.3

Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.

Specs

Field	A: Command A+	B: Grok 4.3
Released	2026-05-20	—
Developer	Cohere	xAI
Openness	Open	Proprietary
License	Apache-2.0	Proprietary
OSI-approved	yes	no
Data released	no	no
Training code	no	no
Architecture	moe	unknown
Total params	218B	—
Active params	25B	—
Experts	—	—
Context window	128K	—
Attention	unknown	unknown
Position enc.	unknown	unknown
Pretraining tokens	—	—
Post-training	sft, rlhf	rlhf
Training hardware	—	—
$/M input	$0.00	$1.25
$/M output	$0.00	$2.50
Output tok/sec	212.2	88.1

Benchmarks

Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.

General reasoning

GPQA-Diamond

76.1 2026-05-21

90.1 2026-05-21

Context · A

Cohere's first full Apache 2.0 model, released May 20 2026 as a 218B MoE activating 25B per token. Aimed at sovereign critical infrastructure deployments; fine-tuneable on classified data and runnable air-gapped. Distributed in BF16, FP8, and a W4A4 4-bit format. Multimodal and multilingual across 48 languages.

Context · B

xAI's most-intelligent-and-fastest model per the lab, released April 30 2026 after a Grok 4.3 Beta drop on April 17 for SuperGrok Heavy users. Native multimodal video understanding and direct PDF, spreadsheet, and PowerPoint generation from chat. Reasoning is now always-on rather than toggleable.

Command A+ detail → · Grok 4.3 detail →