Models · Compare
Command A+ vs Grok 4.3
Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.
Specs
| Field | A: Command A+ | B: Grok 4.3 |
|---|---|---|
| Released | 2026-05-20 | — |
| Developer | Cohere | xAI |
| Openness | Open | Proprietary |
| License | Apache-2.0 | Proprietary |
| OSI-approved | yes | no |
| Data released | no | no |
| Training code | no | no |
| Architecture | moe | unknown |
| Total params | 218B | — |
| Active params | 25B | — |
| Experts | — | — |
| Context window | 128K | — |
| Attention | unknown | unknown |
| Position enc. | unknown | unknown |
| Pretraining tokens | — | — |
| Post-training | sft, rlhf | rlhf |
| Training hardware | — | — |
| $/M input | $0.00 | $1.25 |
| $/M output | $0.00 | $2.50 |
| Output tok/sec | 212.2 | 88.1 |
Benchmarks
Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.
General reasoning
| GPQA-Diamond | 76.1 2026-05-21 | 90.1 2026-05-21 |
Context · A
Cohere's first full Apache 2.0 model, released May 20 2026 as a 218B MoE activating 25B per token. Aimed at sovereign critical infrastructure deployments; fine-tuneable on classified data and runnable air-gapped. Distributed in BF16, FP8, and a W4A4 4-bit format. Multimodal and multilingual across 48 languages.
Context · B
xAI's most-intelligent-and-fastest model per the lab, released April 30 2026 after a Grok 4.3 Beta drop on April 17 for SuperGrok Heavy users. Native multimodal video understanding and direct PDF, spreadsheet, and PowerPoint generation from chat. Reasoning is now always-on rather than toggleable.