The Open-Source AI Stack
RSS
All models

Models · Compare

Falcon 180B Chat vs Gemini 1.5 Pro

Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.

Specs

Field A: Falcon 180B Chat B: Gemini 1.5 Pro
Released 2023-09-042024-02-15
Developer Technology Innovation InstituteGoogle DeepMind
Openness Source-availableProprietary
License Falcon-180B TII LicenseProprietary
OSI-approved nono
Data released nono
Training code nono
Architecture denseunknown
Total params 180B
Active params
Experts
Context window 2K2.1M
Attention mqaunknown
Position enc. ropeunknown
Pretraining tokens
Post-training sftrlhf
Training hardware A100 40GB
$/M input $0.00
$/M output $0.00
Output tok/sec 0

Benchmarks

Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.

General reasoning

MMLU-Pro 75.0 2026-05-21
GPQA-Diamond 58.9 2026-05-21

Code

LiveCodeBench 31.6 2026-05-21

Math

MATH 87.6 2026-05-21
AIME 2024 23.0 2026-05-21

Context · A

TII's Falcon 180B was, at release, the largest openly available language model weights, fine-tuned on Ultrachat, Platypus, and Airoboros for the Chat variant. Trained on RefinedWeb across up to 4,096 A100 40GB GPUs on AWS SageMaker. The TII Falcon License is source-available with an acceptable-use policy attached.

Context · B

Google's first long-context Gemini checkpoint, introduced with a 128K standard window and a 1M token preview tier. Google described the design as a mixture-of-experts that activates a subset of expert networks per input, and demonstrated 99% recall on needle-in-a-haystack across 1M tokens at launch. The context window was later extended to 2M tokens in private preview, announced May 14 2024.

Falcon 180B Chat detail → · Gemini 1.5 Pro detail →