The Open-Source AI Stack
RSS
All models

Models · llama

Llama 2 70B Chat

Source-available Meta · 2023-07-18 · Llama 2 Community License

The first Meta-blessed commercial-use open-weights release. Llama 2's community license permitted commercial use under a 700M-MAU cap, opening the door to a year of derivatives that defined what open-weights deployment looked like.

Cost

/ Mtok input
/ Mtok output

Together AI (legacy) · as of 2026-05-19

via Artificial Analysis ↗

Speed

tok/sec output

Together AI (legacy) · as of 2026-05-19

via Artificial Analysis ↗

Architecture

tokens in Embedding vocab 32,000 · llama2 tokenizer × 80 layers Grouped-Query Attention RoPE context 4,096 tokens Dense MLP SwiGLU activation (standard) 70B active params Output projection tokens out
Schema-generated from data/models.yaml. Every label is auditable against the model's sources.

Specs

Architecture
dense
Total params
70B
Active params
70B
Context window
not verified
Attention
gqa
Position encoding
rope
Pretraining tokens
2.0T
Training hardware
A100
Post-training
sft, rlhf
OSI-approved
no
Data released
no
Training code
not released

Benchmarks

Each score carries the date it was published; we never infer or interpolate missing scores.

Recommended use cases

  • historical baseline
  • permissive license under 700M MAU cap

Available quantizations

GGUF llama.cpp's container; the common local format, k-quants from Q2 to Q8. runs on llama.cpp, Ollama
AWQ Activation-aware 4-bit weight quantization for GPU serving. runs on vLLM, SGLang
GPTQ Post-training 4-bit weight quantization for GPU serving. runs on vLLM, SGLang, Transformers
FP8 8-bit float, frequently a native release on Hopper / Blackwell GPUs. runs on vLLM, SGLang, TensorRT-LLM

Verified via the Hugging Face model tree ↗. Community quantizations change over time; the families shown are those with published weights at audit time.

Notable innovations

  • · Commercial-use community license
  • · GQA on the 70B variant

Lineage

Architecture base for the Llama 3 family.

Derivatives

Sources