Glossary

What the terms mean

Definitions for the concepts, protocols, and projects this site uses across its 15 layers. Each entry has a short hover summary and a deeper page. Most show up inline in the prose as dotted underlines you can hover or tap.

183 entries · 15 layers covered

Infrastructure

16 terms

AI factory

A purpose-built data center optimized for AI training rather than general cloud workloads, characterized by liquid-cooled high-density GPU racks, gigawatt-scale single-tenant power, and tightly-coupled networking.
behind-the-meter

A power arrangement where generation sits on the same side of the utility meter as the load, letting a data center draw directly from the plant and bypass the grid.
decentralized GPU marketplace

A protocol market matching GPU supply from many independent providers to AI demand, settled on a token rail; Akash, io.net, Bittensor compute, and Hyperbolic are canonical.
decode also in Runtime

The second phase of LLM inference, generating one token at a time from the KV cache. Memory-bandwidth-bound; throughput tracks memory bandwidth more than peak compute.
direct-to-chip cooling

A cooling architecture that pipes liquid coolant directly to a cold plate on each processor, evacuating the 700+ watts per GPU that air cooling cannot handle.
expert parallelism also in Runtime

A parallelism strategy for mixture-of-experts models where different GPUs hold different experts; requires all-to-all communication on every token routing step.
gigawatt-class cluster

An AI training facility whose power draw is measured in gigawatts rather than megawatts, the scale at which siting decisions become grid-and-permitting problems rather than real-estate ones.
grid interconnect queue

The regulatory queue a new generation or load project must traverse to connect to the grid; currently the binding constraint on how fast gigawatt-class AI sites come online.
hyperscaler capex

The capital expenditure that Microsoft, Google, Amazon, Meta, and Oracle are spending on AI infrastructure, totaling roughly $300B+ annually by 2025-2026 and dominating the supply-and-demand signal for the entire stack.
neocloud

A specialized cloud provider focused exclusively on GPU and AI workloads, operating outside the traditional AWS/Azure/GCP hyperscaler perimeter, with CoreWeave, Lambda, and Voltage Park as the canonical examples.
nuclear PPA

A long-term contract under which an AI operator commits to buy a defined nuclear-generated power output, becoming the cornerstone financing mechanism for gigawatt-class AI buildouts.
prefill also in Runtime

The first phase of LLM inference, processing the input prompt and building the initial KV cache. Compute-bound and parallel across prompt tokens.
PUE

Power Usage Effectiveness, the ratio of total data center facility power to power delivered to IT equipment; lower is better, with 1.0 the floor and 1.1 a strong target.
SMR

A nuclear reactor under ~300 MW per unit, factory-fabricated rather than site-built, positioned as the firm-power option for AI data centers needing new generation faster than conventional plants deliver.
sovereign compute

AI compute capacity owned, operated, or contractually controlled by a nation-state for the use of its own institutions and citizens, distinct from rented capacity on US-hyperscaler clouds.
tensor parallelism also in Runtime

A way to split a single model across multiple GPUs by sharding each layer's weight matrices and doing an all-reduce after every layer. Bandwidth-hungry but layer-by-layer fine-grained.

Silicon

41 terms

Compute

23 terms

Data

10 terms

Training

49 terms

Weights

51 terms

Runtime

79 terms

Retrieval and Memory

11 terms

Agents

29 terms

Protocols

10 terms

Evaluation

19 terms

Governance

19 terms

Identity and Trust

6 terms

Safety and Guardrails

11 terms

Sovereignty and Decentralization Primitives

17 terms