The Open-Source AI Stack
RSS

Track your progress. Log in to sync your course progress, notes, and chat history across devices.

Learn

Three self-paced tracks. The first walks the stack layer by layer and why openness matters at each one. The second covers how LLMs actually work, the mechanics from the inference loop to fine-tuning. The third covers how to run the stack on hardware you control. Pick any; do all three, in any order.

How LLMs work 14 modules · mechanics

The model-side foundation. Tokens, transformers, attention, the KV cache, decoding, chat templates, long context, RAG, tool use, fine-tuning. Start with the loop; the rest follows.