Apple Silicon (M-series) · The Open-Source AI Stack

Apple's M-series chips (M1, M2, M3, M4 across Pro, Max, Ultra tiers) are SoCs designed for laptops and desktops. Closed silicon on a custom ARM-based ISA, closed accelerator (the Neural Engine plus the GPU). What makes them matter for open-source AI is the unified memory architecture: CPU, GPU, and Neural Engine share the same physical memory, so a 192GB Mac Studio has 192GB of "VRAM" for inference purposes. For local AI specifically (single user, batch size 1), inference is bandwidth-bound: tokens per second is capped by memory bandwidth divided by model size in bytes. Apple's wide LPDDR5X bus delivers 273-800 GB/s depending on tier (M4 Pro to M2/M3 Ultra). That, combined with high memory capacity, makes Macs the only consumer hardware on the market that can hold a 70B model and run it at usable speed. NVIDIA's consumer cards (4090, 5090) cap memory capacity well below what 70B models need. AMD's Strix Halo (96GB unified memory) is the first credible non-Apple alternative. Production-ready and the de facto local-AI substrate in 2026. Used by llama.cpp (Apple Silicon backend is among the most optimized), Ollama (which wraps llama.cpp), and MLX (Apple's open ML framework specifically targeting Apple Silicon). The sovereignty critique: Apple's closed-everything ecosystem means "local AI on a Mac" still ties you to Apple's hardware and software roadmap. Genuinely open local-AI silicon at this capability remains a future bet (Tenstorrent, Strix Halo).

Sources

Apple M4 Pro / Max Specifications https://www.apple.com/newsroom/2024/10/new-macbook-pro-features-m4-family-of-chips-and-apple-intelligence/

Apple MLX Framework https://github.com/ml-explore/mlx

llama.cpp (Apple Silicon backend) https://github.com/ggml-org/llama.cpp

apple.com (audit-verified) https://www.apple.com/newsroom/2024/10/apple-introduces-m4-pro-and-m4-max/

apple.com (audit-verified) https://www.apple.com/newsroom/2025/03/apple-unveils-new-mac-studio-the-most-powerful-mac-ever/

Other projects at the Silicon layer

6 siblings · ordered open first

Tenstorrent (Wormhole, Blackhole) Open source

Open-trending AI accelerators on RISC-V; Jim Keller-led; tt-metal and tt-forge open.

RISC-V Open source

Open instruction set architecture; royalty-free; substrate for open silicon (CPUs and emerging AI accelerators).

NVIDIA H100 / H200 Proprietary

Hyperscaler-class AI accelerator with CUDA software moat; default frontier-training and frontier-inference hardware.

AMD MI300X / MI325X Proprietary

Highest-memory accelerator on the market (192 GB+ HBM); ROCm software stack open-source-adjacent.

Cerebras CS-3 Proprietary

Wafer-scale accelerator; proprietary but disruptive on inference economics for specific model sizes.

Groq LPU Proprietary

Language Processing Unit; proprietary; extraordinarily fast inference for small-to-medium models at low batch sizes.