Raphael Milliere, at the time of the December 17, 2024 Cosmos award an Assistant Professor in Philosophy of Artificial Intelligence at Macquarie University, works on the cognitive science of large neural networks. He has since moved to the University of Oxford's Institute for Ethics in AI. His research program treats variable binding, the ability to temporarily associate symbols with values while keeping their identity, as the question that decides whether transformer models can perform genuinely symbolic computation.
The technical work, published as "How Do Transformers Learn Variable Binding in Symbolic Programs?" on arXiv (2505.20896), trains transformers on synthetic programs of variable assignments and dereferences, then uses mechanistic interpretability methods, including causal interventions, to reverse-engineer how the model implements the task. The paper identifies three training phases: random constant prediction, a shallow heuristic that prefers early assignments, and finally the emergence of a systematic dereferencing mechanism. The trained model uses the residual stream as an addressable memory space, with specific attention heads routing variable values across token positions and layers.
Variable binding has been a long-standing dividing line in cognitive science between connectionist and classical-symbolic accounts. Showing that transformers learn to implement it without dedicated architecture, and that the implementation is legible to interpretability tools, lands at the evaluation and weights cross-layers of the open stack. The grant falls in the Cosmos fast-grant range of $1K to $10K. The deliverable is open interpretability infrastructure rather than a closed product, consistent with the institute's emphasis on artifacts other researchers can build on.
Recipient
Raphaël Millière
Funder
Cosmos Institute · foundation · US
Backs philosopher-builders making prototypes, essays, and projects at the intersection of AI and human flourishing, with emphasis on reason, decentralization, and individual autonomy.
Primary source
https://blog.cosmos-institute.org/p/announcing-the-second-cohort-of-cosmos
Additional sources
More from Cosmos Institute
- Institute for Decentralized AI launch 2025-09 · 5 fully funded research positions
New institute combining research and infrastructure to support AI that operates without centralized gatekeepers. Initial cohort of five fully funded positions across Oxford and Stanford.
- Metalens 2025-08-15 · $1K-$10K plus compute
Open-source scientific research platform combining AI summarization with structured human checks, intended to help researchers see beyond hype and identify reliable claims in dense literature.
- Perplex 2025-08-15 · $1K-$10K plus compute
Tooling for surfacing hidden goals in closed AI systems by probing them with open-weight reference models. Aims to give independent reviewers a way to audit proprietary deployments.
- Policy Explorer 2025-08-15 · $1K-$10K plus compute
AI tool for analyzing the impacts and assumptions embedded in regulatory and policy proposals, geared toward helping civil-society analysts surface what a rule actually does.
- Argument Debugger 2025-08-15 · $1K-$10K plus compute
AI assistant that finds gaps in chains of reasoning and suggests repairs, framed as a tool for better arguments rather than persuasive ones. Built by a Harvard CS faculty member.