Reyansh Sharma, a London/Cambridge-based researcher, was selected by the Mercatus Center's Emergent Ventures program in its 46th cohort (announced September 2025) for work on open-source mathematical training data and evaluation sets for language models. The Emergent Ventures application is a single short form covering the plan, a brief bio, and a ballpark budget, with Tyler Cowen as program director making the funding call after consulting informal advisers. Grants in the program typically fall in the $1K to $50K range.
The category Sharma is working in sits between the data and training layers and addresses a known bottleneck for open-weights reasoning models. Math corpora and step-level evaluation sets are scarce relative to demand. The reference points in the public literature are projects like OpenWebMath (an open dataset of mathematical web text) and NVIDIA's OpenMathInstruct-1 and OpenMathInstruct-2 (1.8M and larger instruction-tuning datasets). Closed labs are widely reported to have built much larger internal math corpora.
Public detail about the specific dataset, license, or evaluation framework Sharma is producing is not yet available beyond the cohort announcement at Marginal Revolution. The Emergent Ventures grant covers the early, exploratory phase of work where a single individual is the natural unit of funding.
Recipient
Reyansh Sharma
Funder
Emergent Ventures (Mercatus Center) · foundation · Global
Tyler Cowen's discretionary grant program at George Mason University's Mercatus Center; funds individuals working on under-supplied ideas including AI tools, AI policy, and AI for science.
Primary source
https://marginalrevolution.com/marginalrevolution/2025/09/emergent-ventures-winners-46th-cohort.html
Additional sources
More from Emergent Ventures (Mercatus Center)
- Data for the AIs 2025-12-01 · $1K-$50K range
Building open data resources for AI training. 50th cohort Emergent Ventures award.
- Privacy-preserving ML on encrypted data 2025-09-01 · $1K-$50K range
Privacy-preserving machine-learning models that operate directly on encrypted data without exposing the underlying inputs. 46th cohort Emergent Ventures award.