Governance

What it is

Overview

Definitions, licenses, foundations, and standards bodies. Governance does not produce silicon, weights, or runtime, but it produces the definitions by which the other layers get judged. “Open AI” means seven different things to seven different audiences; governance is where the meanings get reconciled or not.

Five things to keep in mind as you read:

Governance grades the rest of the stack. A model is “open” only relative to a definition; the definition is the governance artifact.
OSIgovernanceThe nonprofit that maintains the canonical Open Source Definition for software since 1998, and the OSAID definition for AI as of 2024. Open full entry is the dominant standards body for software. OSAIDgovernanceThe OSI's October 2024 definition of "open source AI," requiring not just weights but enough information about data, code, and architecture for third parties to reproduce the system. Open full entry v1.0 (October 2024) is its first formal AI definition.
The OSAID fight is about data. Required for “open AI” or not? The text says “sufficient information to reproduce”, not full release; critics want full release.
Foundations hold the IP. Linux Foundation stewards MCP, A2A, AAIF, and the AI Alliance. Apache stewards Apache 2.0.
Lab-specific licenses live outside this process. Llama Community License, Gemma Terms. None are OSI-open; many users treat them as such anyway.

The rest of this page walks the definitional fight, the foundations, and the lab-license category, then arrives at the 2026 revision cycle.

The OSAID v1.0 definition fight

The Open Source AI Definition v1.0 is the load-bearing governance artifact at this layer. Finalized by OSI in October 2024 after a two-year multi-stakeholder process (OSAID 1.0 announcement).

The text requires three components to be released under OSI-conformant terms:

The model weights and parameters
The training code, sufficient to retrain the system
“Sufficiently detailed information about the data used to train the system so that a skilled person can build a substantially equivalent system”

Clause 3 is the entire fight. The pro-OSAID-v1.0 reading (OSI’s): requiring full training-data release would exclude every commercially-trained model because of publisher licensing agreements that prevent redistribution. A definition no major lab can meet doesn’t shape the market.

The critics’ reading (notably Bruce Perens, the AI Now Institute, and parts of the EleutherAI / AI2 communities): “describe how to build it” without shipping the actual data lets labs claim openness while keeping the part competitors need. By analogy: would we call Linux “open source” if Linus shipped binaries plus a thorough README on how to write your own kernel, but not the source code? Of course not.

Both readings have force. The pragmatic-OSAID position lets the definition shape current practice; the strict-OSAID position keeps the definition meaningful by refusing to bend it. The 2026 revision cycle is where the tension comes due.

The foundations

The neutral hosts that hold spec IP and govern its evolution. These matter because a spec under one company’s control is vulnerable to that company changing it; a spec at a foundation has a multi-stakeholder process for change.

Linux Foundation — the most-active host for AI governance. Stewards MCP (since late 2025), A2A, AAIF, plus the AI Alliance Foundation (AAIF the standards body, not to be confused with AAIF the agent-identity protocol; the names are unfortunate) (LF AI & Data).
Apache Software Foundation — stewards Apache 2.0 (the most-used open license for ML), plus Apache Spark, Kafka, and many ML adjacent projects (Apache projects).
Open Source Initiative — stewards OSAID and the 10-criterion OSD that defines open-source software. Doesn’t host code; defines and certifies licenses (OSI).
OpenChain / SPDX — supply-chain governance and software-bill-of-materials standards. Not AI-specific but relevant for ML model packaging.
Free Software Foundation — the GPL-and-copyleft tradition. Has been notably absent from the OSAID conversation; considers OSAID v1.0 too permissive (FSF news).

The Linux Foundation’s strategy of attracting AI protocol governance early is the most visible “neutral host” play of 2024-2026. Whether it works depends on whether the original lab sponsors (Anthropic for MCP, Google for A2A) continue to treat the foundation-hosted spec as authoritative when their internal priorities conflict with the multi-vendor consensus.

The lab-license category

Operating outside the OSI process. Each is a take-it-or-leave-it contract written by a single lab.

Llama Community License (Meta, current at Llama 4 in 2025), free for most uses; Section 2 requires a license request from Meta if your products or services exceed 700M monthly active users on the release date; Section 1.b.iv incorporates an Acceptable Use Policy by reference (Llama 4 Community License, Llama 4 AUP).
Gemma Terms (Google) — similar shape, with Google’s own AUP and a use-restriction list (Gemma Prohibited Use Policy).
Mistral Research License (for the larger Mistral models) — research-only, no commercial use without separate license (Mistral AI Non-Production License).
DeepSeek License — varies by model; some MIT-equivalent, some with additional terms. Read the LICENSE file per release.
OpenRAIL-M family (used by some BigScience / BLOOM releases) — responsible-AI license category that adds use-restrictions to an otherwise-permissive shape (RAIL definition).

None of these are OSI-open. The OpenRAIL family was explicitly designed to NOT be open-source by OSI standards because adding use-restrictions violates OSD criterion 6 (no discrimination against fields of endeavor). The lab-license families inherit similar tensions.

In practice, most users treat the lab licenses as “open enough” because the realistic alternative is fully-proprietary API access. The governance argument is that “open enough” is a strategic concession that lets the definition drift toward what the labs want it to mean.

What’s open and what isn’t

The governance layer is open by nature (definitions are public; specs are published) but the question of which definition counts is itself contested.

OSI-conformant licenses: Apache 2.0, MIT, BSD-3-Clause, and the OSI-approved list. These are unambiguously open.
OSAID v1.0-conformant releases: OLMo, a handful of others. Defined by OSI as “open source AI”.
Lab licenses: Llama, Gemma, Mistral Research. Widely-used, not OSI-open.
OpenRAIL family: explicitly not-open by OSI, with use-restrictions. Used by some research releases.
Proprietary: GPT, Claude, Gemini. No license; API ToS.

The reverse-lock-in risk at this layer is definitional drift. If “open AI” comes to mean “Llama-style source-available”, the strict definition loses its market influence, and labs face no incentive to ship data or training code.

The editorial tension

The OSAID v1.0 settlement was a pragmatic compromise. The 2026 revision cycle is where it either holds or breaks.

The case for holding the v1.0 line: the definition is finally in production use. Major labs are starting to label releases “OSAID-conformant” where applicable. Changing the rules quickly would destroy the brand the definition is building.

The case for tightening: nearly every “open-weights” release in 2025-2026 met v1.0’s text by writing more thorough README files, not by shipping more data. The labs hit the spec without ever changing their data-disclosure practice, which suggests the bar is too low.

Whether the open-source-AI definition is something that gets strict enough to actually distinguish what AI2 ships from what Meta ships, or stays loose enough to accommodate both, is the governance question that decides what “open AI” means for the next five years.

Key projects

9 catalogued · ordered open first · "Details" for a project page, "Ask" for an in-context chat

Agentic AI Foundation (AAIF) Open source

Linux Foundation umbrella for agent and protocol governance; founded Dec 2025; hosts MCP, goose, AGENTS.md.

Standard · new
Open Source Initiative Open source

Body that defines and certifies open source; oversees the Open Source AI Definition (OSAID).

Standard · stable
OSAID v1.0 Open source

Open Source AI Definition v1.0 (October 2024); the first formal definition of 'open source AI'; v2.0 cycle in 2026.

Standard · stable
Linux Foundation Open source

Neutral hosting and governance body; owns AAIF, LF AI and Data, A2A, and many other open AI projects.

Standard · stable
Apache License 2.0 Open source

Most-used license for open AI projects; explicit patent grant plus patent-retaliation clause; durable rights.

Apache 2.0 · stable
MIT License Open source

Shortest permissive open-source license; no explicit patent language; common in agent and runtime projects.

MIT · stable
Llama Community License Source available

Meta's source-available license; 700M MAU clause; OSI-rejected. Sets the precedent for 'source-available AI.'

Source-available · stable
Gemma Terms Source available

Google's open-weights license for Gemma; source-available; prohibited-use policy parallels Meta's.

Source-available · stable
IndiaAI Mission Proprietary

India's national AI mission, ~Rs 10,372 crore approved by the Union Cabinet in March 2024; a Common Compute Facility track procures shared GPU capacity for Indian researchers and startups.

Government-funded · stable

Reading list

12 hand-picked · primary sources first · recent within each group

Papers

On the Opportunities and Risks of Foundation Models 2021

Stanford HAI

The paper that named the category. Still the best framing of why foundation models are a political object, not only a technical one.

Posts

Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text 2025

EleutherAI + partner institutions

First serious attempt at a fully license-clean pretraining corpus at scale. Read the methodology, not just the announcement.
AI Snake Oil 2025

Arvind Narayanan, Sayash Kapoor

The single most-useful ongoing critical reading on AI claims. Filter to the evaluation posts.
AI 2027 2025

AI Futures Project

Scenario-based forecast that grounded a lot of 2025 safety-policy debate. Read as fiction-with-cited-priors.
Anthropic Economic Index 2025

Anthropic

Recurring report on how Claude is actually used in the workforce. Best public proxy for measuring agent and assistant adoption by occupation.
The NYT v. OpenAI Complaint 2023

New York Times v. Microsoft, OpenAI (SDNY)

Read the actual complaint, not the news coverage. Sets the legal frame for every dataset-licensing decision being made today.

Docss

EU AI Act, Plain English 2025

Future of Life Institute

The most accessible summary of the EU AI Act with citations to articles. Read before citing 'GPAI obligations'.
Choose an Open Source License 2025

GitHub

The first stop before publishing anything. Pragmatic license comparison without lawyerese.
The Foundation Model Transparency Index, 2025 2025

Stanford HAI / CRFM

Scored transparency across 14 frontier developers on data, labor, compute, and downstream impact. The most-actionable open-vs-closed scorecard.
The Open Source AI Definition v1.0 2024

OSI

Read the actual text, not the takes about it. The most-contested definition in the field.
The Foundation Model Transparency Index 2024

Stanford HAI

Multi-dimensional scoring of transparency across major foundation models. Most-useful published artifact for this conversation.
Executive Order on AI (and its rescission) 2023

White House archives

Read the original EO, then read the 2025 rescission. The arc tells you where US AI policy is.