
Open-source AI models: what is actually open?

Everyone calls their model open-source. Here's a ranked breakdown of what's actually public — weights, inference, training code, data, post-training, and license — for the models that matter right now.

The phrase "open-source model" has become marketing shorthand. In practice, a release can give you downloadable weights and a smooth inference path while hiding everything you'd actually need to reproduce the system — the training data, the recipes, the post-training steps. That's not open-source. That's open-weight, and the distinction matters.

Open model research

Ranked by openness first, not just by benchmark strength.

Ranked by the Artificial Analysis Openness Index as of March 9, 2026. The goal isn't to pick a winner — it's to separate what you can actually reproduce from what you can only run.

Metric key

AA Rank

The model's position on the Artificial Analysis Openness Index.

AA Open

Normalized openness score. Higher means more of the stack is public and reproducible.

AA Int.

Normalized capability score. Shows where openness and raw performance diverge.

Status legend

Open: released and usable. Partial: described but not fully reproducible. Closed: not public.

W: Weights

Downloadable checkpoints.

INF: Inference

A first-party or official runtime path.

TRN: Training code

Enough released code or recipes to reproduce training.

DATA: Training data

Open data, or a public description of the mixture detailed enough to recreate it.

RL: Post-train / RL

An alignment and post-training recipe beyond a vague mention.

LIC: License

Commercially usable without meaningful extra restrictions.

| AA Rank | Model | Lab | AA Open | AA Int. | W | INF | TRN | DATA | RL | LIC |
|---|---|---|---|---|---|---|---|---|---|---|
| #3 | OLMo 3.1 32B Think | Allen Institute for AI | 88.89 | 13.94 | Open | Open | Open | Open | Open | Open |
| #5 | K2 Think V2 | MBZUAI / LLM360 | 88.89 | 24.12 | Open | Open | Open | Open | Open | Open |
| #11 | Nemotron 3 Nano 30B A3B | NVIDIA | 72.22 | 24.27 | Open | Open | Partial | Partial | Partial | Open |
| #19 | GLM-4.5 (Reasoning) | Z.ai | 55.56 | 26.42 | Open | Open | Partial | Closed | Partial | Open |
| #20 | GPT-OSS 120B | OpenAI | 55.56 | 26.03 | Open | Open | Partial | Closed | Partial | Open |
| #30 | Gemma 3 27B Instruct | Google | 50.00 | 10.31 | Open | Open | Closed | Closed | Partial | Open |
| #35 | Magistral Small 1.2 | Mistral | 50.00 | 18.16 | Open | Open | Closed | Closed | Partial | Open |
| #36 | DeepSeek R1 0528 | DeepSeek | 50.00 | 27.07 | Open | Open | Closed | Closed | Partial | Open |
| #41 | GLM-5 (Reasoning) | Z.ai | 50.00 | 49.77 | Open | Open | Closed | Closed | Closed | Open |
| #80 | DeepSeek V3.2 Exp | DeepSeek | 44.44 | 32.94 | Open | Open | Partial | Closed | Closed | Open |
| #97 | Llama 4 Maverick | Meta | 38.89 | 18.00 | Open | Open | Closed | Partial | Partial | Partial |
| #118 | Qwen3.5 397B A17B | Alibaba | 38.89 | 45.05 | Open | Open | Closed | Closed | Closed | Open |
| #127 | Kimi K2.5 | Moonshot AI | 33.33 | 46.81 | Open | Open | Closed | Partial | Partial | Partial |
| #138 | MiniMax-M2.5 | MiniMax | 27.78 | 41.93 | Open | Open | Closed | Partial | Partial | Partial |
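One way to make the component grid concrete is to encode the statuses and filter on them. A minimal sketch in Python, using a few rows from the snapshot — the field names and the pass/fail rule here are illustrative, not Artificial Analysis's actual schema or scoring:

```python
# Toy encoding of a few rows from the snapshot. Field names and the
# fully_open rule are this sketch's own, not the index's methodology.
from dataclasses import dataclass

COMPONENTS = ("weights", "inference", "training_code",
              "training_data", "post_train", "license")

@dataclass
class Release:
    model: str
    aa_open: float
    status: dict  # component -> "Open" | "Partial" | "Closed"

ROWS = [
    Release("OLMo 3.1 32B Think", 88.89, dict.fromkeys(COMPONENTS, "Open")),
    Release("GPT-OSS 120B", 55.56, {
        "weights": "Open", "inference": "Open", "training_code": "Partial",
        "training_data": "Closed", "post_train": "Partial", "license": "Open"}),
    Release("Kimi K2.5", 33.33, {
        "weights": "Open", "inference": "Open", "training_code": "Closed",
        "training_data": "Partial", "post_train": "Partial", "license": "Partial"}),
]

def fully_open(r: Release) -> bool:
    """Every tracked component must be released, not merely described."""
    return all(r.status[c] == "Open" for c in COMPONENTS)

reproducible = [r.model for r in ROWS if fully_open(r)]
print(reproducible)  # only OLMo passes in this subset
```

The strict all-or-nothing rule is deliberately harsher than the AA Open score, which evidently gives partial credit; it answers a narrower question: could you rebuild this model from what was published?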

What should count as open?

If a model is genuinely open, you should be able to do more than run it. The minimum useful stack is weights, inference code, training code, training data, and a post-training recipe — plus a license that doesn't claw back the freedom the release seems to promise.

What stands out in this snapshot

  • OLMo 3.1 32B Think and K2 Think V2 are the real benchmark for openness — they publish the data, recipes, and code, not just the checkpoint.
  • GPT-OSS is a serious step from OpenAI: permissive license, strong runtime support, downloadable weights. But the training stack stays private, which is the divide that matters.
  • The biggest pattern here is the openness-capability inversion. The most capable models are often the least open. That gap is widening.
  • If you're evaluating models for anything beyond inference, the AA Open score matters as much as the benchmark rank. Open-weight and open-source are not the same thing.

Sources