O1
Open SourceText Generationby Allen Institute for AI (AI2)

OLMo 1.7

OLMo is a truly open language model from the Allen Institute for AI (AI2): open weights, open training code, and the open Dolma dataset. It is built for reproducible, transparent LLM research, with 1B and 7B sizes.

ai2llmolmoopen-source-aireproducible-aitransparent-ai
Quick facts
LicenseApache 2.0
Params7B
OpennessWeights+Data+Code
ByAI2
No ratings yet — be the first
Openness
Fully open
weights+code+data
Params
1B / 7B
and larger later
Dataset
Dolma
released
License
Apache 2.0
commercial OK

What is OLMo?

OLMo (Open Language Model) is a large language model from the Allen Institute for AI (AI2) built on a radical commitment to openness. Many 'open' models release only their weights; OLMo releases everything needed to reproduce it — the model weights, the full training code, training logs and intermediate checkpoints, the evaluation suite, and the complete training dataset (Dolma). Its purpose is to make large-model research transparent and reproducible, so scientists can study exactly how capabilities emerge. It is released under Apache 2.0, with 1B and 7B sizes.

What 'fully open' means

The key distinction is end-to-end transparency. With OLMo you can see the Dolma corpus the model learned from, rerun the exact training pipeline, inspect checkpoints from across training to study how the model evolved, and use the same evaluation harness AI2 used. This removes the guesswork that surrounds weights-only releases and turns the model into a genuine scientific artifact rather than an opaque black box, which is exactly what is needed for claims about large models to be independently checked, challenged and built upon by the wider research community.

What it is good at

OLMo's clearest strength is research and education: studying training dynamics, data influence, interpretability and reproducibility on a real, modern LLM. The models are also capable general-purpose base models for their size — competitive with other open 7B-class models of their generation — and serve as a clean, well-documented foundation to fine-tune when you want full visibility into provenance.

Licensing & access

OLMo is Apache 2.0 — fully open for research and commercial use — with weights and instruct/chat variants on Hugging Face and support in Transformers, plus Ollama for easy local runs. The 1B and 7B sizes run on accessible hardware (a single GPU, or quantised on consumer cards), and AI2 provides the Dolma dataset and training code separately for those who want the full stack.

Practical considerations

OLMo prioritises openness and reproducibility over chasing the absolute frontier, so on raw capability it may trail the very latest closed or larger models — though successive releases (the OLMo 2 line, with larger sizes) have closed much of that gap. Use the Instruct variants for assistant behaviour, and take advantage of the open checkpoints and data when transparency, auditability or research are priorities.

How it compares

Compared with GPT-Neo (an earlier open model), BLOOM (open multilingual) and MPT (commercially open), OLMo goes furthest on openness by releasing the training data and full pipeline, not just weights. For capability per parameter it is competitive with its generation's open 7B models. When reproducibility, provenance and scientific study matter as much as raw output, OLMo is the standout choice.

Getting started

Load an OLMo model from Hugging Face with Transformers, or pull it via Ollama for a quick local chat; and use the Instruct variant whenever you want clean assistant-style interaction rather than raw completion. For deeper research, fetch the complete Dolma dataset together with the open training code to fully reproduce or extend the original work, and and explore the released intermediate checkpoints to study, step by step, how the model's abilities actually developed over training.

Model variants

OLMo 1B

1B
Base

Lightweight

MOST POPULAR

OLMo 7B

7B
Base

Main open model

MOST POPULAR

OLMo 7B Instruct

7B
InstructChat

Tuned for chat

Capabilities

🔓
End-to-end open
Weights, training code, logs and the Dolma dataset are all released.
📚
Open training data
The full Dolma corpus is available to inspect and reuse.
🧭
Intermediate checkpoints
Models from across training let you study how capabilities emerged.
💬
Capable base
Competitive general-purpose generation for its size, with Instruct variants.

Pros & Cons

Pros6
  • Truly open: weights, code, data and logs
  • Open Dolma training dataset released
  • Intermediate checkpoints for study
  • Apache 2.0 — research and commercial use
  • Capable base for its size
  • Ideal for reproducible LLM research
Cons4
  • Prioritises openness over the frontier
  • Base models need the Instruct variant for chat
  • Earlier sizes trail the largest LLMs
  • Full reproduction needs real compute

Inspiration

OLMo 1.7 use cases & project ideas

Training research

Study how capabilities emerge.

Data influence

Trace behaviour to the Dolma corpus.

Open assistant base

Fine-tune with full provenance.

Education

Teach how an LLM is built.

FAQ

Frequently asked questions

It releases not just weights but the full training code, logs, intermediate checkpoints and the Dolma training dataset, for reproducible research.