OLM Learning

Course Overview

A guided path from a little Python and a little deep learning to building and training your own language model — and understanding every piece. You write and run real code from the very first lesson, using OLM.

Note

Already comfortable with the basics and just want to train a model? Skip ahead to Your First Language Model and Custom Architectures in Start Building.

Who this is for

This course is for you if you can write a bit of Python and have seen a neural network at least once — enough to know roughly what a tensor and a weight are. If you're a second-year CS student who has taken one introductory ML course, you're exactly who it's written for. Everything specific to language models — tokens, embeddings, attention, transformers, how training works — we build up from the beginning, one lesson at a time.

Where you'll get to

  • Explain how a language model works and how it is trained.
  • Train a small GPT-style model on your own text and generate from it.
  • Read a modern architecture (Llama, Qwen) and recognise every component.
  • Assemble your own architecture from OLM's building blocks.
  • Know where to go next — scaling up, multiple GPUs, research.

The path

Work through the lessons in order; each is short and builds on the last. Keep Key Concepts handy for a quick definition of any term.

Foundations

Build for real

Once you've got the foundations, these tutorials take you the rest of the way:

How this course is different

Most resources teach language modelling starting from the mathematics. This one starts from building. From the first lessons you assemble and train real models with OLM's components, and pick up the theory alongside — as deep as you want to go. Curious what's under a step? Each lesson has optional going deeper notes. Just want to build? Skip them and you'll still come away with working models. It's the approach that made PyTorch approachable for deep learning: useful first, fundamental when you're ready.

For educators & institutions

OLM is built to be taught. The library is small and readable, every component maps one-to-one to a concept, and this course already sequences intuition → code → practice. That makes it a ready-made backbone for a semester course or lab module, a workshop or bootcamp unit, or self-study cohorts and reading groups.

Because the same Markdown powers this site and the GitHub repository, students can read along wherever they work, and the code is the textbook. If you're considering OLM for a course and want help adapting this path — or want to contribute lessons, exercises, or solutions — open an issue or reach out on GitHub.

Note

The foundation lessons (0–5) and the attention deep dive are all available now. Exercises and further material are planned.