HOME / CATALOG / LARGE LANGUAGE MODELS (LLMS)
01
ROADMAP / ADVANCED

Large Language Models (LLMs)

38 TOPICS · 35 HOURS · ADVANCED · SCALE 1:4
START CANVAS

A comprehensive learning path covering LLMs from foundational ML concepts through transformer architecture, training, fine-tuning, prompt engineering, and production deployment.


§ SYLLABUS

§ SECTION 01 · FOUNDATIONS
  1. 01
    Machine Learning Fundamentals

    Understand supervised and unsupervised learning, loss functions, gradient descent, and overfitting. These are the building blocks every ML system relies on.

  2. 02
    Neural Networks & Deep Learning

    Learn how layers of neurons compose to approximate complex functions, including backpropagation, activation functions, and common architectures like MLPs and CNNs.

  3. 03
    NLP Fundamentals

    Explore how computers process human language — tokenization, bag-of-words, TF-IDF, and the challenges of representing meaning in text.

  4. 04
    Word Embeddings

    Understand how words are mapped to dense vector spaces (Word2Vec, GloVe) where semantic relationships are preserved as geometric structure.

  5. 05
    RNNs & Sequence Models

    Learn how recurrent neural networks process sequential data, including LSTMs, GRUs, and the encoder-decoder pattern for sequence-to-sequence tasks.

  6. 06
    Foundations Complete

    You now understand the core ML and NLP concepts that motivated the development of transformers and LLMs.

§ SECTION 02 · TRANSFORMER ARCHITECTURE
  1. 01
    Attention Mechanism

    Understand how attention lets a model dynamically focus on relevant parts of the input, solving the information bottleneck problem of fixed-size hidden states.

  2. 02
    Self-Attention & Multi-Head Attention

    Learn how self-attention computes query-key-value relationships within a sequence, and how multiple heads capture different types of relationships in parallel.

  3. 03
    Positional Encoding

    Understand how transformers inject sequence order information since self-attention is permutation-invariant — covering sinusoidal, learned, and rotary position embeddings.

  4. 04
    Transformer Blocks & Architecture

    See how attention, layer normalization, and feed-forward networks compose into transformer blocks, and how stacking them creates the full encoder-decoder or decoder-only architecture.

  5. 05
    Encoder-Only vs Decoder-Only vs Encoder-Decoder

    Compare the three transformer variants (BERT-style, GPT-style, T5-style) and understand why decoder-only models became dominant for generative LLMs.

  6. 06
    Transformer Architecture Mastered

    You can explain how a transformer processes text from input tokens to output probabilities, and why this architecture replaced RNNs.

§ SECTION 03 · PRETRAINING & DATA
  1. 01
    Tokenization (BPE, SentencePiece, Tiktoken)

    Learn how raw text is split into subword tokens that the model processes — including byte-pair encoding, unigram models, and the tradeoffs of vocabulary size.

  2. 02
    Pretraining Objectives

    Understand causal language modeling (next-token prediction) and masked language modeling, and why the pretraining objective shapes what the model learns.

  3. 03
    Training Data & Curation

    Explore where LLM training data comes from (Common Crawl, books, code), how it is filtered and deduplicated, and how data quality directly affects model capability.

  4. 04
    Scaling Laws & Compute-Optimal Training

    Learn the empirical relationships between model size, data size, and compute budget (Chinchilla scaling laws) that guide how to allocate resources for training.

  5. 05
    Distributed Training Infrastructure

    Understand data parallelism, model parallelism, pipeline parallelism, and mixed-precision training — the engineering that makes training billion-parameter models feasible.

  6. 06
    Pretraining Understood

    You understand how a raw transformer becomes a capable language model through large-scale pretraining on text data.

§ SECTION 04 · FINE-TUNING & ALIGNMENT
  1. 01
    Supervised Fine-Tuning (SFT)

    Learn how instruction-following datasets are used to fine-tune a pretrained model so it produces helpful, structured responses instead of raw text completions.

  2. 02
    RLHF & Reward Modeling

    Understand how human preference data trains a reward model, which then guides policy optimization (PPO) to align model outputs with human values and preferences.

  3. 03
    DPO & Preference Optimization

    Learn Direct Preference Optimization and related methods that skip the reward model step, directly optimizing the policy from preference pairs — simpler and increasingly popular.

  4. 04
    Parameter-Efficient Fine-Tuning (LoRA, QLoRA)

    Understand how low-rank adapters let you fine-tune large models on consumer hardware by training only a small fraction of the parameters.

  5. 05
    Constitutional AI & Self-Alignment

    Explore how models can critique and revise their own outputs using a set of principles, reducing reliance on human feedback for alignment.

  6. 06
    Alignment Techniques Mastered

    You understand how pretrained models are shaped into assistants that follow instructions, respect preferences, and behave safely.

§ SECTION 05 · PROMPT ENGINEERING & APPLICATION
  1. 01
    Prompt Design Fundamentals

    Learn how to write effective prompts — system messages, few-shot examples, formatting instructions, and the impact of prompt structure on output quality.

  2. 02
    Chain-of-Thought & Reasoning

    Understand how prompting a model to think step-by-step dramatically improves performance on complex reasoning, math, and logic tasks.

  3. 03
    Retrieval-Augmented Generation (RAG)

    Learn how to ground LLM responses in external knowledge by retrieving relevant documents at inference time, reducing hallucination and enabling domain-specific answers.

  4. 04
    Tool Use & Function Calling

    Understand how LLMs can invoke external tools, APIs, and functions — turning a text generator into an agent that can take actions in the real world.

  5. 05
    LLM Agents & Orchestration

    Explore agentic architectures where LLMs plan, execute multi-step tasks, use tools, and maintain state — including frameworks like ReAct, reflection, and multi-agent systems.

  6. 06
    Structured Output & Constrained Generation

    Learn techniques for getting LLMs to produce valid JSON, XML, or code — including grammar-constrained decoding and output parsers.

  7. 07
    Practical LLM Usage Mastered

    You can design effective prompts, build RAG pipelines, and orchestrate LLM agents for real-world applications.

§ SECTION 06 · DEPLOYMENT & PRODUCTION
  1. 01
    Inference Optimization (KV Cache, Batching)

    Understand key-value caching, continuous batching, speculative decoding, and other techniques that make LLM inference fast and cost-efficient at scale.

  2. 02
    Quantization (GPTQ, AWQ, GGUF)

    Learn how reducing model precision from 16-bit to 8-bit or 4-bit dramatically cuts memory and compute costs with minimal quality loss.

  3. 03
    LLM Evaluation & Benchmarks

    Explore how to measure LLM quality — automated benchmarks (MMLU, HumanEval), LLM-as-judge, human evaluation, and the challenges of evaluating open-ended generation.

  4. 04
    Safety, Guardrails & Red-Teaming

    Understand prompt injection, jailbreaking, content filtering, and defense strategies for deploying LLMs safely in user-facing applications.

  5. 05
    Cost Management & Latency Optimization

    Learn practical strategies for managing LLM costs — caching, model routing, prompt optimization, and choosing between hosted APIs vs self-hosting.

  6. 06
    Multimodal Models (Vision, Audio)

    Explore how modern LLMs process images, audio, and video alongside text — architectures like vision transformers and cross-modal attention.

  7. 07
    Production-Ready

    You can deploy, optimize, evaluate, and secure LLM-powered applications in production environments.