A comprehensive learning path covering LLMs from foundational ML concepts through transformer architecture, training, fine-tuning, prompt engineering, and production deployment.

§ SYLLABUS

§ SECTION 01 · FOUNDATIONS

01
Machine Learning Fundamentals
Understand supervised and unsupervised learning, loss functions, gradient descent, and overfitting. These are the building blocks every ML system relies on.
02
Neural Networks & Deep Learning
Learn how layers of neurons compose to approximate complex functions, including backpropagation, activation functions, and common architectures like MLPs and CNNs.
03
NLP Fundamentals
Explore how computers process human language — tokenization, bag-of-words, TF-IDF, and the challenges of representing meaning in text.
04
Word Embeddings
Understand how words are mapped to dense vector spaces (Word2Vec, GloVe) where semantic relationships are preserved as geometric structure.
05
RNNs & Sequence Models
Learn how recurrent neural networks process sequential data, including LSTMs, GRUs, and the encoder-decoder pattern for sequence-to-sequence tasks.
06
Foundations Complete
You now understand the core ML and NLP concepts that motivated the development of transformers and LLMs.

§ SECTION 02 · TRANSFORMER ARCHITECTURE

01
Attention Mechanism
Understand how attention lets a model dynamically focus on relevant parts of the input, solving the information bottleneck problem of fixed-size hidden states.
02
Self-Attention & Multi-Head Attention
Learn how self-attention computes query-key-value relationships within a sequence, and how multiple heads capture different types of relationships in parallel.
03
Positional Encoding
Understand how transformers inject sequence order information since self-attention is permutation-invariant — covering sinusoidal, learned, and rotary position embeddings.
04
Transformer Blocks & Architecture
See how attention, layer normalization, and feed-forward networks compose into transformer blocks, and how stacking them creates the full encoder-decoder or decoder-only architecture.
05
Encoder-Only vs Decoder-Only vs Encoder-Decoder
Compare the three transformer variants (BERT-style, GPT-style, T5-style) and understand why decoder-only models became dominant for generative LLMs.
06
Transformer Architecture Mastered
You can explain how a transformer processes text from input tokens to output probabilities, and why this architecture replaced RNNs.

§ SECTION 03 · PRETRAINING & DATA

01
Tokenization (BPE, SentencePiece, Tiktoken)
Learn how raw text is split into subword tokens that the model processes — including byte-pair encoding, unigram models, and the tradeoffs of vocabulary size.
02
Pretraining Objectives
Understand causal language modeling (next-token prediction) and masked language modeling, and why the pretraining objective shapes what the model learns.
03
Training Data & Curation
Explore where LLM training data comes from (Common Crawl, books, code), how it is filtered and deduplicated, and how data quality directly affects model capability.
04
Scaling Laws & Compute-Optimal Training
Learn the empirical relationships between model size, data size, and compute budget (Chinchilla scaling laws) that guide how to allocate resources for training.
05
Distributed Training Infrastructure
Understand data parallelism, model parallelism, pipeline parallelism, and mixed-precision training — the engineering that makes training billion-parameter models feasible.
06
Pretraining Understood
You understand how a raw transformer becomes a capable language model through large-scale pretraining on text data.

§ SECTION 04 · FINE-TUNING & ALIGNMENT

01
Supervised Fine-Tuning (SFT)
Learn how instruction-following datasets are used to fine-tune a pretrained model so it produces helpful, structured responses instead of raw text completions.
02
RLHF & Reward Modeling
Understand how human preference data trains a reward model, which then guides policy optimization (PPO) to align model outputs with human values and preferences.
03
DPO & Preference Optimization
Learn Direct Preference Optimization and related methods that skip the reward model step, directly optimizing the policy from preference pairs — simpler and increasingly popular.
04
Parameter-Efficient Fine-Tuning (LoRA, QLoRA)
Understand how low-rank adapters let you fine-tune large models on consumer hardware by training only a small fraction of the parameters.
05
Constitutional AI & Self-Alignment
Explore how models can critique and revise their own outputs using a set of principles, reducing reliance on human feedback for alignment.
06
Alignment Techniques Mastered
You understand how pretrained models are shaped into assistants that follow instructions, respect preferences, and behave safely.

§ SECTION 05 · PROMPT ENGINEERING & APPLICATION

01
Prompt Design Fundamentals
Learn how to write effective prompts — system messages, few-shot examples, formatting instructions, and the impact of prompt structure on output quality.
02
Chain-of-Thought & Reasoning
Understand how prompting a model to think step-by-step dramatically improves performance on complex reasoning, math, and logic tasks.
03
Retrieval-Augmented Generation (RAG)
Learn how to ground LLM responses in external knowledge by retrieving relevant documents at inference time, reducing hallucination and enabling domain-specific answers.
04
Tool Use & Function Calling
Understand how LLMs can invoke external tools, APIs, and functions — turning a text generator into an agent that can take actions in the real world.
05
LLM Agents & Orchestration
Explore agentic architectures where LLMs plan, execute multi-step tasks, use tools, and maintain state — including frameworks like ReAct, reflection, and multi-agent systems.
06
Structured Output & Constrained Generation
Learn techniques for getting LLMs to produce valid JSON, XML, or code — including grammar-constrained decoding and output parsers.
07
Practical LLM Usage Mastered
You can design effective prompts, build RAG pipelines, and orchestrate LLM agents for real-world applications.

§ SECTION 06 · DEPLOYMENT & PRODUCTION

01
Inference Optimization (KV Cache, Batching)
Understand key-value caching, continuous batching, speculative decoding, and other techniques that make LLM inference fast and cost-efficient at scale.
02
Quantization (GPTQ, AWQ, GGUF)
Learn how reducing model precision from 16-bit to 8-bit or 4-bit dramatically cuts memory and compute costs with minimal quality loss.
03
LLM Evaluation & Benchmarks
Explore how to measure LLM quality — automated benchmarks (MMLU, HumanEval), LLM-as-judge, human evaluation, and the challenges of evaluating open-ended generation.
04
Safety, Guardrails & Red-Teaming
Understand prompt injection, jailbreaking, content filtering, and defense strategies for deploying LLMs safely in user-facing applications.
05
Cost Management & Latency Optimization
Learn practical strategies for managing LLM costs — caching, model routing, prompt optimization, and choosing between hosted APIs vs self-hosting.
06
Multimodal Models (Vision, Audio)
Explore how modern LLMs process images, audio, and video alongside text — architectures like vision transformers and cross-modal attention.
07
Production-Ready
You can deploy, optimize, evaluate, and secure LLM-powered applications in production environments.

Large Language Models (LLMs)

§ SYLLABUS