Master the principles, architectures, and practical techniques for building autonomous AI agents that can reason, plan, use tools, and collaborate to accomplish complex goals.

§ SYLLABUS

§ SECTION 01 · FOUNDATIONS OF AI AGENTS

01
What Are AI Agents?
Understand the defining characteristics that separate an AI agent from a simple chatbot or API call — autonomy, goal-directedness, environment interaction, and feedback loops.
02
LLM Fundamentals for Agents
Learn how large language models generate text, what the context window is, how temperature and sampling affect output, and why these properties matter when building agents.
03
Prompt Engineering for Agents
Master structured prompting techniques — system prompts, few-shot examples, chain-of-thought, and output formatting — that form the backbone of agent instruction.
04
Agent Architectures & Taxonomies
Survey the landscape of agent designs: reactive agents, deliberative agents, BDI (Belief-Desire-Intention), and modern LLM-based agent architectures.

§ SECTION 02 · REASONING & PLANNING

01
Chain-of-Thought & Structured Reasoning
Understand how breaking problems into explicit reasoning steps improves agent accuracy, and learn techniques like CoT, self-consistency, and tree-of-thought.
02
Task Decomposition & Planning
Learn how agents break complex goals into subtasks, create execution plans, and adapt plans when intermediate steps fail or produce unexpected results.
03
Reflection & Self-Critique
Explore how agents can evaluate their own outputs, detect errors, and iteratively refine their work through self-reflection loops like Reflexion and self-debugging.
04
The ReAct Pattern
Learn the Reasoning + Acting loop where agents interleave thinking steps with tool calls, observe results, and decide the next action — the dominant pattern in modern agents.

§ SECTION 03 · TOOL USE & ENVIRONMENT INTERACTION

01
Function Calling & Tool Integration
Understand how agents invoke external functions — API design for tools, JSON schema definitions, parameter extraction, and handling tool responses.
02
Code Generation & Execution
Learn how agents write and run code to solve problems, including sandboxed execution environments, error handling, and iterative debugging workflows.
03
Web Browsing & API Interaction
Explore how agents navigate websites, call REST APIs, and extract structured information from unstructured web content to accomplish real-world tasks.
04
Retrieval-Augmented Generation (RAG)
Learn how agents use vector databases and retrieval systems to access large knowledge bases, ground their responses in facts, and reduce hallucination.

§ SECTION 04 · MEMORY & STATE MANAGEMENT

01
Context Window Management
Learn strategies for managing limited context windows — summarization, sliding windows, priority-based truncation, and when to offload information to external storage.
02
Short-Term & Working Memory
Understand how agents maintain conversation state, scratch pads, and intermediate results within a single task execution session.
03
Long-Term Memory & Knowledge Persistence
Explore techniques for agents to remember information across sessions — vector stores, structured databases, memory retrieval strategies, and memory consolidation.

§ SECTION 05 · MULTI-AGENT SYSTEMS

01
Multi-Agent Collaboration Patterns
Learn common patterns for multiple agents working together — supervisor/worker, debate, ensemble, pipeline, and blackboard architectures.
02
Inter-Agent Communication
Understand how agents exchange information — message passing protocols, shared state, structured handoffs, and maintaining coherence across agent boundaries.
03
Role Specialization & Delegation
Explore how to design agents with focused expertise and route tasks to the right specialist — creating systems that are more capable than any single agent.

§ SECTION 06 · FRAMEWORKS & IMPLEMENTATION

01
LangChain & LangGraph
Learn the LangChain ecosystem for building agent chains and LangGraph for creating stateful, graph-based agent workflows with cycles and branching.
02
Anthropic Agent SDK & Claude Tool Use
Build agents using Claude's native tool use, the Anthropic SDK, and the Claude Agent SDK — including structured outputs, streaming, and extended thinking.
03
OpenAI Agents SDK
Explore OpenAI's approach to agent building with their Agents SDK, Assistants API, function calling, and code interpreter tools.
04
Building Custom Agent Loops
Learn to build agents from scratch without frameworks — understanding the core loop of prompt → LLM call → parse → tool execution → observe → repeat.

§ SECTION 07 · SAFETY, EVALUATION & PRODUCTION

01
Guardrails & Safety Boundaries
Learn how to constrain agent behavior — input/output validation, action allowlists, human-in-the-loop approvals, and preventing prompt injection attacks.
02
Agent Evaluation & Benchmarks
Understand how to measure agent performance — task completion rates, tool use accuracy, reasoning quality, and established benchmarks like SWE-bench and GAIA.
03
Observability & Debugging Agents
Learn to trace agent execution, log reasoning chains, monitor tool calls, and debug failures in complex multi-step agent workflows.
04
Cost & Latency Optimization
Explore strategies for managing token usage, caching, model routing (using smaller models for simple steps), and reducing end-to-end agent latency.
05
Human-in-the-Loop Workflows
Design systems where agents escalate to humans for approval, clarification, or quality review — balancing autonomy with oversight for high-stakes tasks.

Agentic AI

§ SYLLABUS