SESSION 11

Session 11 — RAG (Retrieval Augmented Generation)

Chunk documents, embed them, build a vector store, add BM25, and answer questions with Claude.

3 hours•6 exercises · 3 phases

What you'll be able to do by the end

&check; Chunk documents three ways: by size, by sentence, and by structure
&check; Turn text into embeddings and compute cosine similarity
&check; Build a vector store and perform semantic search
&check; Implement BM25 keyword search from scratch
&check; Combine semantic and keyword search with Reciprocal Rank Fusion
&check; Build a complete RAG pipeline that answers questions grounded in your documents

Prerequisites

Python 3.10+ with a virtual environment OPENAI_API_KEY for embeddings (Tutorials 02-06)ANTHROPIC_API_KEY for Claude answers (Tutorial 06)The file report.md in your RAG folder

The 3-phase arc

Phase 1 puts documents into a searchable index. Phase 2 adds a second retrieval method and merges them. Phase 3 wires retrieval to Claude for grounded answers.

Phase 1

Index

Phase 2

Retrieve

Phase 3

Generate