The beginner capstone — build a complete Retrieval-Augmented Generation pipeline that combines everything from the last four days: vector search, knowledge graphs, and LLMs working together.
Retrieval-Augmented Generation (RAG) is a pattern that gives an LLM access to external knowledge at query time. Instead of relying only on what the model memorized during training, RAG retrieves relevant documents on demand and passes them into the prompt — so the model can answer about your data, fresh data, and private data.
┌────────────┐
question ───────▶│ embed │──▶ query vector
└────────────┘
│
▼
┌─────────────────┐
│ vector store │ ──▶ top-K chunks
└─────────────────┘
│
▼
┌─────────────────────────┐
│ build prompt with │
│ system + chunks + Q │
└─────────────────────────┘
│
▼
┌────────┐
│ LLM │ ──▶ grounded answer
└────────┘
| Use case | RAG | Fine-tune |
|---|---|---|
| Answer questions about a knowledge base | ✓ | |
| Frequent content updates | ✓ | |
| Enforce a tone/format/style | ✓ | |
| Specialize for a narrow domain task | weak | ✓ |
| Combine multiple sources at query time | ✓ | weak |
A common production pattern is both: fine-tune for style and behavior; RAG for fresh, citable facts.
It still relies on:
This is the capstone because RAG braids everything you've learned.