This is a comprehensive RAG implementation guide that goes well beyond basic vector search setup. It walks you through the full production stack: chunking strategies with overlap and semantic boundaries, embedding selection and indexing with deduplication, hybrid search combining dense retrieval with BM25, reranking with Cohere, and proper evaluation metrics like precision@k and recall@k. The checkpoints after each step are genuinely useful, and the "must not" constraints call out common mistakes like blindly using 512 token chunks. If you're building a RAG system that needs to actually work at scale rather than just demo well, this covers the architectural decisions and implementation patterns that matter.
npx -y skills add jeffallan/claude-skills --skill rag-architect --agent claude-codeInstalls into .claude/skills of the current project.
Select a file.
cursor/plugins
metabase/metabase
metabase/metabase
telagod/code-abyss
github/awesome-copilot
DietrichGebert/ponytail