These posts document design decisions, trade-offs, and failure modes encountered while building Retrieval-Augmented Generation (RAG) systems under real-world constraints.
The focus is on:
- retrieval precision over model size
- robustness over convenience
- understanding why systems fail
Many of these notes relate directly to MQNotebook, but the principles generalize to any production-grade RAG pipeline.