RAG hallucinations usually trace to retrieval: missing passages, wrong chunks ranked first, or generation that ignores retrieved text. Experts separate grounding failures from generic model fluency.
RAG hallucinations usually trace to retrieval: missing passages, wrong chunks ranked first, or generation that ignores retrieved text. Experts separate grounding failures from generic model fluency.
Clearest explanation
strong· 90
Canonical expert clip
Chosen for clarity and how directly it answers the question — not for views or hype.
Best expert explanation
"You might be missing data. You might be chunking them in the wrong way. You might be using an embedding model that isn't optimum. Maybe your retrieval strategy needs to change."
• Required fact never appears in any retrieved passage.
• Model ignores retrieved text and answers from parametric memory.
• Conflicting passages merged into one summary.
Implementation mistakes
• Tuning prompts while recall on required facts is still low.
• Assuming citations prove grounding without checking chunk relevance.
Supporting expert clips
these challenges with naive rag
strong· 88
There are blockers for actually being able to productionize these applications — and these challenges with naive RAG are exactly what teams hit before they add hybrid search, reranking, and eval loops.
•Required fact never appears in any retrieved passage.
•Model ignores retrieved text and answers from parametric memory.
•Conflicting passages merged into one summary.
•Tuning prompts while recall on required facts is still low.
•Assuming citations prove grounding without checking chunk relevance.
Implementation tradeoffs
•Reranking: Cross-encoder or LLM rerankers improve top-k quality at higher latency and inference cost.
•Regression testing: Fine-tune releases need behavior suites on fixed prompts; RAG releases need recall suites on labeled questions — teams often test only one.
•Evaluation: Offline labeled sets catch regressions early; online failure logs catch drift and long-tail queries production suites miss.
Themes repeated across indexed engineering talks and practitioner writeups — not a survey, vote count, or attributed quote roundup.
Build a RAG investigation
Save expert explanations into one investigation, compare voices, and export a shareable research brief on this device.