Hybrid search regression after fusion deploy
anonymizedB2B SaaS copilot (anonymized)
Before
recall@10 −22% after alpha=1.0 deploy; incidents triaged via ad-hoc log grep with no retrieve span standard.
After
RRF fusion restored; investigation workflow + Slack export cut triage time; eval gate blocks regressions.
Timeline: Representative 5-day path: Day 1 rollback, Day 2 benchmark, Day 3 eval gate, Day 4–5 monitor.
Retrieval: recall@10 restored from 0.58 → 0.76 on golden set (representative benchmark).
Hallucination: Faithfulness gate reduced empty-context answers (representative −40% empty-context spans).