RAG from First Principles · Part 12

The cost / latency / quality triangle

One RAG request, four stages, two bars each. Toggle the levers and watch a corner move.

Optimizations

Per request, by stage