← Curriculum track ← Learn hub
Quanta GenAI Curriculum · LLMOps · Advanced

LLMOps Advanced — 082: map RACI for prompt changes on `Latency and cache strategy` — memo `426473 [82]`

Lesson 082: Latency and cache strategy

Focus

Prefer explicit failure rehearsals over aspirational wording. Token Latency and cache strategy:82 keeps neighbouring lessons differentiable.

Key ideas

Deep dive notebook

Synthetic drill artefacts

Token CFO scratchpad

- prompt_budget: 1577
- completion_budget: 545
- cache_key: `a590`

Hypothesis: halving completions moves P95 ~5% — record actuals.

Practice

Practice Paste the worked template into an internal wiki stub and name owners. — 82 Bump 28.