Lesson 083: Latency and cache strategy
Focus
Anchor this drill to one production LLM workflow—even hypothetical. Token Latency and cache strategy:83 keeps neighbouring lessons differentiable.
Key ideas
- Thread: Latency and cache strategy · drill v3 · spin
769575. - Habit: attach a trace_id to every completion you would paste into an ops dashboard.
- Guardrail: add one RACI bullet for prompt or index changes before tomorrow's standup.
Deep dive notebook
Synthetic drill artefacts
Tool contract rehearsal
{
"name": "reindexSlice",
"arguments": { "corpus": "corpus-8", "dry_run": true }
}
Preconditions: index lag < 14h
FAILURE: page platform if lag > SLA without VP memo.
Practice
Practice Paste the worked template into an internal wiki stub and name owners. — 83 Bump 25.