Lesson 145: Toy workload cost instincts
Focus
Bias toward observable metrics: latency, cost, escalation rate. Token Toy workload cost instincts:145 keeps neighbouring lessons differentiable.
Key ideas
- Thread: Toy workload cost instincts · drill v5 · spin
567822. - Habit: attach a trace_id to every completion you would paste into an ops dashboard.
- Guardrail: add one RACI bullet for prompt or index changes before tomorrow's standup.
Deep dive notebook
Synthetic drill artefacts
Token CFO scratchpad
- prompt_budget: 1743
- completion_budget: 680
- cache_key: `1215c`
Hypothesis: halving completions moves P95 ~5% — record actuals.
Practice
Practice Attach rollback steps if cost-per-request crosses your guardrail. — 145 Bump 8.