Lesson 045: Eval slices that compound
Focus
Assume an auditor will replay your claims, so narrate each checkpoint aloud. The token `Eval slices that compound:45` keeps neighbouring lessons distinguishable.
Key ideas
- Thread: Eval slices that compound · drill v5 · spin 261620.
- Habit: attach a trace_id to every completion you would paste into an ops dashboard.
- Guardrail: add one RACI bullet for prompt or index changes before tomorrow's standup.
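The trace_id habit above can be sketched as follows. This is a minimal illustration, not a prescribed format: the `TracedCompletion` class, the `dashboard_line` helper, and the choice of a random UUID as the identifier are all assumptions made for the example.

```python
import uuid
from dataclasses import dataclass, field


@dataclass(frozen=True)
class TracedCompletion:
    """A completion paired with an identifier (hypothetical record shape)."""
    text: str
    # Assumption: a random UUID4 hex string is an acceptable trace_id.
    trace_id: str = field(default_factory=lambda: uuid.uuid4().hex)


def dashboard_line(completion: TracedCompletion) -> str:
    """Format the completion so the trace_id travels with the pasted text."""
    return f"[trace_id={completion.trace_id}] {completion.text}"


if __name__ == "__main__":
    print(dashboard_line(TracedCompletion("Slice A passed the regression check.")))
```

Generating the identifier when the record is created, rather than when it is pasted, means the same trace_id can be matched against logs if an auditor replays the claim.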
Deep dive notebook
Synthetic drill artefacts
Agent choreography
1. Observe bucket `TENANT-11`
2. Max steps `9`
3. Tools: `retrieve, escalate_human, log_decision`
4. Stop: spend cap | keyword `STOP-4`
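One way to read the four choreography settings above is as a bounded loop. The sketch below is only an illustration under stated assumptions: the `Step` record, the `run_agent` function, the per-step `cost` field, and the numeric spend cap passed by the caller are hypothetical, since the drill names a spend cap without giving a value.

```python
from dataclasses import dataclass
from typing import Iterable

OBSERVE_BUCKET = "TENANT-11"   # from the drill
MAX_STEPS = 9                  # from the drill
ALLOWED_TOOLS = ("retrieve", "escalate_human", "log_decision")  # from the drill
STOP_KEYWORD = "STOP-4"        # from the drill


@dataclass
class Step:
    """One proposed agent action (hypothetical shape for this sketch)."""
    tool: str
    output: str
    cost: float  # assumption: each step reports its own spend


def run_agent(steps: Iterable[Step], spend_cap: float) -> dict:
    """Consume steps until a stop condition fires; report why the run ended."""
    spent = 0.0
    taken = 0
    reason = "exhausted"  # the step source ran out before any stop fired
    for step in steps:
        if taken >= MAX_STEPS:
            reason = "max_steps"
            break
        if step.tool not in ALLOWED_TOOLS:
            raise ValueError(f"tool not allowed: {step.tool}")
        spent += step.cost
        taken += 1
        if spent >= spend_cap:
            reason = "spend_cap"
            break
        if STOP_KEYWORD in step.output:
            reason = "stop_keyword"
            break
    return {"bucket": OBSERVE_BUCKET, "reason": reason, "steps": taken, "spent": spent}
```

Returning the stop reason alongside the step count gives a checkpoint that can be narrated and later replayed, which fits the auditor framing in the Focus section.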
Practice
Paste the worked template into an internal wiki stub and name owners. (45, Bump 28.)