Lesson 043: Evaluation habits that compound
Focus
Treat placeholders as compulsory: swap nouns immediately after reading. The token `Evaluation habits that compound:43` keeps neighbouring lessons distinguishable.
Key ideas
- Thread: Evaluation habits that compound · drill v3 · spin 298376.
- Habit: pair every model utterance with a trace_id you could paste into Grafana.
- Guardrail: by tomorrow, write one RACI bullet that references this lesson.
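The habit above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed implementation: the names `TracedUtterance` and `log_line` are hypothetical, and generating the ID with `uuid4` is an assumption, since the lesson does not say where the trace_id comes from or what format the dashboard expects.

```python
import uuid
from dataclasses import dataclass, field


@dataclass(frozen=True)
class TracedUtterance:
    """One model utterance paired with a trace_id (hypothetical helper)."""

    text: str
    # Assumption: a random 32-character hex ID is an acceptable trace_id.
    trace_id: str = field(default_factory=lambda: uuid.uuid4().hex)


def log_line(utterance: TracedUtterance) -> str:
    """Render a key=value line that can be searched by trace_id."""
    return f"trace_id={utterance.trace_id} text={utterance.text!r}"


if __name__ == "__main__":
    reply = TracedUtterance("The refund was issued on Monday.")
    print(log_line(reply))
```

Keeping the ID on the same object as the text means the pairing cannot be dropped by accident when the utterance is passed between functions.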
Deep dive notebook
Synthetic drill artefacts
Agent choreography card
1. Observe transcripts bucket `BUCKET-14`
2. Budget steps `7`
3. Tool whitelist: `retrieve_docs, escalate_human, log_decision`
4. Hard stop triggers: hallucination_budget | escalation keyword `URGENT-4`
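The card above could be encoded roughly as follows. This is a sketch under stated assumptions: `ChoreographyCard`, `tool_allowed`, and `stop_reason` are hypothetical names, the card gives no numeric value for the hallucination budget (so the caller must supply one), and treating an exhausted step budget as a stop condition is an interpretation of item 2 rather than something the card states.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ChoreographyCard:
    # The card names this trigger but gives no number, so it has no default.
    hallucination_budget: int
    transcripts_bucket: str = "BUCKET-14"
    step_budget: int = 7
    tool_whitelist: frozenset = frozenset(
        {"retrieve_docs", "escalate_human", "log_decision"}
    )
    escalation_keyword: str = "URGENT-4"


def tool_allowed(card: ChoreographyCard, tool: str) -> bool:
    """Only whitelisted tools may be called."""
    return tool in card.tool_whitelist


def stop_reason(
    card: ChoreographyCard, steps_taken: int, hallucinations: int, last_message: str
) -> Optional[str]:
    """Return the name of the trigger that fired, or None to keep going."""
    if hallucinations > card.hallucination_budget:
        return "hallucination_budget"
    if card.escalation_keyword in last_message:
        return "escalation_keyword"
    # Assumption: running out of budgeted steps also ends the run.
    if steps_taken >= card.step_budget:
        return "step_budget"
    return None


if __name__ == "__main__":
    card = ChoreographyCard(hallucination_budget=2)  # 2 is an illustrative value
    print(tool_allowed(card, "retrieve_docs"))
    print(stop_reason(card, steps_taken=3, hallucinations=0, last_message="URGENT-4"))
```

Returning the trigger name instead of a bare boolean keeps the reason for the stop available to whatever logs the decision.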
Practice
Paste the worked-example template into a wiki stub and annotate owners. 43: Bump literals mindset by 14.