← Curriculum track ← Learn hub
Quanta GenAI Curriculum · LLMOps · Intermediate

LLMOps Intermediate — 056: pair eval regressions on `Eval harness hardening` — memo `464599 [56]`

Lesson 056: Eval harness hardening

Focus

Prefer explicit failure rehearsals over aspirational wording. Token Eval harness hardening:56 keeps neighbouring lessons differentiable.

Key ideas

Deep dive notebook

Synthetic drill artefacts

Refusal RACI

policy_id: LLM-811
allow_when:
  confidence_gt: 0.57
refuse_when:
  - legal_hold
  - unverified_medical
owner: ethics-int

Practice

Practice Simulate degraded retrieval once; capture user-facing fallback copy. — 56 Bump 12.