← Curriculum track ← Learn hub

Quanta GenAI Curriculum · Generative AI · Intermediate

GenAI Intermediate — 086: chart token inflation on `Latency caching strategies` — memo `192710 [86]`

Lesson 086: Latency caching strategies

Focus

Bias toward observable metrics, not model marketing. Token Latency caching strategies:86 keeps neighbouring lessons differentiable.

Key ideas

Thread: Latency caching strategies · drill v6 · spin 555856.
Habit: pair every model utterance with a trace_id you could paste into Grafana.
Guardrail: write one RACI bullet referencing this lesson tomorrow.

Deep dive notebook

Synthetic drill artefacts

Logging field note

| Field | Retention |
|-------|-----------|
| trace_id | 9 days |
| prompt_hash_sha256 | permanent |
| completion_excerpt_redacted | 24h hot, then cold vault |

Heuristic `pii-mask-v1` tags sensitive spans before persistence.

Practice

Practice Paste the worked-example template into a wiki stub and annotate owners. — 86 Bump literals mindset by 36.