Interpreting Hybrid Retrieval Metrics
Wikantik publishes a small set of Prometheus metrics covering structural
index health, for-agent projection size, and (post-Phase 5) retrieval
quality. They're all under the `wikantik_` prefix.
When to use this runbook
When you have a `/metrics` scrape and want to convert numbers into
verdicts.
Context
- **Phase 1 (shipped):** `wikantik_structural_index_{pages_total,clusters_total,tags_total,unclaimed_total,lag_seconds}` — gauges. Plus `wikantik_structural_index_rebuild_duration_seconds` (timer) for rebuild cost.
- **Phase 2 (shipped):** `wikantik_for_agent_response_bytes` — DistributionSummary with percentile histogram. Records every projection's serialised size.
- **Phase 5 (planned):** `wikantik_retrieval_{ndcg_at_5,ndcg_at_10,recall_at_20,mrr}{set,mode}` — gauges, plus `wikantik_retrieval_run_duration_seconds` (histogram) and `wikantik_retrieval_run_failed_total` (counter).
Walkthrough
The frontmatter `steps` walk the metric set in priority order: index
health first (the foundation), projection size second (the agent
contract), retrieval quality third (the eventual signal).
Pitfalls
The frontmatter `pitfalls` capture the recurring misreads. The
"comparing absolute numbers across deploys" trap is especially common —
agents take a snapshot of metrics, deploy a different corpus, then
report that "retrieval got worse" when the page count just shifted.