Knowledge Graph Rerank

Status: OFF by default — shelved. Knowledge-Graph reranking is not used in default hybrid search. The boost weight defaults to 0 and the page-level reranker was never wired into production. A 2026-06-16 ceiling spike measured zero net lift even with a Claude-quality KG: relational section relevance is not the same as entity-proximity. The rerank was shelved (Phase 4 Track A) and left dormant, not removed. Do not expect KG reranking to affect retrieval results unless it is explicitly re-enabled.

🌐 Product overview: Knowledge graph on wikantik.com — a plain-language walkthrough for readers and AI agents.

Knowledge Graph Rerank is the final stage of the Wikantik retrieval pipeline. It transforms a raw list of search results into a contextually relevant set of pages by leveraging the semantic relationships stored in the [Knowledge Graph](Knowledge Graph).

The Retrieval Pipeline

The Wikantik search engine (exposed via /api/search and the /knowledge-mcp tool retrieve_context) follows a multi-stage process:

Lexical Retrieval (BM25): A fast keyword-based search against the Lucene index.
Semantic Retrieval (Dense Vector): Cosine similarity search using embeddings stored in pgvector.
Hybrid Fusion: Combining the scores from BM25 and Vector search using Reciprocal Rank Fusion (RRF).
Graph Rerank (Final Stage): Adjusting the scores based on "Node Mention" density and co-occurrence in the graph.

How Graph Rerank Works

The reranker identifies "seed nodes" within the top-N results from the hybrid fusion stage. It then uses the KnowledgeGraphService to find co-mentioned neighbors and high-confidence relationships.

Boost by Co-mention: If a page is not in the top results but is heavily co-mentioned with multiple pages that are in the top results, its score is boosted.
Entity Density: Pages that contain a high density of relevant entities (nodes) related to the query are prioritized.
Fail-Closed Fallback: As documented in WikantikDevelopment, if the embedding service or the graph database is unreachable, the system fails closed to a BM25-only result set to ensure availability.

Configuration

Reranking behavior is controlled via wikantik-custom.properties:

# Graph reranking is OFF by default (shelved 2026-06-16). The boost weight is the
# on/off gate: 0 disables it (the default), and the rerank step is not even wired
# when boost = 0. This is the property that matters.
wikantik.search.graph.boost = 0

# The remaining graph-rerank knobs all live under wikantik.search.graph.* and are
# inert while boost = 0 — e.g. wikantik.search.graph.max-hops,
# wikantik.search.graph.weight.tier.human / .tier.machine,
# wikantik.search.graph.weight.mention.floor. See GraphRerankConfig for the full
# set and defaults. (BM25/dense fusion weights are a separate concern — they live
# under wikantik.search.hybrid.rrf.* and are documented on HybridRetrieval.)

Performance Impact

Benchmarks recorded in KnowledgeGraphExtractionBenchmarks show that while graph reranking adds approximately 15-20ms of latency, it significantly improves Recall@5 for "multi-hop" queries where the relevant information is spread across related topics rather than a single page.