Wikantik Hybrid Search Architecture

Wikantik features a high-precision, multi-stage retrieval pipeline designed to serve both humans and AI agents. It goes beyond simple keyword matching by fusing lexical, semantic, and relational data.

1. The Retrieval Pipeline

When a query is submitted (via /api/search or the retrieve_context tool), it undergoes four distinct phases:

A. Lexical Retrieval (BM25)

The first stage uses Apache Lucene to perform a classic BM25 search.

Engine: com.wikantik.search.subsystem.lucene
Strength: Excellent at finding exact matches, technical terms, and unique identifiers (e.g., "RTO", "SPIA").
Scope: Indexes page titles, content body, and frontmatter keywords.

B. Dense Retrieval (pgvector)

In parallel, the query is converted into a high-dimensional vector.

Engine: com.wikantik.search.hybrid.DenseRetriever
Pipeline: The query is embedded (typically 768d via nomic-embed-text) and compared against chunked content in the content_chunk_embeddings table.
Strength: Captures semantic meaning and intent, finding relevant content even when keywords do not match exactly.

C. Hybrid Fusion (RRF)

The results from BM25 and Dense retrieval are combined using Reciprocal Rank Fusion (RRF).

Mechanism: Merges the ranked lists by summing the inverse ranks: $1/(k + \text{rank})$ .
Resilience: The system is designed to "fail-safe." If the embedding service is down, the fuser automatically collapses to the lexical result, ensuring search remains functional.

D. Knowledge Graph Reranking

KG reranking is off by default (boost=0, never wired into production; shelved 2026-06-16 after a measured zero-lift ceiling spike). See KnowledgeGraphRerank.

2. The Embedding Infrastructure

Wikantik's dense search depends on a chunk-and-embed pipeline:

Chunking: Pages are split into logical chunks (usually by headings) during the save process.
Embedding: Chunks are processed by the EmbeddingClient (communicating with Ollama, OpenAI, or TEI).
Storage: Vectors are stored in PostgreSQL with an HNSW index for fast $O(\log n)$ nearest-neighbor retrieval.

Search quality is measured by a standalone utility in the wikantik-tools module.

Tool: bin/search-eval (backed by ExperimentEvaluator.java).
Harness: Runs a set of 40+ "Ideal Queries" against the running wiki.
Metrics: Reports Recall@5, Recall@20, and Mean Reciprocal Rank (MRR).
Baseline: Results are committed to eval/ (e.g., grand-finale.txt) to ensure retrieval performance never regresses during refactoring.

See Also:

Building Industrial Search Systems — The underlying theory.
Evaluating Retrieval Quality — Deep dive into the math of MRR.
Knowledge Graph Extraction — How the reranking data is generated.