Principle | Design Guidance | Observable Signals | Common Pitfalls |
---|---|---|---|
Embedding Quality | Align embedding model with corpus domain; monitor drift and re-train quarterly for dynamic data. |
NDCG@10 ≥ 0.82
Mean cosine ≥ 0.75
|
Mixed encoders per class, unnormalized vectors, ignoring prompt leakage. |
Index Fit | Match ANN family (graph, IVF, disk-based) to recall target and memory budget before adding replicas. |
Recall@50 ≥ 0.92
P95 latency ≤ 120 ms
|
Default parameters (ef, nlist, m) kept static despite growth. |
Hybrid Retrieval | Blend dense similarity with lexical or metadata filters for compliance and explainability. |
Filter hit rate ≥ 70%
CTR lift 10–18%
|
Combining scores without normalization; filter-first queries that break ANN heuristics. |
Freshness SLAs | Use dual write path (OLTP + queue) and incremental index builders to keep staleness under target. |
Age p95 ≤ 5 min
Import success ≥ 99.9%
|
Bulk-only ingestion, missing dead-letter queues, absent idempotency. |
← Home
Vector Database Engineering
Design guidance, index selection trade-offs, and operational benchmarks for production-grade vector search systems.
GenAI Community
join.maxpool.dev →