Advanced Information

πŸš€ Recent Advancements
NEW
BM25 Hybrid Search

Dense semantic vectors + sparse BM25 keyword vectors searched simultaneously in Qdrant. Catches exact-match terms that pure embedding search misses.

NEW
Voyage AI Rerank 2.5

Cross-encoder reranking replaces LLM-based reranking. Attends jointly to query + document β€” +8–15% retrieval accuracy, highest nDCG@10 on BEIR.

SEC
Encrypted Key Storage

API keys are Fernet-encrypted at rest. Keys are stripped from all server responses and never rendered into page source or DevTools.

NEW
PageIndex (Vectorless RAG)

Agentic reasoning-based retrieval β€” no embeddings or vector DB required. Uses tool-calling loops to locate and synthesise answers directly from document pages.

NEW
Query Planning (HyDE + Decomposition)

Multi-hop queries are decomposed into sub-questions. Hypothetical Document Embeddings (HyDE) expand sparse queries before retrieval.

UPD
Parallel Document Indexing

Batch graph embedding and parallel doc-level indexing with skip-cache. Large documents index significantly faster with fewer redundant API calls.

Current Configuration
SettingValue
RAG Mode Normal RAG
Preferred LLM gemini
Embedding Model Gemini Embedding API (768-dim)
Hybrid Search Dense + BM25 Sparse
Reranker Voyage AI Rerank 2.5 βœ“
Query Planning HyDE + Decomposition βœ“
Vector Database Qdrant Cloud
Document Collection simple_rag_docs
Graph Collection simple_rag_graph
Chunk Size 1000 chars
Chunk Overlap 200 chars
Results Count (Top K) 5
Embedding Cache Enabled
Key Encryption Fernet AES-128 at rest βœ“
Retrieval Pipeline
πŸ“„
1. Query Planning

HyDE expansion + sub-question decomposition

πŸ”
2. Hybrid Retrieval

Dense (Gemini embeddings) + Sparse (BM25) searched in parallel

🎯
3. Reranking

Voyage AI Rerank 2.5 cross-encoder re-scores the candidate pool

✨
4. Synthesis

LLM generates grounded answer from top-ranked chunks

RAG Mode Comparison
πŸ“š Normal RAG
  • Speed: Fast
  • Search: Hybrid dense + BM25
  • Best for: Direct facts, document search
  • Storage: Single vector collection
πŸ•ΈοΈ Graph RAG
  • Speed: Slower, more thorough
  • Search: Hybrid + entity graph traversal
  • Best for: Relationships, multi-hop reasoning
  • Storage: Docs + graph collections
⚑ PageIndex
  • Speed: Variable (agentic loop)
  • Search: No vectors β€” tool-based page lookup
  • Best for: Precise page-level extraction
  • Storage: No vector DB required
πŸ”’ Security
Encryption at Rest

API keys are encrypted with Fernet (AES-128-CBC + HMAC-SHA256) before being written to disk. The plaintext key never appears in the config file.

Zero Browser Exposure

Sensitive fields are stripped from every template context before rendering. Key values never appear in page source, network responses, or browser DevTools.

No Logging

Key values are never written to application logs β€” only field names are logged (e.g. "gemini_api_key updated"), never values.

Masked Form Fields

The setup form never pre-fills key inputs. Fields show "Already set β€” leave blank to keep" when a key exists, so values never transit to the browser.

About SimpleRAGx

SimpleRAGx is a multi-mode retrieval-augmented generation system with three complementary approaches to document Q&A, all backed by the same hybrid retrieval pipeline:

  • Normal RAG: Fast hybrid semantic + keyword search for direct factual queries
  • Graph RAG: Knowledge graph reasoning for complex relationships and multi-hop questions
  • PageIndex: Agentic tool-based retrieval β€” no vector DB required, works directly on document pages

All modes share the same Qdrant backend, Gemini embeddings, and Voyage AI reranker. The retrieval pipeline (query planning β†’ hybrid search β†’ reranking β†’ synthesis) is applied consistently across modes.