Upload Documents
Upload documents to index them using your preferred RAG mode.
Current Default RAG Mode:
Normal RAG
You can override this setting for individual uploads below.
You can override this setting for individual uploads below.
Normal RAG Process
- Text extraction from document
- Split into overlapping chunks
- Generate embeddings for chunks
- Store in vector database
Graph RAG Process
- Text extraction from document
- Extract entities and relationships
- Build knowledge graph
- Generate embeddings for graph elements
- Store both chunks and graph in database
Note: Graph RAG processing takes significantly longer than Normal RAG due to entity extraction and relationship mapping, but provides much richer contextual understanding.
🌲
PageIndex RAG — How It Works
NEW — Vectorless
Index Phase (one-time, ~2–5 min)
- LLM reads the full document
- Builds a hierarchical tree (like a Table of Contents)
- Each node gets a title, page range, and summary
- Tree saved as JSON — no vector DB needed
Query Phase (~10–20 s)
- LLM agent inspects the tree structure
- Reasons about which sections are relevant
- Fetches exact page text (tight ranges only)
- Synthesises answer with traceable page citations
Best for: Annual reports, SEC filings, research papers, legal manuals, technical specifications — any long document where exact section navigation matters.
PageIndex achieved 98.7% accuracy on the FinanceBench benchmark.