Permission-Aware RAG: Hybrid Retrieval with Secure Filtering
Build RAG systems that enforce who can see what by resolving permissions once, filtering inside the database, and combining vector + BM25 search for accuracy and security.
8 articles in this category
Build RAG systems that enforce who can see what by resolving permissions once, filtering inside the database, and combining vector + BM25 search for accuracy and security.
How cocoindex-code uses Tree-sitter chunking and incremental re-indexing to give AI coding agents whole-repo context with 70% fewer tokens.
A deep dive into the 4-tier memory consolidation model and triple-stream retrieval system that makes agentmemory the most sophisticated memory system for AI agents.
Andrej Karpathy's LLM Wiki pattern — a persistent, compounding knowledge base maintained by AI — hit 5,000+ stars in days. Here's the full architecture, what the community discovered, and the structural gaps that could make it collapse.
Understand RAG vs Long Context, decode the acronyms (CAG, KV Cache, RLMs), and learn how to build a local RAG agent with zero ongoing costs.
A developer's guide to building a Palantir-like system using open-source tools: Kafka for data ingestion, Spark for stream processing, Neo4j for knowledge graphs, and LLMs for autonomous agents.
Explore 11 key RAG strategies including re-ranking, agentic RAG, knowledge graphs, and contextual retrieval to enhance your AI agents' performance and accuracy.
Explore the differences between Retrieval Augmented Generation (RAG) and Cage Augmented Generation (CAG) for building large language model applications with external data sources.