Home / Technology / Karpathy's "LLM Knowledge Bases" Rethink AI Memory
Karpathy's "LLM Knowledge Bases" Rethink AI Memory
4 Apr
Summary
- AI now uses Markdown files as a 'source of truth' for knowledge.
- This new approach bypasses complex vector databases.
- It aims to create self-healing, auditable, and human-readable AI memory.

Andrej Karpathy is pioneering a new AI development paradigm called "LLM Knowledge Bases." This system leverages Markdown files as the primary source of truth, allowing AI models to function as self-sufficient research librarians. Instead of relying on complex vector databases, this method utilizes the LLM's inherent ability to process structured text, creating a more accessible and auditable knowledge repository.
The "LLM Knowledge Bases" approach offers a significant departure from the dominant Retrieval-Augmented Generation (RAG) model. By treating Markdown files as the core, Karpathy’s system ensures that AI-generated information is traceable and human-readable, avoiding the 'black box' issue associated with vector embeddings. This methodology has sparked interest in enterprise applications, suggesting a future where unstructured company data can be synthesized into a continuously updating 'Company Bible.'
While currently a collection of scripts, the implications are vast. This "file-over-app" philosophy empowers users with data sovereignty, positioning AI as a sophisticated editor rather than a proprietary tool. The system's scalability is notable, with Karpathy demonstrating its effectiveness for projects up to 100 articles. This approach may also pave the way for synthetic data generation and model fine-tuning, transforming personal research projects into custom, private intelligences.