Find the best RAG pipeline configuration for your company's PDF knowledge base by running automated trials
Auto-generate evaluation question-answer pairs from a document corpus without writing them by hand
Compare retrieval methods and embedding models across your documents without manual benchmarking
Export the winning pipeline configuration and deploy it to production once optimization is complete
Requires an LLM API key (such as OpenAI) for automatic QA generation and evaluation, optimization trial time grows with corpus size.
RAG (Retrieval-Augmented Generation) is a technique for making an AI language model answer questions about your own documents. Instead of the AI relying solely on what it learned during training, a retrieval system first searches your documents for relevant passages, and those passages are fed to the model as context when it generates an answer. The challenge is that building a good RAG system involves many decisions: how to split documents into chunks, which text embedding model to use, which retrieval method to apply, and more. Different combinations work better for different types of data. AutoRAG is a Python tool that automates the process of finding which combination of components works best for a specific dataset. You provide two things: a corpus (your documents) and a QA dataset (sample questions with correct answers drawn from those documents). AutoRAG then runs your documents through many pipeline combinations, measures how accurately each one answers the sample questions using standard evaluation metrics, and reports which pipeline performed best. The tool includes utilities for building those inputs from raw documents. A parser step converts PDFs and other file formats into text. A chunking step splits that text into segments. A QA creation step uses a language model to generate sample questions and answers from your corpus automatically, so you do not need to write evaluation data by hand. Once optimization is complete, AutoRAG provides a dashboard to compare results across pipeline variants and can export the best-performing configuration for deployment. Interactive demos are available on Hugging Face Spaces for trying the tool without any local installation. The library is released under the Apache 2.0 license.
← marker-inc-korea on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.