quivrhq/quivr

★ 39,152PythonAudience · developerComplexity · 3/5QuietSetup · moderate

Things people build with this

USE CASE 1

Build an internal knowledge-base chatbot that answers questions about your company's documentation and policies.

USE CASE 2

Create a customer support bot that retrieves answers from your help docs and product guides.

USE CASE 3

Add a Q&A feature to your app so users can ask questions about uploaded files or documents.

USE CASE 4

Index and search through PDFs, Markdown, and text files without writing custom retrieval logic.

Tech stack

PythonOpenAIAnthropicMistralOllamaPGVectorFaissDocker

Getting it running

Difficulty · moderate Time to first run · 30min

Requires API keys for LLM providers (OpenAI/Anthropic/Mistral) or local Ollama setup; vector DB choice (PGVector needs Postgres, Faiss is local) affects initial config.

License could not be detected automatically. Check the repository's LICENSE file before use.

In plain English

Quivr is a Python framework that helps developers add AI-powered question-and-answer capabilities to their own applications without building the underlying plumbing from scratch. The problem it solves is that setting up a "RAG" system, Retrieval-Augmented Generation, where an AI answers questions based on your own documents rather than only its training data, normally requires assembling many moving parts: a document parser, a vector database for storing searchable chunks of text, an LLM (large language model) integration, a conversation history manager, and a retrieval pipeline. Quivr bundles all of that into one opinionated, pre-configured package. How it works: you install the quivr-core Python package, point it at your files (PDFs, text files, Markdown, etc.), and it creates a "brain", an indexed, searchable representation of that content. When a user asks a question, Quivr retrieves the most relevant chunks from the index, passes them to an LLM (such as GPT-4, Groq, or a local Ollama model), and returns a grounded answer. The retrieval pipeline is configurable via YAML files: you can swap in different vector stores (PGVector or Faiss), plug in a reranker (a model that re-sorts retrieved chunks by relevance), and tune parameters like conversation history depth and temperature. You would reach for Quivr when building an internal knowledge-base chatbot, a customer support bot that answers from your documentation, or any product feature where users should be able to ask questions about specific files. Rather than writing the indexing, chunking, and retrieval logic yourself, you get a working pipeline in roughly five lines of code and configure it from there. The tech stack is Python 3.10+, with optional integrations for OpenAI, Anthropic, Mistral, and Ollama APIs, and vector storage via PGVector (PostgreSQL extension) or Faiss (Facebook's similarity search library). Docker is supported for deployment.

Copy-paste prompts

Prompt 1

Show me how to set up Quivr to index a folder of PDFs and answer questions about them in Python.

Prompt 2

How do I configure Quivr to use a local Ollama model instead of OpenAI for the LLM?

Prompt 3

Walk me through creating a simple chatbot with Quivr that retrieves answers from my documentation.

Prompt 4

How do I swap the vector database in Quivr from Faiss to PGVector and what's the YAML config?

Prompt 5

Show me how to add a reranker to Quivr to improve the relevance of retrieved chunks.

Open on GitHub → Explain another repo

Generated 2026-05-18 · Model: sonnet-4-6 · Verify against the repo before relying on details.