explaingit

egoist-machines/lodedb

Analysis updated 2026-05-18

60PythonAudience · developerComplexity · 3/5LicenseSetup · easy

TLDR

A fast, compact embedded vector database for Python that runs in-process with no server, stores vectors on disk with sub-millisecond writes, and integrates directly with LangChain, LlamaIndex, and mem0.

Mindmap

mindmap
  root((LodeDB))
    What it does
      Embedded vector DB
      No server needed
      Multimodal search
    Performance
      Sub-ms writes
      GPU batch search
      Compact on-disk
    Integrations
      LangChain
      LlamaIndex
      mem0
      MCP server
    Extras
      CLIP image search
      Swift iOS bindings
      CLI tool
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

What do people build with it?

USE CASE 1

Replace the default vector store in a LangChain or LlamaIndex RAG app with LodeDB for faster search and a smaller disk footprint

USE CASE 2

Add persistent local memory to a Claude Code or Cursor session by registering LodeDB's MCP server with one command

USE CASE 3

Build a multimodal image and text search feature that runs entirely on-device with no API keys using LodeDB's CLIP mode

USE CASE 4

Speed up batched vector search in a GPU-enabled environment by running LodeDB's GPU-resident index on CUDA

What is it built with?

PythonRustCUDAONNX RuntimeCLIPPyTorch

How does it compare?

egoist-machines/lodedb0xh4ku/manga-pdf-to-epubayyouboss0011/sherlockmaps
Stars606060
LanguagePythonPythonPython
Setup difficultyeasymoderatemoderate
Complexity3/52/53/5
Audiencedevelopergeneraldata

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · easy Time to first run · 5min

GPU batch search requires Linux with CUDA, the base pip install works on CPU on Linux, macOS, and Windows with no compilation.

Apache 2.0 -- use freely for any purpose including commercial, with attribution and license notice preserved.

In plain English

LodeDB is an embedded vector database designed to store and search through text, images, and other data that has been converted into numerical vectors, which is the format many AI applications use for memory and search. It runs directly inside your Python program with no separate server or account required, and stores everything on disk in a compact format. The core use case is as a memory backend for AI frameworks like LangChain, LlamaIndex, and mem0, which are tools developers use to build AI applications that need to remember things. LodeDB positions itself as a faster and smaller replacement for the default stores those frameworks ship with. The benchmarks in the README show large speed advantages over alternatives like ChromaDB, Qdrant, and sqlite-vec, with much smaller on-disk file sizes. A few things make it distinctive among embedded options. Writes are incremental: instead of rewriting the entire file on each save, it only writes the rows that changed, so adding a memory entry takes under a millisecond even with a million existing entries. On machines with an NVIDIA GPU, batch search can run on the GPU instead of the CPU, reaching tens of thousands of searches per second. Storage is compressed using low-precision vector codes plus zstd compression on retained text. All data stays local by default with no telemetry on raw content. Beyond text, LodeDB supports multimodal search: you can index both images and text in the same database using a CLIP model and search across both types at once. It also ships adapters for the major AI frameworks, a command-line tool, an MCP server for coding assistants like Claude Code and Cursor, and native Swift bindings for iOS and macOS apps. Installation is a single pip command with prebuilt wheels for Linux, macOS, and Windows, requiring no compilation. Optional extras add GPU acceleration, image search, and framework adapters. The full README is longer than what was shown.

Copy-paste prompts

Prompt 1
Migrate my existing LangChain InMemoryVectorStore to LodeDB. Show me the lodedb migrate command, what options I need, and how to verify the migration succeeded
Prompt 2
Add LodeDB as the memory backend for a LlamaIndex RAG pipeline. Show me the Python code to swap the SimpleVectorStore for LodeDB's VectorStore adapter
Prompt 3
Register LodeDB as an MCP server in Claude Code using lodedb mcp install, then store and retrieve a few test embeddings to confirm it works
Prompt 4
I want to index 10,000 product images and search them by text description. Show me how to install the image extra and write the indexing and search code with model='clip'
Prompt 5
Run lodedb doctor on my machine, explain what each check means, and show me how to fix the CUDA GPU embedding issue on Windows

Frequently asked questions

What is lodedb?

A fast, compact embedded vector database for Python that runs in-process with no server, stores vectors on disk with sub-millisecond writes, and integrates directly with LangChain, LlamaIndex, and mem0.

What language is lodedb written in?

Mainly Python. The stack also includes Python, Rust, CUDA.

What license does lodedb use?

Apache 2.0 -- use freely for any purpose including commercial, with attribution and license notice preserved.

How hard is lodedb to set up?

Setup difficulty is rated easy, with roughly 5min to a first successful run.

Who is lodedb for?

Mainly developer.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub egoist-machines on gitmyhub

Verify against the repo before relying on details.