Analysis updated 2026-05-18
Give Claude Desktop persistent memory across sessions using your own Postgres database instead of a cloud memory service.
Run a fleet of Hermes agents where each agent has its own scoped memory space that doesn't interfere with other agents.
Add a private, CPU-only vector search layer to an AI coding workflow in Cursor so the agent remembers past decisions.
Store conversation summaries and tagged entities for an AI assistant and retrieve relevant context at sub-10ms latency.
| codenamekt/hexus | captaingrock/krea2trainer | devopsaiguru123/awesome-agentic-devops | |
|---|---|---|---|
| Stars | 7 | 7 | 7 |
| Language | Python | Python | Python |
| Setup difficulty | moderate | hard | easy |
| Complexity | 3/5 | 4/5 | 1/5 |
| Audience | developer | designer | ops devops |
Figures from each repo's GitHub metadata at analysis time.
Requires PostgreSQL 16 with the pgvector extension enabled, a Docker Compose file is provided to spin one up.
Hexus is a memory system for AI agents that stores and retrieves information using your own PostgreSQL database instead of a paid cloud service. It converts text into vector embeddings using a small local language model (MiniLM-L6-v2), stores them in Postgres with the pgvector extension, and searches them at millisecond speed when an agent needs to remember something. Because all embedding happens locally on your CPU, there are no calls to an external AI service at search time. The project serves two kinds of users. If you are running a Hermes agent (an open-source multi-agent framework), you install Hexus as a Python plugin and it automatically gives each agent its own scoped memory space, called a theme, so separate agents do not overwrite each other's notes. If you want to give Claude Desktop, Cursor, or another MCP-compatible AI tool persistent memory, you run Hexus as a standalone Model Context Protocol server, either via Docker or by pointing Claude's config file at the Docker image. Beyond basic vector search, the tool includes hybrid search that combines vector similarity with keyword matching (BM25), a temporal decay feature that makes older memories less prominent over time, a TTL option that automatically forgets entries after a set period, entity tagging, and conversation summaries. The README reports a single embed taking 7.4 milliseconds and a top-5 recall taking 2 milliseconds on a basic CPU with no GPU. Setup requires a running PostgreSQL 16 instance with the pgvector extension enabled. A Docker Compose file is included to spin one up quickly. The Hermes plugin path is a single pip install. The MCP server path involves adding a short JSON block to Claude's desktop config file. Hexus is licensed under the BSD 3-Clause license and described as intentionally minimal: a drop-in storage and search layer you add to an existing agent setup rather than a complete agent platform.
A local Postgres-backed vector memory layer for AI agents that uses CPU-only BERT embeddings for fast, private storage and search, works as a Hermes agent plugin or a standalone MCP server for Claude Desktop and Cursor.
Mainly Python. The stack also includes Python, PostgreSQL, pgvector.
BSD 3-Clause license, use freely for commercial or personal projects as long as you retain the copyright notice and do not use the project name for endorsement.
Setup difficulty is rated moderate, with roughly 30min to a first successful run.
Mainly developer.
This repo across BitVibe Labs
Verify against the repo before relying on details.