explaingit

samvardhan03/omnipulse

15TypeScriptAudience · researcherComplexity · 5/5ActiveLicenseSetup · hard

TLDR

Audio fingerprinting pipeline for IP and compliance work. Wavelet scattering math in C++/CUDA, Rust HNSW index, MCP server over JSON-RPC, and a Python agent driven by Claude.

Mindmap

mindmap
  root((Omnipulse))
    Inputs
      WAV audio clip
      Anthropic API key
      Fingerprint database
    Outputs
      Audio fingerprint
      Nearest match
      License check result
    Use Cases
      Detect generated audio
      License lookup
      IP rights enforcement
    Tech Stack
      C++
      CUDA
      Rust
      Python
      MCP
      HNSW

Things people build with this

USE CASE 1

Fingerprint short audio clips for fast nearest-neighbor lookup against a rights database

USE CASE 2

Run audio identity checks behind a Claude-driven agent over MCP

USE CASE 3

Build a compliance pipeline that compares generated audio against owned content

Tech stack

C++CUDARustPythonMCPHNSW

Getting it running

Difficulty · hard Time to first run · 1day+

Requires CUDA-capable GPU (Hopper-class quoted for benchmarks), Rust toolchain, Python, and an ANTHROPIC_API_KEY before the agent works.

Apache 2.0, free to use, modify, and redistribute commercially, with an explicit patent grant and a requirement to keep the copyright notice.

In plain English

OmniPulse is positioned as compliance and intellectual property infrastructure for the age of generative media. The basic idea is to take a piece of audio (the README focuses on 1 second clips of 44.1 kHz audio) and turn it into a mathematical fingerprint that can be stored, searched, and compared against a database, so that licensing and ownership claims can be checked quickly. The author describes performance numbers like fingerprint latency under 40 milliseconds on a Hopper-class GPU and sustained 15 GB per second host to device transfer. The project is a monorepo split into four modules that each ship to their respective package registries. omni-wst-core is a C++ and CUDA Python wheel that contains the digital signal processing math, based on Morlet wavelet scattering transforms. omni-ffi is a Rust crate that calls into the C++ side using the cxx crate, passing raw pointers so no data is marshalled or copied across the boundary. omnipulse-rs is a Rust workspace with a vector-index crate for HNSW nearest neighbor search, a sliced-wasserstein crate for comparing fingerprint distributions, and an MCP server binary called omnipulse-mcp that speaks line-delimited JSON-RPC 2.0 over stdio. omnipulse-agent is a Python package that uses an Anthropic-driven router to drive the whole pipeline. Data moves between the Python and Rust sides through POSIX shared memory, with a 28 character SHA3-256 hex name that fits the macOS shared memory name length limit. The audio buffer is written once and never re-serialized. The README emphasizes that there is no HTTP, no gRPC, no protobuf, and no TLS in this design. To try it, the README says to pip install omni-wst-core, cargo install omnipulse-mcp, pip install omnipulse-agent, set the ANTHROPIC_API_KEY environment variable, and run the agent against a WAV file. The license is Apache 2.0.

Copy-paste prompts

Prompt 1
Install omni-wst-core, omnipulse-mcp, and omnipulse-agent and fingerprint a WAV in Omnipulse end to end
Prompt 2
Walk me through the POSIX shared memory handoff between Python and Rust in Omnipulse and why it avoids copies
Prompt 3
Build a HNSW index in Omnipulse from 10000 audio fingerprints and benchmark query latency
Prompt 4
Wire the Omnipulse MCP server into Claude Desktop and run a license lookup against a sample clip
Open on GitHub → Explain another repo

Generated 2026-05-22 · Model: sonnet-4-6 · Verify against the repo before relying on details.