Give your development team access to multiple AI models without sharing raw API keys or credentials.
Switch between AI providers (OpenAI, Claude, Gemini) without changing your application code.
Track and limit AI API spending per user, team, or project with automatic quota enforcement.
Build an internal AI service where different departments pay for their own model usage.
Requires Docker, database setup (SQLite or MySQL), multiple API keys from external providers, and likely backend/frontend coordination to see a working gateway.
One API is a management and distribution system for large language model APIs. Instead of having to integrate separately with OpenAI, Anthropic Claude, Google Gemini, Azure, DeepSeek, and dozens of other AI providers, each with their own authentication, request formats, and billing, One API acts as a single gateway that sits in front of all of them. Your application sends requests to One API using the standard OpenAI API format, and One API routes those requests to whichever providers and models you have configured. The key idea is centralized control over API keys and usage. As an administrator, you create "channels" (each connected to a real AI provider), then issue tokens to users or teams. Each token can have spending limits, expiry dates, allowed IP addresses, and restrictions on which models it can access. When a user's application calls One API with their token, the system handles routing, load balancing across multiple channels, automatic retry on failures, and usage tracking, all transparently. This is useful for teams or businesses that want to give developers access to AI models without sharing raw provider API keys, or for anyone who wants to switch between providers without rewriting application code. It is also used to build internal API-as-a-service setups where usage quotas and costs are tracked per user or group. The system deploys as a single binary or via Docker, connects to either SQLite or MySQL for storage, and exposes a web admin panel for managing channels, users, tokens, and viewing usage statistics. The frontend is JavaScript and the backend is Go. It supports streaming responses for real-time typewriter effects in chat interfaces, and integrates with services like Cloudflare AI Gateway for additional routing options.
Generated 2026-05-18 · Model: sonnet-4-6 · Verify against the repo before relying on details.