Analysis updated 2026-05-18
Fork the repo to track how free AI model speeds and reliability change over time without managing any servers.
Use the leaderboard to pick the fastest free model for your current prototype before committing to a paid API.
Compare two specific models head-to-head to decide which one to use in a side project.
| saif658/llmstats | a-bissell/unleash-lite | abhiinnovates/whatsapp-hr-assistant | |
|---|---|---|---|
| Stars | 1 | 1 | 1 |
| Language | Python | Python | Python |
| Setup difficulty | easy | hard | hard |
| Complexity | 2/5 | 4/5 | 3/5 |
| Audience | developer | researcher | developer |
Figures from each repo's GitHub metadata at analysis time.
Requires three free API keys (OpenRouter, Groq, Mistral) added as GitHub repository secrets before the first run.
LLMstats is a tool that automatically tests and compares 47 AI language models from three free-tier providers, then publishes the results to a live public dashboard that refreshes roughly every three hours. The providers covered are OpenRouter (which routes to models from OpenAI, Meta, NVIDIA, Qwen, Google, and others), Groq, and Mistral. The entire system runs on GitHub infrastructure at no cost. A scheduled job fires every three hours, sends test requests to each of the 47 models, records how fast they respond and whether they succeed, then saves the results. A companion job builds a static website from those results and publishes it to GitHub Pages. There is no server to manage or database to maintain. The live dashboard has five views. The Overview shows summary cards and success-rate trend charts. The Leaderboard ranks models by a composite score and lets you sort by speed, throughput, or reliability, with a chip showing which provider each model comes from. The Explorer lets you drill into a single model to see its response-time history and error breakdown. The Timeline shows the history of each three-hour run. The Compare view lets you pick two models and see them side by side. Anyone can fork the repository and run the same dashboard for their own data. Setup takes fewer than five minutes: fork the repo, add three free API keys as GitHub repository secrets, enable GitHub Pages, and trigger the first benchmark manually. From that point the cron job keeps the data current automatically. The project uses Python for the benchmark runner and plain HTML plus JavaScript for the dashboard. The architecture was originally inspired by a similar project called NIMStats, rebuilt to cover multiple providers side by side.
A zero-infrastructure tool that benchmarks 47 free AI models from OpenRouter, Groq, and Mistral every 3 hours via GitHub Actions and publishes a live comparison dashboard to GitHub Pages.
Mainly Python. The stack also includes Python, GitHub Actions, GitHub Pages.
MIT license, use freely for any purpose, including commercial, as long as you keep the copyright notice.
Setup difficulty is rated easy, with roughly 5min to a first successful run.
Mainly developer.
This repo across BitVibe Labs
Verify against the repo before relying on details.