maxforai/tokenless

Analysis updated 2026-06-24

★ 32JavaScriptAudience · vibe coderComplexity · 2/5LicenseSetup · easy

Mindmap

mindmap
  root((Tokenless))
    Inputs
      Claude Code sessions
      File reads and tool outputs
      User prompts
    Outputs
      Compact read packets
      Shorter chat replies
      Local artifact store
    Use Cases
      Cut vibe coding costs
      Trim chat verbosity
      Reduce agent context bloat
    Tech Stack
      Node
      npm
      Claude Code hooks

mindmap root((Tokenless)) Inputs Claude Code sessions File reads and tool outputs User prompts Outputs Compact read packets Shorter chat replies Local artifact store Use Cases Cut vibe coding costs Trim chat verbosity Reduce agent context bloat Tech Stack Node npm Claude Code hooks

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Cut Claude Code monthly bill by switching to the coding profile for vibe sessions

USE CASE 2

Keep chat replies short during natural Q and A with the chat profile

USE CASE 3

Store large file reads as local artifacts so they stop bloating future requests

USE CASE 4

Disable Task and Plan tools by default to keep agent trajectories small

What is it built with?

JavaScriptNodenpm

How does it compare?

	maxforai/tokenless	systemoutprintlnhelloworld/plus-pp-helper	rosalina7515/ui-modernizer
Stars	32	32	30
Language	JavaScript	JavaScript	JavaScript
Setup difficulty	easy	moderate	easy
Complexity	2/5	3/5	2/5
Audience	vibe coder	developer	vibe coder

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · easy Time to first run · 5min

Not on npm registry yet, install via npm install -g github:MaxForAI/Tokenless and run repair-hooks before first launch.

MIT license, free to use, modify, and ship commercially as long as the copyright notice stays.

In plain English

Tokenless is a command-line add-on for Claude Code that tries to cut how many tokens each session burns through. The pitch on the front page is a single number: roughly 50% less request tokens in a vibe-coding run, and up to 80% less response tokens in plain chat. You install it from GitHub with npm, run a couple of setup commands, and then keep using Claude Code as normal. A tokenless style command lets you switch between three output profiles at any time. The core problem it tackles is that Claude Code sessions get expensive because every file read, log, diff, and verbose final reply gets carried into the context of the next request. Tokenless intercepts that flow: large tool outputs and file reads are kept on your machine as raw artifacts, while what actually goes to the model is a compact packet, called a TOKENLESS-READ-PACKET, containing an artifact id, imports, symbol list, a few snippets, and the exact commands needed to expand the original output later if the agent really needs it. It also targets two other sources of growth. The launcher trims out heavy Task and Plan tools by default to keep agent trajectory from ballooning, and the chat and coding output profiles change how Claude itself replies. The chat profile keeps responses short and readable, the coding profile produces dense structured responses for code work, and the off profile disables all of this and gives you stock Claude Code behavior. The README is unusually open about evidence. It lists a table of API-body measurements from real sessions: a 5-turn CRM vibe-coding run dropped from 4.7M request tokens to 2.5M, a natural conversation dropped from 7,223 response tokens to 1,442, a large CSS visual edit dropped by around 54 to 60 percent, and a 10,000-line React edit dropped by about 40 percent. There is also a research-backing section pointing at papers on brevity constraints, prompt compression, LLMLingua, LongLLMLingua, Selective Context, and Gist Tokens, while admitting these do not automatically prove Tokenless helps every session. Installation is npm install -g from the GitHub repo, then tokenless repair-hooks, tokenless install-commands, and tokenless launch. CLAUDE_BIN can point to a non-standard Claude Code binary. The project is MIT licensed and includes README translations for Chinese, Japanese, French, and Spanish.

Copy-paste prompts

Prompt 1

Install Tokenless globally from GitHub and walk me through repair-hooks, install-commands, and launch for Claude Code on macOS.

Prompt 2

Show me how to wire CLAUDE_BIN to a non-standard Claude Code binary path before running tokenless launch.

Prompt 3

Add a new tokenless style profile that sits between chat and coding, and explain which hook files I need to edit.

Prompt 4

Read a 10k-line TSX file inside a Tokenless session and show me how the TOKENLESS-READ-PACKET expansion command works.

Prompt 5

Compare my token usage on a 5-turn coding run with tokenless style off versus tokenless style coding and print the deltas.

Frequently asked questions

What is tokenless?

CLI add-on for Claude Code that reduces session token usage by storing large tool outputs locally and sending compact reference packets to the model instead.

What language is tokenless written in?

Mainly JavaScript. The stack also includes JavaScript, Node, npm.

What license does tokenless use?

MIT license, free to use, modify, and ship commercially as long as the copyright notice stays.

How hard is tokenless to set up?

Setup difficulty is rated easy, with roughly 5min to a first successful run.

Who is tokenless for?

Mainly vibe coder.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Verify against the repo before relying on details.