Explore which neurons in Qwen 2.5 3B activate for negation or style control by clicking through the 3D layer viewer in your browser
Run the Python bake script on a Llama, Mistral, or Phi-3 model to generate a neuron atlas for your own interpretability research
Compare neuron behavior across Qwen, Phi-3, and Mistral 7B side by side to look for neurons that serve similar roles across architectures
The viewer is a single HTML file that loads in under 10 seconds, running the bake script to generate data for a new model requires Python and takes longer.
LLM Neuron Atlas is an interactive 3D visualization that lets you explore the internal structure of a large language model one neuron at a time. Think of it as a kind of Google Maps for an AI model: you can zoom out to see the entire 36-layer tower of the model at once, or zoom in to a single layer and click on individual neurons to see how their signals flow to other neurons across the network. The tool is built around the Qwen 2.5 3B model, which is a publicly available language model with about 3 billion parameters. A Python script reads the model weights, computes which neurons most strongly influence which other neurons, and writes that relationship data into a set of JSON files. A browser-based viewer then loads those files and renders the network as a 3D structure using three.js, a popular JavaScript graphics library. The whole bake process runs on a regular laptop without a graphics card. Some neurons in the visualization are labeled with names from prior research by the same author, such as the INHIBITOR neuron at dimension 715, which appears to fire specifically when the model processes negation, and the CONTROLLER neuron at dimension 758, which influences style across many layers. Colored highways trace how a single neuron dimension carries its signal all the way from the input layer to the output. You can also compare the Qwen model side by side with Phi-3 and Mistral 7B to see which neurons seem to serve similar roles across different model architectures. The viewer is a single HTML file with no build process required. A live demo is available online and loads in under 10 seconds. The output data for the Qwen 3B model is about 35 megabytes when compressed. The bake script is designed so researchers can adapt it to other model families with small changes, and the README documents exactly which tensor names to adjust for Llama, Mistral, Phi-3, and GPT-2 style models. Mixture-of-experts models like Mixtral are not yet supported.
← norika1207-lab on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.