Analysis updated 2026-05-18
Generate AI images in ComfyUI using a memory-efficient ternary-quantized model on a mid-range NVIDIA GPU
Run AI image generation entirely offline after the initial model download with no internet dependency
Experiment with Clark Air Sana 1.6B without installing any additional ComfyUI custom node packs
| clark-labs-inc/comfyui-clarkairsana | 0marildo/imago | agentlexi/agent-lexi | |
|---|---|---|---|
| Stars | 3 | 3 | 3 |
| Language | Python | Python | Python |
| Setup difficulty | moderate | easy | moderate |
| Complexity | 3/5 | 2/5 | 4/5 |
| Audience | vibe coder | general | vibe coder |
Figures from each repo's GitHub metadata at analysis time.
Requires an NVIDIA CUDA GPU, best run on Linux or WSL2 as native Windows Triton support is community-maintained and may not work on every setup.
ComfyUI-ClarkAirSana is a plugin for ComfyUI, the popular node-based interface for AI image generation. It adds support for a model called Clark Air Sana 1.6B, which generates images from text descriptions. The defining feature of this model is that its weights are stored in an extremely compact format called ternary quantization, roughly 1.58 bits per value instead of the usual 16 or 32, which significantly reduces memory usage while keeping image quality practical. The actual computation uses a library called GemLite with INT2 CUDA kernels, meaning it runs on NVIDIA GPU hardware directly. Installing the plugin takes three steps. You install the ComfyUI-ClarkAirSana node pack through ComfyUI Manager, download a 495 MB model file to a specific folder, and drag a provided workflow file onto the ComfyUI canvas. The Gemma text encoder (which converts your text prompts into a form the model understands) and the DC-AE image decoder download themselves automatically on first use. Total download size is about 3.2 GB. The plugin adds four custom nodes to ComfyUI: a model loader, a text encoder, an image decoder, and a latent placeholder. These connect together through the standard KSampler node, so the workflow fits into the same mental model as other ComfyUI setups. The provided example workflow handles all the wiring. There are a few practical requirements to know. The plugin needs an NVIDIA GPU and runs best on Linux or Windows Subsystem for Linux (WSL2). Running on native Windows is possible but may not work reliably depending on the setup. Once all components have been downloaded once, the entire workflow runs offline with no network access needed.
A ComfyUI plugin that runs Clark Air Sana 1.6B, an AI image generation model with ultra-compact 1.58-bit ternary weights, through the standard KSampler workflow with no other node packs needed.
Mainly Python. The stack also includes Python, ComfyUI, CUDA.
The vendored Sana model code is Apache-2.0, use freely for any purpose including commercial, with attribution.
Setup difficulty is rated moderate, with roughly 30min to a first successful run.
Mainly vibe coder.
This repo across BitVibe Labs
Verify against the repo before relying on details.