clark-labs-inc/comfyui-clarkairsana

Analysis updated 2026-05-18

★ 3PythonAudience · vibe coderComplexity · 3/5LicenseSetup · moderate

Mindmap

mindmap
  root((repo))
    What it does
      AI image generation
      Ternary 1.58-bit weights
      ComfyUI integration
    Nodes Included
      Model loader
      Gemma text encoder
      DC-AE VAE decoder
    Requirements
      NVIDIA CUDA GPU
      Linux or WSL2
      3.2 GB download
    Setup
      Install via Manager
      Download model file
      Drag workflow json

mindmap root((repo)) What it does AI image generation Ternary 1.58-bit weights ComfyUI integration Nodes Included Model loader Gemma text encoder DC-AE VAE decoder Requirements NVIDIA CUDA GPU Linux or WSL2 3.2 GB download Setup Install via Manager Download model file Drag workflow json

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Generate AI images in ComfyUI using a memory-efficient ternary-quantized model on a mid-range NVIDIA GPU

USE CASE 2

Run AI image generation entirely offline after the initial model download with no internet dependency

USE CASE 3

Experiment with Clark Air Sana 1.6B without installing any additional ComfyUI custom node packs

What is it built with?

PythonComfyUICUDAGemLiteTritonPyTorch

How does it compare?

	clark-labs-inc/comfyui-clarkairsana	0marildo/imago	agentlexi/agent-lexi
Stars	3	3	3
Language	Python	Python	Python
Setup difficulty	moderate	easy	moderate
Complexity	3/5	2/5	4/5
Audience	vibe coder	general	vibe coder

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 30min

Requires an NVIDIA CUDA GPU, best run on Linux or WSL2 as native Windows Triton support is community-maintained and may not work on every setup.

The vendored Sana model code is Apache-2.0, use freely for any purpose including commercial, with attribution.

In plain English

ComfyUI-ClarkAirSana is a plugin for ComfyUI, the popular node-based interface for AI image generation. It adds support for a model called Clark Air Sana 1.6B, which generates images from text descriptions. The defining feature of this model is that its weights are stored in an extremely compact format called ternary quantization, roughly 1.58 bits per value instead of the usual 16 or 32, which significantly reduces memory usage while keeping image quality practical. The actual computation uses a library called GemLite with INT2 CUDA kernels, meaning it runs on NVIDIA GPU hardware directly. Installing the plugin takes three steps. You install the ComfyUI-ClarkAirSana node pack through ComfyUI Manager, download a 495 MB model file to a specific folder, and drag a provided workflow file onto the ComfyUI canvas. The Gemma text encoder (which converts your text prompts into a form the model understands) and the DC-AE image decoder download themselves automatically on first use. Total download size is about 3.2 GB. The plugin adds four custom nodes to ComfyUI: a model loader, a text encoder, an image decoder, and a latent placeholder. These connect together through the standard KSampler node, so the workflow fits into the same mental model as other ComfyUI setups. The provided example workflow handles all the wiring. There are a few practical requirements to know. The plugin needs an NVIDIA GPU and runs best on Linux or Windows Subsystem for Linux (WSL2). Running on native Windows is possible but may not work reliably depending on the setup. Once all components have been downloaded once, the entire workflow runs offline with no network access needed.

Copy-paste prompts

Prompt 1

I installed ComfyUI-ClarkAirSana and want to generate my first image. Walk me through loading the example workflow and what KSampler settings to use.

Prompt 2

Why does ComfyUI-ClarkAirSana require keeping the transformer GPU-resident and recommend avoiding --lowvram? What breaks if I use lowvram mode?

Prompt 3

How do I install ComfyUI-ClarkAirSana on a Windows machine without WSL2? What are the risks and workarounds for Triton on native Windows?

Prompt 4

What is ternary quantization and how does it shrink the Clark Air Sana model to 495 MB compared to a full-precision image generation model?

Frequently asked questions

What is comfyui-clarkairsana?

A ComfyUI plugin that runs Clark Air Sana 1.6B, an AI image generation model with ultra-compact 1.58-bit ternary weights, through the standard KSampler workflow with no other node packs needed.

What language is comfyui-clarkairsana written in?

Mainly Python. The stack also includes Python, ComfyUI, CUDA.

What license does comfyui-clarkairsana use?

The vendored Sana model code is Apache-2.0, use freely for any purpose including commercial, with attribution.

How hard is comfyui-clarkairsana to set up?

Setup difficulty is rated moderate, with roughly 30min to a first successful run.

Who is comfyui-clarkairsana for?

Mainly vibe coder.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub clark-labs-inc on gitmyhub

Verify against the repo before relying on details.