explaingit

hellozicky/navicache

14PythonAudience · researcherComplexity · 4/5Setup · hard

TLDR

ICML 2026 research code for NaviCache, a training-free method that speeds up AI video generation by skipping redundant computations when internal model states change very little between steps.

Mindmap

mindmap
  root((NaviCache))
    Problem
      Slow video generation
      Redundant computation
    Method
      Change tracking
      Skip low-delta blocks
      On-the-fly estimation
    Supported Models
      Wan2.1
      HunyuanVideo
      Open-Sora 1.2
    Controls
      Threshold setting
      Fast medium slow presets
      Environment variables
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Speed up Wan2.1, HunyuanVideo, or Open-Sora 1.2 video generation with a tunable speed-quality tradeoff threshold.

USE CASE 2

Apply adaptive computation caching to video diffusion inference with no pre-calibration dataset required.

USE CASE 3

Reproduce NaviCache benchmark comparisons against competing caching methods on the same generation prompts.

Tech stack

PythonWan2.1HunyuanVideoOpen-Sorashell scripts

Getting it running

Difficulty · hard Time to first run · 1h+

Requires cloning both NaviCache and an official model repository, plus GPU resources to run video diffusion inference.

In plain English

This repository contains the official code for NaviCache, a research method accepted at ICML 2026. Its purpose is to make AI video generation faster without sacrificing too much quality. Video generation models are slow because they process the same information through many computational steps on every frame, even when the underlying content has not changed much from one step to the next. NaviCache addresses this by tracking how much the internal representations are actually changing, and skipping computations when the change is small enough not to matter. The approach does not require any setup beforehand. It does not need a separate calibration dataset or pre-processing phase. Instead, it learns on the fly during inference, updating its own estimate of feature change as the video is generated. A lightweight estimator decides at each step whether to skip a processing block or compute it fully. This is controlled by a threshold setting that lets users trade generation speed for output quality. The code supports three video generation models: Wan2.1, HunyuanVideo, and Open-Sora 1.2. For each model, there are shell scripts that handle setup and launch, and the NaviCache entry point is a single Python script that gets copied into the official model repository. The README provides recommended threshold and alignment step values for fast, medium, and slow presets across all three supported models. Installation requires cloning both the NaviCache repository and whichever official model repository you want to accelerate, then running the provided launch script from within the model's directory. Environment variables control the key hyperparameters, so you can adjust them without editing files. This is a research release for AI practitioners and researchers working with video diffusion models. The paper and project page were still forthcoming at the time of release. The repository includes comparison videos showing NaviCache outputs alongside several competing caching methods on the same prompt.

Copy-paste prompts

Prompt 1
I want to use NaviCache from hellozicky/navicache to speed up my HunyuanVideo generation. Walk me through cloning both repositories, running the launch script, and setting the threshold for medium quality.
Prompt 2
How does NaviCache decide which computation blocks to skip during inference? What signal does the lightweight estimator track and how is the threshold parameter used to make that decision?
Prompt 3
I am using Wan2.1 with NaviCache and want to tune the speed-quality tradeoff. What are the recommended threshold and alignment step values for the fast, medium, and slow presets?
Prompt 4
I want to add NaviCache support to a video diffusion model that is not yet in the supported list. What does the NaviCache entry point script hook into and what interface does it expect from the model?
Open on GitHub → Explain another repo

← hellozicky on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.