explaingit

jiang-cx/h-omnistereo

Analysis updated 2026-06-24

31Audience · researcherComplexity · 5/5Setup · hard

TLDR

Research project page for H-OmniStereo, a zero-shot stereo matching method for 360-degree top-and-bottom cameras, with a synthetic dataset and normal estimator coming.

Mindmap

mindmap
  root((H-OmniStereo))
    Inputs
      Top-bottom 360 image pairs
      Equirectangular frames
    Outputs
      Depth from stereo
      Heading-aligned normals
    Use Cases
      Full-surround perception
      Consumer 360 camera depth
    Tech Stack
      Computer Vision
      Deep Learning
      Synthetic Data
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

What do people build with it?

USE CASE 1

Read the paper to learn how omnidirectional stereo differs from regular stereo

USE CASE 2

Wait for the dataset release to train your own 360 stereo model

USE CASE 3

Cite the work in a related CV paper on panoramic perception

What is it built with?

PythonPyTorchComputerVision

How does it compare?

jiang-cx/h-omnistereo732124645/promptopsadiao1973/librobotbagfix
Stars313131
LanguageGoC++
Setup difficultyhardeasyhard
Complexity5/53/54/5
Audienceresearcherdeveloperops devops

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · hard Time to first run · 1day+

Repo is currently a paper landing page with no code or dataset released yet, so nothing runs.

In plain English

H-OmniStereo is a computer vision research project from a group at HKUST, written up in a paper on arXiv as 2605.14963. It is about stereo matching on 360-degree omnidirectional cameras, which is the problem of figuring out how far away each part of a scene is when you have two panoramic cameras instead of two regular ones pointing forward. The motivation, in plain terms: regular stereo matching has gotten very good because there are large training datasets and pretrained networks that can estimate properties like surface orientation from a single image. When you try to copy that pipeline over to 360-degree top-and-bottom camera rigs, two things go wrong. There are not enough public 360 stereo datasets to train on, and the pretrained helpers from regular cameras get confused by the heavy warping that happens when you flatten a sphere into a rectangular image. The paper proposes two fixes. First, the authors built a synthetic dataset with over 2.8 million pairs of top-and-bottom 360 stereo images, which is large enough to train a competitive model. Second, they trained a new normal-estimation network, a model that predicts which way each pixel of the scene is facing, operating directly in the 360 image space with a heading-aligned coordinate system so that the spherical distortion does not throw it off. That gives the stereo matcher a stable geometric hint to work with. The headline claim is that the resulting model is zero-shot: it does well on datasets it was not trained on, and it generalises to real-world setups from off-the-shelf consumer 360 cameras using a single model. The authors say both the model and the dataset will be open-sourced. The current state of the repo is more of a project landing page than a code release. There is a logo, a link to the arXiv paper, a link to a YouTube demo video, the abstract, an author contact email, and the BibTeX citation. There is no installation guide, no Python code, and no dataset download link visible in the README yet.

Copy-paste prompts

Prompt 1
Summarise the H-OmniStereo paper in plain English for someone who knows perspective stereo but not omnidirectional cameras
Prompt 2
Compare H-OmniStereo to existing 360 stereo methods and explain what the heading-aligned normal prior buys you
Prompt 3
Draft a checklist of what code and data files I should expect when H-OmniStereo open-sources its model
Prompt 4
Write a short blog post explaining why pretrained monocular priors fail on equirectangular images

Frequently asked questions

What is h-omnistereo?

Research project page for H-OmniStereo, a zero-shot stereo matching method for 360-degree top-and-bottom cameras, with a synthetic dataset and normal estimator coming.

How hard is h-omnistereo to set up?

Setup difficulty is rated hard, with roughly 1day+ to a first successful run.

Who is h-omnistereo for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Verify against the repo before relying on details.