explaingit

hygenie1228/tehor_release

Analysis updated 2026-05-18

9PythonAudience · researcherComplexity · 5/5LicenseSetup · hard

TLDR

A research system that reconstructs a textured 3D human figure and object from a single photo, using text descriptions to capture how they interact.

Mindmap

mindmap
  root((TeHOR))
    What it does
      Single image input
      3D human reconstruction
      3D object reconstruction
      Texture generation
    Key ideas
      Text-guided interaction
      Non-contact semantics
      Appearance alignment
    Pipeline steps
      Preprocess image
      Initialize object mesh
      Run optimization
    Requirements
      Python 3.10
      PyTorch CUDA
      OpenAI API key
      Pretrained models
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

What do people build with it?

USE CASE 1

Reproduce the CVPR 2026 TeHOR paper results by running the provided scripts on the example images.

USE CASE 2

Use the reconstruction pipeline as a baseline to compare against your own 3D human-object interaction method.

USE CASE 3

Generate textured 3D models of a person and object from a single photograph for non-commercial research or visualization.

What is it built with?

PythonPyTorchCUDAOpenAI API

How does it compare?

hygenie1228/tehor_releasedanieldoradotalaveron-rb/yolosegment-2d-to-3d-rebotarm_pick_and_placeewreaslan/jwttx
Stars999
LanguagePythonPythonPython
Setup difficultyhardhardeasy
Complexity5/55/53/5
Audienceresearcherresearcherdeveloper

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · hard Time to first run · 1day+

Requires a CUDA GPU, multiple large pretrained model downloads, and an OpenAI API key for text prompt generation.

CC BY-NC 4.0: free to use and modify for non-commercial purposes, with attribution required.

In plain English

This is the official code release for TeHOR, a computer vision research system published at CVPR 2026 by researchers at Seoul National University. The problem it solves is specific: given a single photo containing a person and an object, reconstruct both as detailed 3D models, complete with surface textures and a realistic physical relationship between them. The key insight in the approach is that contact alone is not enough to understand how people relate to objects. Someone gazing at a book or pointing at a sign is meaningfully interacting with an object without touching it. TeHOR uses text descriptions to capture this broader sense of interaction, so the resulting 3D reconstruction reflects not just where bodies and objects are in space but also the semantic nature of what is happening between them. The output is a pair of textured 3D models that fit together coherently: a human figure and an object, positioned and oriented relative to each other in a way that matches the original photo. The system handles appearance as well as shape, meaning the surfaces have colors and textures drawn from the input image rather than being blank geometry. Running TeHOR requires significant infrastructure. Setup involves Python 3.10, a specific version of PyTorch, a CUDA-capable GPU, and a large number of third-party dependencies installed through multiple setup scripts. The data directory expects several large pretrained models downloaded separately, and the pipeline also calls the OpenAI API for text prompt generation, requiring an API key. The workflow is three steps: preprocess the input image to extract the human and object separately, optionally align an object mesh to a depth estimate, then run the main optimization script. This is a research codebase, not a consumer tool. It is aimed at computer vision researchers or advanced practitioners who want to reproduce the paper's results or build on the method. The license is CC BY-NC 4.0, which allows use and adaptation for non-commercial purposes only.

Copy-paste prompts

Prompt 1
I want to run TeHOR on my own image. Walk me through the preprocess.py and run_tehor.py steps, and explain what each output file in the experiment directory contains.
Prompt 2
I'm setting up TeHOR and the installation is failing at the KNN_CUDA step. What are the likely causes and how do I fix them?
Prompt 3
Explain what TeHOR's text-guided optimization does differently from contact-only human-object reconstruction methods, in plain terms.

Frequently asked questions

What is tehor_release?

A research system that reconstructs a textured 3D human figure and object from a single photo, using text descriptions to capture how they interact.

What language is tehor_release written in?

Mainly Python. The stack also includes Python, PyTorch, CUDA.

What license does tehor_release use?

CC BY-NC 4.0: free to use and modify for non-commercial purposes, with attribution required.

How hard is tehor_release to set up?

Setup difficulty is rated hard, with roughly 1day+ to a first successful run.

Who is tehor_release for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub hygenie1228 on gitmyhub

Verify against the repo before relying on details.