microsoft/trellis

★ 12,578PythonAudience · researcherComplexity · 5/5Setup · hard

Mindmap

mindmap
  root((TRELLIS))
    What It Does
      Text to 3D model
      Image to 3D model
      Multiple output types
    Output Formats
      3D mesh for games
      Radiance field
      3D Gaussians
    Model Sizes
      342M parameters
      Up to 2B parameters
      Hugging Face demo
    Requirements
      NVIDIA GPU 16GB
      Linux only
      Python environment

mindmap root((TRELLIS)) What It Does Text to 3D model Image to 3D model Multiple output types Output Formats 3D mesh for games Radiance field 3D Gaussians Model Sizes 342M parameters Up to 2B parameters Hugging Face demo Requirements NVIDIA GPU 16GB Linux only Python environment

Click or tap to explore — scroll the page freely

Things people build with this

USE CASE 1

Generate a 3D game asset from a product photo without needing a 3D artist.

USE CASE 2

Convert concept art or a reference image into a 3D mesh for use in a game engine or 3D printing.

USE CASE 3

Research 3D generation techniques by training on TRELLIS's publicly available training code.

Tech stack

PythonPyTorchCUDA

Getting it running

Difficulty · hard Time to first run · 1day+

Requires Linux with an NVIDIA GPU with at least 16 GB of VRAM, setup installs GPU-specific Python libraries via a provided setup script.

In plain English

TRELLIS is a research project from Microsoft that generates 3D objects from text descriptions or images. You describe something in words or provide a photo, and the model produces a three-dimensional version of it that can be exported in several common 3D formats used in games, film, and design software. The core idea is that TRELLIS uses a single internal representation to produce multiple output types from the same generation. Rather than committing to one specific 3D file format, it can output as a mesh (the kind of 3D model used in games and 3D printing), as a radiance field (a way of representing 3D scenes that captures lighting), or as 3D Gaussians (another representation suited to rendering). This flexibility is useful for teams with different downstream tools. The project provides pre-trained models ranging from 342 million to 2 billion parameters. Larger models generally produce more detailed results. A live demo is available on Hugging Face, so you can try the tool without installing anything. Image-based generation is recommended over text-based generation when you want the most detailed output, since the text-conditioned models have less training data to work with. Running the code yourself requires a Linux machine with an NVIDIA GPU that has at least 16 GB of memory. Installation involves setting up a Python environment with several GPU-specific libraries, which the project handles through a setup script. The training code is also publicly available for researchers who want to build on the work. The full README is longer than what was shown.

Copy-paste prompts

Prompt 1

Using TRELLIS, generate a 3D mesh from a photo of a chair and export it as a .glb file for use in Unity.

Prompt 2

How do I run TRELLIS image-to-3D generation on a Linux machine with an NVIDIA RTX 3090?

Prompt 3

What is the difference between TRELLIS mesh output, radiance field output, and 3D Gaussian output, and when should I use each?

Open on GitHub → Explain another repo

← microsoft on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.