meituan-longcat/longcat-2.0

Analysis updated 2026-05-18

★ 221Audience · developerComplexity · 4/5LicenseSetup · hard

Mindmap

mindmap
  root((LongCat-2.0))
    Architecture
      1.6T parameters
      Mixture of Experts
      Sparse Attention
      N-gram Embedding
    Context
      1M token window
      Long document QA
      Full repo reads
    Use cases
      Coding agents
      Agentic task runs
      Code editing
    Deployment
      HuggingFace
      ModelScope
      MIT license
    Training
      35T tokens
      AI ASIC hardware

mindmap root((LongCat-2.0)) Architecture 1.6T parameters Mixture of Experts Sparse Attention N-gram Embedding Context 1M token window Long document QA Full repo reads Use cases Coding agents Agentic task runs Code editing Deployment HuggingFace ModelScope MIT license Training 35T tokens AI ASIC hardware

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Run an AI coding agent that reads and edits an entire large codebase in a single context window.

USE CASE 2

Use the model as the reasoning engine behind an agentic task executor for multi-step workflows.

USE CASE 3

Answer questions about very long documents or full books that exceed the limits of typical models.

USE CASE 4

Integrate LongCat-2.0 into Claude Code or a similar AI coding tool as an alternative LLM backend.

What is it built with?

PythonHuggingFaceModelScopeMoE architectureASIC

How does it compare?

	meituan-longcat/longcat-2.0	wubing2023/paperspine	germondai/trawl
Stars	221	220	218
Language	—	Python	TypeScript
Setup difficulty	hard	moderate	moderate
Complexity	4/5	3/5	3/5
Audience	developer	researcher	ops devops

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · hard Time to first run · 1h+

Running the full 1.6T-parameter model requires substantial GPU memory, exact hardware requirements are not stated in the README.

Use freely for any purpose, including commercial, as long as you keep the copyright notice (MIT).

In plain English

LongCat-2.0 is a very large AI language model developed by Meituan, a Chinese technology company. Like GPT or Claude, it can read and write text, answer questions, write code, and follow complex instructions. It has 1.6 trillion total parameters, a measure of model size, with about 48 billion active at any given moment. It uses an architecture called Mixture of Experts, where only a portion of the model activates for each request. What sets this model apart is its focus on extremely long context windows. It was trained on sequences up to one million tokens long, which means it can read and reason over an entire large codebase or a very long document in a single pass. Most publicly available models top out at 200,000 tokens or less. The model was built specifically for coding and agent tasks. It is integrated with tools like Claude Code and other AI coding assistants, meaning it can help with reading whole repositories, making large-scale code edits, and running automated tasks across many steps. Benchmark scores show it competing closely with leading models from Google and Anthropic on coding-agent evaluations. The training used custom AI chips called ASIC superpods rather than standard graphics cards. The training run covered more than 35 trillion tokens across millions of accelerator-days without any training failures or rollbacks. Model weights are available on HuggingFace and ModelScope, a Chinese model hosting platform. The license is MIT, meaning you can use it freely including for commercial purposes. The README links to a technical blog post with full architecture details.

Copy-paste prompts

Prompt 1

How do I load and run LongCat-2.0 from HuggingFace for local inference on a multi-GPU machine?

Prompt 2

Show me how to use LongCat-2.0 with Claude Code as the backing model for a code-editing agent on a large repository.

Prompt 3

What is the Mixture of Experts architecture in LongCat-2.0 and how does it differ from a standard transformer?

Prompt 4

Walk me through the LongCat Sparse Attention mechanism and why it helps with 1-million-token context tasks.

Prompt 5

Compare LongCat-2.0 SWE-bench scores to Claude Opus 4.8 and GPT-5.5 and explain what the difference means for coding agents.

Frequently asked questions

What is longcat-2.0?

LongCat-2.0 is a 1.6-trillion-parameter open AI language model built for coding agents and long-context tasks, with a 1-million-token context window and MIT license.

What license does longcat-2.0 use?

Use freely for any purpose, including commercial, as long as you keep the copyright notice (MIT).

How hard is longcat-2.0 to set up?

Setup difficulty is rated hard, with roughly 1h+ to a first successful run.

Who is longcat-2.0 for?

Mainly developer.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub meituan-longcat on gitmyhub

Verify against the repo before relying on details.