LongCat-2.0: Meituan's Open-Source Trillion-Parameter Coding Model

On June 30, 2026, Meituan's LongCat team officially released LongCat-2.0, a trillion-parameter open-source MoE model built for agentic coding. It natively supports 1M token context and scored 59.5 on SWE-bench Pro, surpassing GPT-5.5 and Claude Opus 4.6.

Key Specs

SpecValue
Total Parameters1.6T (1.6 trillion)
Active per Token33B–56B (dynamic), ~48B avg
Context Length1M tokens (native)
ArchitectureMoE (Mixture of Experts)
Training Data30T+ tokens (Chinese, English, multilingual, code)
Compute Cluster50,000 domestic AI accelerators

Three Key Technical Innovations

LongCat Sparse Attention (LSA)

Traditional attention mechanisms scale quadratically — beyond ~100K tokens, models start forgetting. LSA uses sparse attention to selectively attend to key information, dropping complexity from O(n²) to O(n). This enables accurate retrieval across 1 million tokens — equivalent to browsing an entire codebase at once.

Zero-Computation Experts + ScMoE

Code tasks vary wildly in complexity. LongCat-2.0 achieves token-level dynamic activation: simple tokens cost zero compute, while complex tokens automatically receive more expert resources (33B–56B range).

MOPD Multi-Expert Fusion

LongCat-2.0 starts from an SFT checkpoint, branches into three specialized expert groups (Agent, Reasoning, Interaction), then distills them into one unified model via MOPD on domestic accelerator clusters.

Benchmarks

BenchmarkLongCat-2.0Comparison
SWE-bench Pro59.5Beats Gemini 3.1 Pro (54.2), GPT-5.5 (58.6), Claude Opus 4.6 (57.3)
SWE-bench Multilingual77.3On par with Claude Opus 4.6 (77.8)
Terminal-Bench 2.170.8Real terminal interaction
RWSearch78.8Search agent tasks
FORTE73.2Productivity scenarios
BrowseComp79.9Complex browsing

How to Access

  1. longcat.ai — online experience
  2. OpenRouter — global API (Top 3 by call volume)
  3. LongCat API Platform — OpenAI / Anthropic API compatible
  4. Model weights coming soon on Hugging Face / GitHub