venturebeat.com

Meituan open sources LongCat-2.0: a 1.6T agentic coding model trained on Chinese ASICs

Tech News•Jun 30, 2026•

4 min read

Published by AINave Editorial • Reviewed by Ramit

TL;DRMeituan open sourced LongCat-2.0, a 1.6 trillion-parameter MoE model trained entirely on 50,000+ domestic Chinese ASICs. Released under the MIT license, it features a 1-million-token context window, a Zero-Compute Experts framework, and a novel Token Pack pricing model with zero-charge context cache hits. The model previously led OpenRouter under the anonymous Owl Alpha name and scores 59.5 on SWE-bench Pro, surpassing GPT-5.5.

Meituan has open sourced LongCat-2.0, a 1.6 trillion-parameter Mixture-of-Experts model that was previously leading OpenRouter under the anonymous name Owl Alpha. The model is released under the MIT license and is available on GitHub and Hugging Face. What makes this release stand out is that it was trained entirely on a cluster of over 50,000 domestic Chinese ASICs, signaling a potential shift away from Nvidia GPU dependence for large-scale AI training.

What happened

Meituan, the Chinese super app company best known for food delivery, officially unveiled LongCat-2.0 on GitHub, Hugging Face, and its native platform. The model had been operating anonymously as "Owl Alpha" on OpenRouter for two months, where it accounted for approximately 10.1 trillion monthly tokens, averaging 559 billion tokens per day, and secured the top ranking on the Hermes Agent workspace and second place on Claude Code deployments.

LongCat-2.0 uses a Mixture-of-Experts architecture with 1.6 trillion total parameters but activates only 33 billion to 56 billion parameters per token, averaging 48 billion. This is achieved through a "Zero-Compute Experts" framework that routes routine execution through lighter subnetworks, eliminating idle computational overhead.

The model supports a native 1-million-token context window, enabled by LongCat Sparse Attention (LSA), an evolution of DeepSeek Sparse Attention. LSA uses three indexing strategies: Streaming-aware Indexing for coalesced memory access, Cross-Layer Indexing to amortize calculation costs across layers, and Hierarchical Indexing for coarse-to-fine token selection.

Why AI builders should care

For teams building AI products and agentic workflows, LongCat-2.0 offers several practical advantages. The MIT license means enterprises can fork, modify, and deploy the model in private stacks without copyleft obligations, making it suitable for closed-source commercial applications.

The hardware story is equally significant. Training on 50,000+ domestic Chinese ASICs demonstrates that near-frontier models can be built without Nvidia GPUs. This has implications for supply chain resilience and cost structures, especially as U.S. export controls restrict access to advanced Western chips.

Meituan's post-training regime, called Multi-Teacher Optimization via Mixture of Specialized Experts (MOPD), separates optimization into three independent expert clusters: Agent Experts for tool invocation and execution, Reasoning Experts for multi-hop logic and STEM problem-solving, and Interaction Experts for alignment and safety. A dynamic gate-routing mechanism fuses these behaviors at runtime, which could enable more predictable and auditable agent behavior in enterprise deployments.

Practical implications

LongCat-2.0 is designed for autonomous software engineering tasks. In benchmarks, it scores 59.5 on SWE-bench Pro, surpassing GPT-5.5's 58.6. It also scores 70.8 on Terminal-Bench 2.1, 77.3 on SWE-bench Multilingual, and 73.2 on the FORTE corporate workflow simulator.

Pricing is aggressive. Standard pay-as-you-go rates are $0.75 per million input tokens and $2.95 per million output tokens. A limited-time promotional discount drops this to $0.30 and $1.20 respectively. Meituan also offers a Token Pack system with zero-charge context cache hits, meaning repeated reads of the same code repository during long agent sessions do not consume quota. Token packs are released via flash sales four times daily at 10:00, 16:00, 21:00, and 23:00 Beijing Time.

For enterprise teams, this means they can pass entire code repositories into the 1-million-token context window, have the model map dependencies, execute repository-level updates, compile the new codebase, and generate pull requests, all within a private sandbox environment.

Caveats

Most technical claims in this article come from VentureBeat coverage and Meituan's announcements. Independent verification of benchmark results, hardware claims, and real-world performance is not yet available. The model trails premium frontier systems like Claude Opus 4.8 on broader general-agent benchmarks such as FORTE and BrowseComp. Deployment feasibility on non-ASIC hardware is not detailed in the available sources. The promotional pricing is time-limited, and the Token Pack flash sale model may not suit all usage patterns.

FAQs

What is LongCat-2.0 and who released it?

LongCat-2.0 is a 1.6 trillion-parameter Mixture-of-Experts model released by Meituan, the Chinese super app company. It previously operated anonymously as Owl Alpha on OpenRouter. The model is released under the MIT license and is available on GitHub and Hugging Face.

What are the key features of LongCat-2.0 (1.6T parameters, 1-million-token context)?

LongCat-2.0 features 1.6 trillion total parameters with 33-56 billion activated per token via a Zero-Compute Experts framework. It supports a native 1-million-token context window using LongCat Sparse Attention (LSA) with Streaming-aware, Cross-Layer, and Hierarchical Indexing. It also includes an N-gram Embedding module with 135 billion additional parameters.

Under what license is LongCat-2.0 released and where can I access it?

LongCat-2.0 is released under the MIT license, which permits unrestricted use, modification, and commercial deployment without copyleft obligations. The model is available on GitHub, Hugging Face, and Meituan's platform.

What hardware does LongCat-2.0 require (Chinese ASICs vs GPUs)?

LongCat-2.0 was trained entirely on a cluster of over 50,000 domestic Chinese ASICs, demonstrating that near-frontier models can be built without Nvidia GPUs. The model can be deployed on compatible hardware, though specific inference hardware requirements are not detailed in available sources.

Sources

Latest Tech News

Pie's AI marketing stack targets local shops with Front Desk launch and $19.5M raise

20 minutes ago

US lawmakers push to curb AI health data sharing with new Health and Location Data Protection Act

20 minutes ago

AWS FDE: $1B to embed agentic AI on-site in 45 days

20 minutes ago

Google's Nano Banana 2 Lite: speed, cost, and enterprise workflow implications for AI builders

20 minutes ago

SAP hands AI product oversight to CEO and COO in a reshuffle to speed ERP delivery

20 minutes ago

Gemini's free Nano Banana 2 image generation goes live for US users via Personal Intelligence

6 hours ago

UK sovereign cloud push stalls as firms pay a hefty 'sovereignty tax'

6 hours ago

Gulf AI startup 1001 lands $30M to apply AI to aviation, ports, and energy infra

6 hours ago

AI agents as workplace colleagues: what Fanatics, Whoop, and Synopsys revealed at Snowflake Summit

6 hours ago

Claude arrives in Microsoft Foundry for Azure-governed enterprise AI, with dual hosting options and strict data controls

6 hours ago

Google Gemini desktop control: what the APK teardown hints and what it could mean for cross-device automation

6 hours ago

OpenClaw Android app turns your phone into a remote for self-hosted AI agents

6 hours ago

Uzbekistan's $5B AI export plan gains momentum, reshaping Central Asia's tech landscape

12 hours ago

Apple accelerates security updates in response to AI-powered hacking risks

12 hours ago

Taiwan Raid on Supermicro Could Reshape AI Chip Exports Enforcement

12 hours ago

Base44 launches proprietary Base1 model to strengthen defensibility for vibe coding platform

12 hours ago

AI testing startup Arato raises seed funding to validate AI before production

12 hours ago

Cursor iOS app lets developers run and manage AI coding agents from a phone

12 hours ago

California's Claude AI deal: half-price government access and a new procurement playbook

18 hours ago

TIDAL's AI-generated music policy: demonetization and labeling as a test for platform governance

18 hours ago

South Korea's $880B AI and chips mega-plan aims to reshape regional tech leadership

1 day ago

Voters across parties demand AI guardrails and government oversight, per NBC/AIPI poll

1 day ago

Canada pushes AI governance at UN as a multilateral path to safe, equitable adoption

1 day ago