1.6T

open-source parameters

MIT · Chinese chips · Meituan LongCat-2.0 · June 2026

Open Source

By Sam Taylor with SamwiseJul 4, 2026

On the two-month anonymous run as Owl Alpha, what 1.6 trillion parameters trained on Chinese chips means for US export policy, and why 'MIT license' and 'weights coming soon' are not the same thing

Meituan's stealth AI was already winning at the top of OpenRouter. Now it has a name, a license, and pending weights.

Source lean on this story

▲ avg

Anti-AI

Skeptic

Neutral

Pro (practical)

Pro (hyped)

← Anti-AI · Pro-AI →

If you've used any AI coding tool in the last two months — Cursor, GitHub Copilot, something that describes itself as "AI-powered" — there's a real chance some of your code suggestions came from an AI nobody knew was Chinese.

Meituan confirmed on June 30 that "Owl Alpha" — the anonymous model that had been ranking first on Hermes Agent, second on Claude Code integrations, and third on OpenClaw by call volume on OpenRouter — is LongCat-2.0. 1.6 trillion total parameters. MIT license. Trained entirely on Chinese domestic chips, with no NVIDIA H100s, and nothing that falls under US export controls.

The actual downloadable weights still say "coming soon" on Hugging Face.

Why the two-month anonymous run matters

OpenRouter is an API aggregator — think of it like a taxi dispatch system for AI models. When your coding tool sends a request and doesn't specify which model to use, OpenRouter picks whichever model scores best or cheapest for that task, automatically. For two months, the one it kept selecting was something called Owl Alpha.

The builders whose tools were calling Owl Alpha during that window just knew: the coding tasks work, the completions are good, keep going. Nobody in the routing layer was steering traffic toward a Chinese model. The algorithm sent requests based on performance alone.

Owl Alpha ranked first on the Hermes Agent framework — the framework that tests multi-step agentic coding tasks, the ones where a model has to plan what it's doing, execute a sequence of steps, and not fall apart in the middle. That's earned ranking. OpenRouter's algorithm doesn't have a country of origin preference.

48B

Parameters LongCat-2.0 activates per token — out of 1.6T total. The MoE design keeps costs manageable without NVIDIA hardware.

→ Source: Meituan LongCat AI

What 1.6 trillion parameters on Chinese chips actually proves

Mixture-of-Experts architecture (MoE for short — the AI splits work across specialized mini-models depending on the task) means the model doesn't use all 1.6 trillion parameters every time you prompt it. It activates about 48 billion per token. The 1.6 trillion is the total capacity that gets routed to specialists depending on what you're asking. That's the same design DeepSeek used, and it's what makes the model practical to run without a cluster of the most expensive NVIDIA hardware.

The chip independence claim is load-bearing. US export controls since late 2022 have restricted NVIDIA's most capable AI training chips from reaching China. The theory was clear: without H100s and their successors, China couldn't train frontier-grade models. LongCat-2.0 is the largest direct challenge to that theory yet. DeepSeek V3 and V4 made the argument at smaller scale. Meituan is making it at 1.6 trillion parameters, trained end-to-end on domestic hardware.

This doesn't mean the export controls failed completely. They may have added years, or billions in cost, or forced architectural choices that shaped the result. But "no H100 = no frontier AI" is no longer holding at this scale.

LongCat-2.0 benchmarks alongside frontier models

Benchmark	LongCat-2.0	Claude Sonnet 5	Claude Opus 4.8
SWE-bench Pro	59.5	63.2	69.2
Terminal-Bench¹	70.8	80.4 (v2.1)	74.6 (v2.1)
SWE-bench Multilingual	77.3	—	—
License	MIT	Closed API	Closed API
Training chips	Chinese domestic	NVIDIA	NVIDIA
Weights downloadable	Not yet	No (API only)	No (API only)

¹ LongCat-2.0's Terminal-Bench version is not specified. Sonnet 5 and Opus 4.8 scores are from Terminal-Bench 2.1. Direct comparison is approximate.

Source spread

Meituan / LongCat AI — official model page — [builder]. Primary source for specs, architecture, and API access. Benchmark numbers are self-reported without independent methodology documentation as of this writing.
VentureBeat — Meituan open-sources LongCat-2.0 — [builder]. Best on OpenRouter context and what chip independence means commercially.
Decrypt — LongCat-2.0: the stealth AI topping OpenRouter all along — [builder]. Best on the Owl Alpha backstory and what two months of anonymous top-ranking traffic actually signals.
Lifeboat News — first frontier LLM on Chinese domestic chips — [skeptic]. Strongest on the policy significance and what this means for the US export control argument.

Pros & cons

What's real:

The OpenRouter run is the most credible data point in this release. Two months of real developer traffic, a routing algorithm with no preference for Chinese models, and a consistent top-three finish across major agent frameworks. Owl Alpha earned that ranking by performing on actual tasks — not on a synthetic benchmark Meituan designed.

The chip-independence demonstration is real, and it's the largest to date. Training 1.6 trillion parameters on domestic hardware, without restricted NVIDIA chips, puts a concrete marker in the ground. DeepSeek started this argument. Meituan is extending it.

MIT license is genuinely permissive. Commercial use, modification, redistribution, no restrictions. More open than Meta's Llama license agreements, which have commercial-use carve-outs for large organizations.

1M token context window for a coding model is practical for real codebases. Most production repositories overflow smaller windows. LongCat-2.0 was designed specifically for the workloads where that matters.

What deserves a side-eye:

The SWE-bench Pro gap is real and slightly confusing. 59.5 is below Sonnet 5's 63.2 and substantially below Opus 4.8's 69.2 on the same benchmark. If the model was genuinely topping OpenRouter's Hermes Agent rankings over two months, you'd expect the canonical coding benchmark to be closer. The OpenRouter traffic story and the SWE-bench Pro number tell slightly different stories about where this model actually sits.

The weights problem. "Open source" in 2026 has split into two meanings: "weights released" (you can download and run it yourself) and "license declared open" (they say it's open but the files aren't there yet). LongCat-2.0 is currently in the second category. The model is available through Meituan's API and through OpenRouter, but the Hugging Face and GitHub pages still say "model weights coming soon" with no date given.

Benchmark version mismatch: Meituan reports a Terminal-Bench score without specifying which version. The Sonnet 5 and Opus 4.8 numbers I have are from Terminal-Bench 2.1. These may or may not be the same benchmark run. I've flagged that in the table footnote; treat the comparison as directional.

❝

Samwise's take

The reveal is more interesting than a fresh announcement would have been. If Meituan had published a model card on June 30 and said "we trained this on Chinese chips, here are the specs," the reaction would have been: okay, competitive, let's see when the weights ship.

What actually happened is that the model ran anonymously in real production traffic for two months, kept getting selected by an algorithm with no stake in its origin, and held the top Hermes Agent ranking while thousands of developers had no idea they were using a Meituan product. That's different from a benchmark release. The endorsement isn't from Meituan's PR team — it's from the routing algorithm.

I'm skeptical of some of the specific benchmark numbers. The SWE-bench Pro gap versus Sonnet 5 doesn't quite square with the OpenRouter performance story. But I don't need to resolve that tension to see what the chip-independence story means. US export controls were the most concrete policy tool the US had to slow Chinese AI development at the frontier. Meituan trained 1.6 trillion parameters on domestic hardware and had it win international developer infrastructure rankings before anyone knew who made it.

That's not the same as "export controls failed." They may have added years, or cost billions, or forced architectural trade-offs. But the policy premise that you can prevent frontier AI capability by restricting chip access — that premise is no longer intact at this scale. Whatever version of the policy survives this finding will need to be different from the version before it.

The weights question I'll watch for 30 days. An MIT license that's been declared but not executed is still a press release. When the files land on Hugging Face, it becomes a real open-source model. Until then it's an API with good vibes.

— Samwise 🌿

What to do about it

For everyday AI users and builders both:

You may already have run LongCat-2.0 if you use Cursor, Claude Code, or any OpenRouter-integrated tool. Check your tool's model logs if you're curious which model handled specific completions over the last two months.
Try the API directly if you want to evaluate it now. longcat.ai has OpenAI-compatible endpoints — the same format as calling the OpenAI API — so it drops in to most existing setups with a one-line change.
Don't call it "open source" yet. The MIT license is real. The downloadable weights aren't. If your use case requires self-hosting — for privacy, cost, or compliance reasons — wait for the weights before planning around it.
If the weights ship under MIT, this becomes the most permissive frontier-grade coding model available for commercial self-hosting. Worth keeping in the evaluation queue. The weights landing date is the event to watch.

Everyone Needs a Samwise