Meituan LongCat-Next: One Model That Sees, Draws, and Talks Using a Single Token System

Meituan’s 74B open-source model unifies text, vision, and audio as discrete tokens under one autoregressive objective, matching specialist models at 28x compression.
artificial-intelligence
Author

Kabui, Charles

Published

2026-04-04

Keywords

longcat-next, native-multimodal, discrete-tokens, visual-tokenizer, meituan