LLaDA2.0-Uni: One Diffusion Model for Understanding and Generating Text, Images, and Video

Inclusion AI’s 16B Mixture-of-Experts model uses masked diffusion instead of left-to-right generation, handling image understanding, text-to-image, and editing with only ~1B parameters active per token.
artificial-intelligence
Author

Kabui, Charles

Published

2026-04-28

Keywords

discrete-diffusion, multimodal-models, mixture-of-experts, image-generation, inclusion-ai