NVIDIA Nemotron-Cascade 2: IMO Gold Medal Reasoning with Only 3B Active Parameters

NVIDIA’s 30B mixture-of-experts model activates only 3B parameters per token yet earned Gold Medal scores on the 2025 IMO, IOI, and ICPC. It is 20x smaller than the only other open model at this tier.
artificial-intelligence
Author

Kabui, Charles

Published

2026-03-31

Keywords

nvidia-nemotron, mixture-of-experts, mathematical-reasoning, reinforcement-learning, model-distillation