RL-Trained LLM Agents Can Collapse Into Fixed Templates That Entropy Can’t Detect

RAGEN-2 identifies template collapse in RL-trained LLM agents, a failure mode that entropy metrics do not detect. Mutual information catches it, and a simple filtering fix improves performance across four task domains.
artificial-intelligence
Author

Kabui, Charles

Published

2026-04-20

Keywords

reinforcement-learning, llm-agents, template-collapse, mutual-information, training-diagnostics