Reasoning Fine-Tuning Can Generalize Across Domains, but Safety Pays the Price

A study across 45 models shows supervised fine-tuning with chain-of-thought traces can transfer reasoning to new domains when three conditions align, but safety alignment degrades in the process.
artificial-intelligence
Author

Kabui, Charles

Published

2026-04-22

Keywords

supervised-fine-tuning, reinforcement-learning, chain-of-thought, reasoning-generalization, safety-alignment