Mitigating Safety Tax via Distribution-Grounded Refinement in Large Reasoning Models Paper โข 2602.02136 โข Published 5 days ago โข 7