Visual Para-Thinker++: A Single-Policy Multi-Agent Framework for Visual Reasoning Paper • 2606.09290 • Published 6 days ago • 7
SG-OPD: Sign-Gated On-Policy Distillation via Sign-Consistency Gating and Phased Teacher Sampling Paper • 2606.09304 • Published 6 days ago • 6