PLUME: Latent Reasoning Based Universal Multimodal Embedding Paper • 2604.02073 • Published 9 days ago • 13
Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing Paper • 2604.02288 • Published 9 days ago • 27