VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction Paper • 2602.13294 • Published 14 days ago • 12
Expected Harm: Rethinking Safety Evaluation of (Mis)Aligned LLMs Paper • 2602.01600 • Published 21 days ago • 21
TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Paper • 2502.19400 • Published Feb 26, 2025 • 47