Submitted by akhaliq 451 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning DeepSeek 92k 10
Submitted by akhaliq 91 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding · 15 authors 1.16k 6
Submitted by akhaliq 74 FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces · 10 authors 1.24k 3
Submitted by yaful 61 Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback · 5 authors 183 2
Submitted by akhaliq 28 O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning · 9 authors 99 2
Submitted by RicardoL1u 19 Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament · 6 authors 15 3
Submitted by jedyang97 17 Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass · 9 authors 1.57k 5
Submitted by Eladlev 13 IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems Plurai 1.23k 2