Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts • Paper • 2602.13367 • Published 4 days ago • 13
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration • Paper • 2602.05400 • Published 13 days ago • 314
SWE-World: Building Software Engineering Agents in Docker-Free Environments • Paper • 2602.03419 • Published 15 days ago • 39
Nemotron-Cascade • Collection • Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 13 days ago • 52