NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models Paper • 2602.06694 • Published 5 days ago • 11
Reliable and Responsible Foundation Models: A Comprehensive Survey Paper • 2602.08145 • Published 7 days ago • 8
Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration? Paper • 2602.07055 • Published 7 days ago • 20
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 3 days ago • 36
GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published 2 days ago • 22
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery Paper • 2602.08990 • Published 2 days ago • 59
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery Paper • 2601.19325 • Published 16 days ago • 79
The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents Paper • 2601.11496 • Published 26 days ago • 47
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 28 days ago • 126
TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents Paper • 2601.05899 • Published Jan 9 • 4
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published about 1 month ago • 114
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published Dec 31, 2025 • 119
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 224
OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment Paper • 2601.01576 • Published Jan 4 • 18
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published Jan 5 • 26