DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 13 days ago • 204
TMAS: Scaling Test-Time Compute via Multi-Agent Synergy Paper • 2605.10344 • Published 22 days ago • 49
ClawGym: A Scalable Framework for Building Effective Claw Agents Paper • 2604.26904 • Published Apr 29 • 50
Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing Paper • 2512.23611 • Published Dec 29, 2025 • 7
Context as a Tool: Context Management for Long-Horizon SWE-Agents Paper • 2512.22087 • Published Dec 26, 2025 • 4
Scaling Laws for Code: Every Programming Language Matters Paper • 2512.13472 • Published Dec 15, 2025 • 17
A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression Paper • 2604.19572 • Published Apr 21 • 23
InCoder-32B-Thinking: Industrial Code World Model for Thinking Paper • 2604.03144 • Published Apr 3 • 233 • 3
Multilingual-Multimodal-NLP/IndustrialCoder-Thinking Text Generation • 32B • Updated Mar 26 • 179 • 4