SWE-Explore: Benchmarking How Coding Agents Explore Repositories Paper • 2606.07297 • Published 7 days ago • 108
WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models Paper • 2604.18224 • Published Apr 20 • 22
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 305