SWE-bench

community

https://swe-bench.com

Recent Activity

john-b-yang authored a paper 28 days ago

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

john-b-yang authored a paper 28 days ago

OpenThoughts: Data Recipes for Reasoning Models

john-b-yang authored a paper 28 days ago

LongCodeBench: Evaluating Coding LLMs at 1M Context Windows

Papers

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

SWE-bench's models (3)

SWE-bench/SWE-agent-LM-7B

Text Generation • 8B • Updated Jul 13, 2025 • 29 • 6

SWE-bench/SWE-Rater-32B

33B • Updated Jun 1, 2025 • 5 • 3

SWE-bench/SWE-agent-LM-32B

Text Generation • 33B • Updated May 12, 2025 • 245 • 82