InternScience/ResearchClawBench
Benchmark • Updated • 57 • 3.67k • 8
None defined yet.
Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research