Ai2 Open Coding Agents - Django, Sphinx, Sympy Data
AI & ML interests
Building breatkthrough AI to solve the world's biggest problems.
Recent Activity
View all activity
Papers
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics
How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs
Organization Card
spaces 13
pinned
Running
19
AstaBench Leaderboard
🥇
View benchmark leaderboards
pinned
Running
421
Reward Bench Leaderboard
📐
Explore RewardBench model rankings and scores
pinned
Running
2
HREF Leaderboard
📐
Browse and search HREF leaderboard data
pinned
Running
91
Zebra Logic Bench
🦓
Show leaderboard and explore model puzzle results
pinned
Running
3
SUPER Leaderboard
🤖
Display a static leaderboard from a JSON file
pinned
Running
53
ZeroEval Leaderboard
📊
Embed ZeroEval for evaluation
models 852
allenai/Flex-pes2o-2x7B-1T
Text Generation • 12B • Updated
• 168 • 2
allenai/Flex-news-2x7B-1T
Text Generation • 12B • Updated
• 163 • 2
allenai/Flex-creative-2x7B-1T
Text Generation • 12B • Updated
• 274 • 5
allenai/Flex-public-7B-1T
Text Generation • 7B • Updated
• 269 • 5
allenai/Flex-code-2x7B-1T
Text Generation • 12B • Updated
• 392 • 2
allenai/Flex-math-2x7B-1T
Text Generation • 12B • Updated
• 387 • 3
allenai/olmo-3-tokenizer-instruct-release
Updated
• 1
allenai/olmOCR-2-7B-1025-FP8
Image-Text-to-Text • 8B • Updated
• 220k • 203
allenai/Olmo-3-7B-RL-Zero-General
Text Generation • 528k • Updated
• 104 • 7
allenai/Olmo-3-7B-RL-Zero-IF
Text Generation • 528k • Updated
• 76 • 6
datasets 368
allenai/prescience
Viewer
• Updated
• 839k • 34 • 9
allenai/dolma3_pool
Preview
• Updated
• 124k • 32
allenai/dolma3_longmino_mix-100B-1125
Preview
• Updated
• 19.6k • 11
allenai/dolma3_dolmino_mix-100B-1125
Preview
• Updated
• 227k • 19
allenai/asta-summary-citation-counts
Viewer
• Updated
• 47M • 386 • 8
allenai/olmix
Preview
• Updated
• 272 • 38
allenai/Dolci-Instruct-DPO
Viewer
• Updated
• 260k • 2.68k • 7
allenai/olmOCR-bench
Benchmark
• Updated
• 2.12k • 102
allenai/Molmo2-MultiImageQA
Viewer
• Updated
• 44.7k • 163 • 2
allenai/molmospaces
Viewer
• Updated
• 772k • 408 • 39