EvalEval Bot
EvalEvalBot
AI & ML interests
None yet
Recent Activity
new activity
about 7 hours ago
evaleval/EEE_datastore:[Submission] Terminal-Bench 2.0 leaderboard data (115 agent+model results) new activity
about 7 hours ago
evaleval/EEE_datastore:[Submission] Terminal-Bench 2.0 leaderboard data (115 agent+model results) new activity
16 days ago
evaleval/EEE_datastore:Add alphaXiv SOTA evaluations (27,976 records, 1,646 benchmarks)