Liya PRO

juliazzzvvv

2 5 1

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

juliazzzvvv/open

upvoted a paper 3 days ago

Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields

authored a paper 5 days ago

EdgeBench: Unveiling Scaling Laws of Learning from Real-World Environments

View all activity

Organizations

updated a dataset 3 days ago

juliazzzvvv/open

Updated 3 days ago • 125

upvoted a paper 3 days ago

Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields

Paper • 2606.11042 • Published Jun 9 • 22

authored 3 papers 5 days ago

EdgeBench: Unveiling Scaling Laws of Learning from Real-World Environments

Paper • 2607.05155 • Published 21 days ago • 18

Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields

Paper • 2606.11042 • Published Jun 9 • 22

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 278

published a dataset 8 days ago

juliazzzvvv/open

Updated 3 days ago • 125

upvoted a collection 20 days ago

M-A-P Full Paper List

Collection

27 items • Updated Dec 16, 2025 • 15

commented a paper about 2 months ago

Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields

Paper • 2606.11042 • Published Jun 9 • 22 •

upvoted a paper 3 months ago

CocoaBench: Evaluating Unified Digital Agents in the Wild

Paper • 2604.11201 • Published Apr 13 • 37

upvoted a paper 6 months ago

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published Dec 14, 2025 • 52

liked a dataset 7 months ago

m-a-p/LPFQA

Viewer • Updated Nov 10, 2025 • 502 • 119 • 6

updated a dataset 9 months ago

m-a-p/LPFQA

Viewer • Updated Nov 10, 2025 • 502 • 119 • 6

New activity in m-a-p/LPFQA 9 months ago

Update README.md

#2 opened 9 months ago by

JingzheDing

published a dataset 9 months ago

m-a-p/LPFQA

Viewer • Updated Nov 10, 2025 • 502 • 119 • 6

upvoted a paper 11 months ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published Sep 4, 2025 • 58

Liya PRO

AI & ML interests

Recent Activity

Organizations

juliazzzvvv's activity

Update README.md