Zhengyang Tang

tangzhy

9 26 9

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

PhoneBuddyAI/PhoneBuddy-4B

commentedon a paper 11 days ago

Training Open Models for Agentic Phone Use

authored a paper 12 days ago

Training Open Models for Agentic Phone Use

View all activity

Organizations

liked a model 9 days ago

PhoneBuddyAI/PhoneBuddy-4B

Image-Text-to-Text • 5B • Updated 20 days ago • 79 • 7

commented a paper 11 days ago

Training Open Models for Agentic Phone Use

Paper • 2606.23049 • Published 13 days ago • 16 •

authored a paper 12 days ago

Training Open Models for Agentic Phone Use

Paper • 2606.23049 • Published 13 days ago • 16

upvoted a paper 12 days ago

Training Open Models for Agentic Phone Use

Paper • 2606.23049 • Published 13 days ago • 16

authored a paper 18 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Paper • 2606.17861 • Published 19 days ago • 58

upvoted a paper 18 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Paper • 2606.17861 • Published 19 days ago • 58

authored a paper 19 days ago

PhoneHarness: Harnessing Phone-Use Agents through Mixed GUI, CLI, and Tool Actions

Paper • 2606.14832 • Published 23 days ago • 12

upvoted a paper 19 days ago

PhoneHarness: Harnessing Phone-Use Agents through Mixed GUI, CLI, and Tool Actions

Paper • 2606.14832 • Published 23 days ago • 12

upvoted a paper about 1 month ago

PhoneWorld: Scaling Phone-Use Agent Environments

Paper • 2605.29486 • Published May 28 • 11

authored a paper about 1 month ago

PhoneWorld: Scaling Phone-Use Agent Environments

Paper • 2605.29486 • Published May 28 • 11

submitted a paper to Daily Papers about 1 month ago

PhoneWorld: Scaling Phone-Use Agent Environments

Paper • 2605.29486 • Published May 28 • 11

authored a paper about 2 months ago

Safe, or Simply Incapable? Rethinking Safety Evaluation for Phone-Use Agents

Paper • 2605.07630 • Published May 8 • 1

submitted a paper to Daily Papers about 2 months ago

Safe, or Simply Incapable? Rethinking Safety Evaluation for Phone-Use Agents

Paper • 2605.07630 • Published May 8 • 1

authored a paper 2 months ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published Apr 30 • 42

upvoted a paper 2 months ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published Apr 30 • 42

authored a paper 3 months ago

Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning

Paper • 2604.16029 • Published Apr 17 • 23

upvoted 2 papers 3 months ago

Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning

Paper • 2604.16029 • Published Apr 17 • 23

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 69

authored a paper 3 months ago

Do Phone-Use Agents Respect Your Privacy?

Paper • 2604.00986 • Published Apr 1 • 9

upvoted a paper 3 months ago

Do Phone-Use Agents Respect Your Privacy?

Paper • 2604.00986 • Published Apr 1 • 9

Zhengyang Tang

AI & ML interests

Recent Activity

Organizations

tangzhy's activity