lester's picture

10

lester

rongll

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

OmniCap-IF: Benchmarking and Improving Instruction Following Abilities for Omni-Video Captioning

upvoted a paper 4 days ago

CoVEBench: Can Video Editing Models Handle Complex Instructions?

upvoted a paper 9 days ago

TVIR: Building Deep Research Agents Towards Text--Visual Interleaved Report Generation

View all activity

Organizations

None yet

upvoted 2 papers 4 days ago

OmniCap-IF: Benchmarking and Improving Instruction Following Abilities for Omni-Video Captioning

Paper • 2606.08572 • Published 6 days ago • 14

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Paper • 2606.08415 • Published 6 days ago • 48

upvoted 3 papers 9 days ago

TVIR: Building Deep Research Agents Towards Text--Visual Interleaved Report Generation

Paper • 2606.02320 • Published 12 days ago • 14

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published 12 days ago • 54

MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

Paper • 2606.01993 • Published 12 days ago • 14

upvoted 2 papers about 2 months ago

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Paper • 2604.18224 • Published Apr 20 • 22

DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Paper • 2604.14683 • Published Apr 16 • 36

upvoted a paper 2 months ago

CodeTracer: Towards Traceable Agent States

Paper • 2604.11641 • Published Apr 13 • 38

upvoted 2 papers 8 months ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12, 2025 • 46

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Paper • 2509.23873 • Published Sep 28, 2025 • 68