cuikq

AI & ML interests

None yet

Recent Activity

new activity 14 days ago

xlangai/osworld_v2_tasks:Update task_048.py

upvoted a paper 4 days ago

Agents' Last Exam

Paper • 2606.05405 • Published 18 days ago • 355

upvoted a paper 9 days ago

InterleaveThinker: Reinforcing Agentic Interleaved Generation

Paper • 2606.13679 • Published 10 days ago • 80

upvoted 2 papers 23 days ago

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

Paper • 2605.25624 • Published 27 days ago • 34

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published about 1 month ago • 241

upvoted 6 papers about 1 month ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published May 14 • 114

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published May 8 • 102

Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies

Paper • 2605.03596 • Published May 5 • 11

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 114

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published May 7 • 53

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published May 7 • 84

upvoted 7 papers about 2 months ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 281

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published Apr 24 • 231

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 178

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 366

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 508

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 634

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 328

upvoted 3 papers 2 months ago

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published Apr 13 • 143

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 123

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 295