GUANGZENG HAN

kwangju

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

upvoted an article about 1 month ago

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

upvoted a paper about 1 month ago

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

Organizations

None yet

upvoted a paper 1 day ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published 4 days ago • 89

upvoted an article about 1 month ago

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Article • Weyaxi • Jan 2 • 22

upvoted a paper about 1 month ago

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

Paper • 2604.05333 • Published Apr 7 • 22

updated a collection 2 months ago

agent-data

6 items • Updated Mar 17

updated a collection 4 months ago

omni

4 items • Updated Jan 26

updated a collection 6 months ago

agent-data

6 items • Updated Mar 17

upvoted a paper 8 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276

updated a collection 8 months ago

audio-LLM

1 item • Updated Oct 10, 2025

upvoted 2 papers 9 months ago

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 151

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 105

authored a paper 9 months ago

Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

Paper • 2509.02040 • Published Sep 2, 2025 • 15

upvoted a paper 9 months ago

Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

Paper • 2509.02040 • Published Sep 2, 2025 • 15

commented on a paper 9 months ago

Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

Paper • 2509.02040 • Published Sep 2, 2025 • 15

upvoted a paper 9 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85