zhu's picture

zhu

haoran7

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

upvoted a paper 8 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

liked a Space 8 months ago

akhaliq/HunyuanImage-3.0

View all activity

Organizations

None yet

upvoted a paper 1 day ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published 5 days ago • 90

upvoted a paper 8 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 11 days ago • 155

upvoted 7 papers 8 months ago

Eliciting Secret Knowledge from Language Models

Paper • 2510.01070 • Published Oct 1, 2025 • 6

Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation

Paper • 2509.21989 • Published Sep 26, 2025 • 23

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24, 2025 • 42

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 83

Voxtral

Paper • 2507.13264 • Published Jul 17, 2025 • 34

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 79

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

Paper • 2509.14760 • Published Sep 18, 2025 • 53

upvoted 7 papers about 1 year ago

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20, 2025 • 62

Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity

Paper • 2505.11107 • Published May 16, 2025 • 29

Visual Agentic Reinforcement Fine-Tuning

Paper • 2505.14246 • Published May 20, 2025 • 32

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3, 2025 • 40

Unified Continuous Generative Models

Paper • 2505.07447 • Published May 12, 2025 • 42

Bielik 11B v2 Technical Report

Paper • 2505.02410 • Published May 5, 2025 • 54

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88