Jinyang Wu

Jinyang23

https://orcid.org/my-orcid?orcid=0009-0006-0220-616X

jinyangwu

AI & ML interests

large language models, reasoning, agentic RL

Organizations

None yet

upvoted a paper 4 days ago

HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios

Paper • 2603.11975 • Published 8 days ago • 11

upvoted a paper 25 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

upvoted 8 papers about 1 month ago

Query as Anchor: Scenario-Adaptive User Representation via Large Language Model

Paper • 2602.14492 • Published Feb 16 • 18

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 156

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published Feb 5 • 60

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Paper • 2601.21459 • Published Jan 29 • 10

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published Feb 2 • 35

SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration

Paper • 2602.02419 • Published Feb 2 • 4

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 259

upvoted 2 papers about 2 months ago

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published Jan 30 • 12

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Paper • 2601.20209 • Published Jan 28 • 23

upvoted 3 papers 2 months ago

MAXS: Meta-Adaptive Exploration with LLM Agents

Paper • 2601.09259 • Published Jan 14 • 96

A^3-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation

Paper • 2601.09274 • Published Jan 14 • 85

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning

Paper • 2601.03872 • Published Jan 7 • 44

upvoted 2 papers 3 months ago

HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices

Paper • 2512.14052 • Published Dec 16, 2025 • 42

From Imitation to Discrimination: Toward A Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks

Paper • 2512.02580 • Published Dec 2, 2025 • 28

upvoted a paper 5 months ago

Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints

Paper • 2510.08549 • Published Oct 9, 2025 • 7

upvoted a paper 10 months ago

Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities

Paper • 2505.15692 • Published May 21, 2025 • 14

upvoted a paper about 1 year ago

DReSS: Data-driven Regularized Structured Streamlining for Large Language Models

Paper • 2501.17905 • Published Jan 29, 2025 • 2
