haipengluo's picture

haipengluo

haipeng1

·

AI & ML interests

None yet

Recent Activity

commentedon a paper 2 days ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

commentedon a paper 8 days ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

commentedon a paper 8 days ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

View all activity

Organizations

Papers 5

arxiv:2606.19236

arxiv:2512.20745

arxiv:2505.15431

arxiv:2407.10627

models 1

haipeng1/hp_intern2

Updated Jun 30, 2024

datasets 0

None public yet