Pelly
Pellypp
AI & ML interests
None yet
Recent Activity
upvoted a paper about 6 hours ago
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning upvoted a paper about 6 hours ago
STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability upvoted a paper about 2 months ago
Meta-CoT: Enhancing Granularity and Generalization in Image EditingOrganizations
None yet