Li Pengyi
LiPengyi29
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 hours ago
How Far Can Unsupervised RLVR Scale LLM Training? upvoted a paper about 1 month ago
Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities submitted
a paper
about 1 month ago
Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities