Yuanfang Peng
yfpeng1234
AI & ML interests
None yet
Recent Activity
published a model about 2 months ago
yfpeng1234/RLinf_stack-cube_sft upvoted a paper 4 months ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models updated a dataset about 1 year ago
yfpeng1234/liberoOrganizations
None yet