yuchang's picture

2 4 19

yuchang

hiyuchang

·

AI & ML interests

None yet

Recent Activity

commented on a paper 8 days ago

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

authored a paper 10 days ago

Exploring Selective Layer Fine-Tuning in Federated Learning

authored a paper 10 days ago

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

View all activity

Organizations

authored 7 papers 10 days ago

Exploring Selective Layer Fine-Tuning in Federated Learning

Paper • 2408.15600 • Published Aug 28, 2024

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Paper • 2505.17826 • Published May 23, 2025 • 10

Enhancing Latent Computation in Transformers with Latent Tokens

Paper • 2505.12629 • Published May 19, 2025

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15, 2025 • 8

Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

Paper • 2509.24203 • Published Sep 29, 2025 • 8

R$^3$L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification

Paper • 2601.03715 • Published Jan 7 • 2

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Paper • 2602.03392 • Published 16 days ago • 53