jian
lipliu
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Self-Distilled Agentic Reinforcement Learning
upvoted a paper 3 days ago
Flow-OPD: On-Policy Distillation for Flow Matching Models
upvoted a paper 3 days ago
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards
Organizations
None yet