Xiinye Wang

xinyewang

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 10 days ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 11 days ago • 109

upvoted a paper about 1 month ago

How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data

Paper • 2604.14164 • Published Mar 23 • 35

upvoted a paper about 2 months ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published Apr 2 • 101

upvoted a paper 4 months ago

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

liked a model about 1 year ago

meta-llama/Llama-3.1-8B

Text Generation • 8B • Updated Oct 16, 2024 • 1.25M • 2.21k