kennyKK's picture

3

kennyKK

kennykkk25

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

upvoted an article 4 months ago

Forge: Scalable Agent RL Framework and Algorithm

upvoted a paper over 1 year ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

View all activity

Organizations

No public activity