kennyKK
kennykkk25
AI & ML interests
None yet
Recent Activity
upvoted a paper 18 days ago
Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning upvoted an article 4 months ago
Forge: Scalable Agent RL Framework and Algorithm upvoted a paper over 1 year ago
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time
Scaling