Starstrek's picture

1707 1411

Starstrek

Stars321123

·

Stars321

AI & ML interests

AI

Recent Activity

upvoted a paper about 12 hours ago

Reinforcement World Model Learning for LLM-based Agents

upvoted a paper about 12 hours ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

upvoted a paper about 13 hours ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

View all activity

Organizations

Stars321123 's collections 1