Xuanfei Ren
xuanfeiren
AI & ML interests
RL and LLM
Recent Activity
upvoted a paper 14 minutes ago
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States
authored a paper 5 days ago
POLCA: Stochastic Generative Optimization with LLM
upvoted a paper 6 days ago
Needle In A Multimodal Haystack
Organizations
None yet