chawdoe
AI & ML interests
None yet
Recent Activity
liked a model 21 days ago: stepfun-ai/Step-3.5-Flash
upvoted a paper 4 months ago: QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
upvoted a paper 5 months ago: Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards