arxiv:2606.07379
Thanawat Lodkaew
skydddoogg
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
Mitigating Reward Hacking in RLHF via Advantage Sign Robustness updated a dataset 1 day ago
ishidalab/capcode authored a paper 13 days ago
Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests