Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests Paper • 2606.07379 • Published 9 days ago • 5
Mitigating Reward Hacking in RLHF via Advantage Sign Robustness Paper • 2604.02986 • Published Apr 3 • 2
A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP Paper • 2505.16661 • Published May 22, 2025 • 1