Takashi Ishida

tksii

https://takashiishida.github.io

Recent Activity

upvoted a paper 2 days ago

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness

Paper • 2604.02986 • Published Apr 3 • 2

authored 3 papers 3 days ago

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness

Paper • 2604.02986 • Published Apr 3 • 2

LLM Routing with Dueling Feedback

Paper • 2510.00841 • Published Oct 1, 2025

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Paper • 2606.07379 • Published 9 days ago • 5

upvoted 2 papers 4 days ago

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

Paper • 2505.18102 • Published May 23, 2025 • 2

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Paper • 2606.07379 • Published 9 days ago • 5

updated a dataset 15 days ago

ishidalab/capbencher

Viewer • Updated 15 days ago • 15.5k • 101 • 2

authored 2 papers 4 months ago

EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements

Paper • 2506.08762 • Published Jun 10, 2025

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

Paper • 2505.18102 • Published May 23, 2025 • 2

published a dataset 4 months ago

ishidalab/capbencher

Viewer • Updated 15 days ago • 15.5k • 101 • 2

updated a dataset 4 months ago

ishidalab/capbencher

Viewer • Updated 15 days ago • 15.5k • 101 • 2