Takashi Ishida
tksii
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
Mitigating Reward Hacking in RLHF via Advantage Sign Robustness authored a paper 2 days ago
Mitigating Reward Hacking in RLHF via Advantage Sign Robustness authored a paper 2 days ago
LLM Routing with Dueling Feedback