Submitted by
Junjie Ye
AI & ML interests
Natural Language Processing
Papers
CCTU: A Benchmark for Tool Use under Complex Constraints
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
Natural Language Processing
CCTU: A Benchmark for Tool Use under Complex Constraints
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning