arxiv:2505.20152
Jiajie Zhang
NeoZ123
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
submitted a paper about 2 months ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
published a dataset about 2 months ago
THU-KEG/CaRR-DeepDive