arxiv:2505.20152
Jiajie Zhang
NeoZ123
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 4 hours ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
submitted
a paper
about 4 hours ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
published
a dataset
1 day ago
THU-KEG/CaRR-DeepDive