arxiv:2510.04550
Pengfei He
bigboss24
AI & ML interests
Trustworthy
Recent Activity
authored a paper about 22 hours ago
TRAJECT-Bench: A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
upvoted a paper 1 day ago
TRAJECT-Bench: A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
upvoted a paper 1 day ago
Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents
Organizations
None yet