SABER: Benchmarking Operational Safety of LLM Coding Agents in Stateful Project Workspaces Paper • 2606.01317 • Published 17 days ago
SABER: Benchmarking Operational Safety of LLM Coding Agents in Stateful Project Workspaces Paper • 2606.01317 • Published 17 days ago