Submitted by Chanyoung Kim 76 LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models Human-centered AI Laboratory 29 5