Yitong Li
Lytttttt
ยท
AI & ML interests
None yet
Recent Activity
new activity 10 days ago
xlangai/osworld_v2_tasks:Fix evaluators: tasks 077 (L0 false positive) + 079/087/096 (L2 robustness) updated a dataset about 1 month ago
xlangai/osworld2.0_human_crosscheck new activity about 1 month ago
xlangai/osworld2.0_human_crosscheck:Add task_004 eval.log (manual cross-check)