Beyond APIs: Probing the Limits of MLLMs in Physical Tool Use Paper • 2606.10803 • Published 4 days ago • 2
Beyond APIs: Probing the Limits of MLLMs in Physical Tool Use Paper • 2606.10803 • Published 4 days ago • 2
PhysTool-Bench Collection PhysTool-Bench is a benchmark that evaluates how well MLLMs perceive, select, and sequence PHYSICAL tools in real-world scenes. • 2 items • Updated 3 days ago • 1
PhysTool-Bench Collection PhysTool-Bench is a benchmark that evaluates how well MLLMs perceive, select, and sequence PHYSICAL tools in real-world scenes. • 2 items • Updated 3 days ago • 1
PhysTool-Bench Collection PhysTool-Bench is a benchmark that evaluates how well MLLMs perceive, select, and sequence PHYSICAL tools in real-world scenes. • 2 items • Updated 3 days ago • 1