view article Article VLX-Go: Vision-Language Short-Horizon Waypoint Prediction for Embodied Navigation omlab β’ 2 days ago β’ 10
view article Article VLX-Go: Vision-Language Short-Horizon Waypoint Prediction for Embodied Navigation omlab β’ 2 days ago β’ 10
view article Article VLX-Seek: Improving VLM Fine-Grained Perception via Region Reference Instead of Coordinate Generation omlab β’ 3 days ago β’ 12
view article Article VLX-Seek: Improving VLM Fine-Grained Perception via Region Reference Instead of Coordinate Generation omlab β’ 3 days ago β’ 12
view article Article VLX-Flow: Continuous Video Understanding for Real-Time Multimodal Interaction omlab β’ 4 days ago β’ 12
Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models Paper β’ 2605.28132 β’ Published May 27 β’ 25
Configuration error Agents Featured 41 SAM3 VLM-FO1 π 41 Complex text label dection using SAM3 with VLM-FO1
Configuration error Agents Featured 41 SAM3 VLM-FO1 π 41 Complex text label dection using SAM3 with VLM-FO1
Configuration error Agents Featured 41 SAM3 VLM-FO1 π 41 Complex text label dection using SAM3 with VLM-FO1
view article Article ImprovingΒ ObjectΒ DetectionΒ throughΒ ReinforcementΒ LearningΒ withΒ VLM-R1 omlab β’ Mar 25, 2025 β’ 3