arxiv:2603.09079
Md Selim Sarowar
selim-sarowar
AI & ML interests
Vision Language Action Models, World Models, 5D Robot Manipulation, 3D Computer Vision
Recent Activity
authored a paper about 3 hours ago
GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models
upvoted a paper about 7 hours ago
GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models
upvoted a paper 1 day ago
Unified Vision-Language-Action Model
Organizations
None yet