Dhruv's picture

Dhruv

prieuredesion

AI & ML interests

None yet

Organizations

None yet

upvoted an article 12 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

+7

danaaubakirova, andito, merve, ariG23498, fracapuano, loubnabnl, pcuenq, mshukor, cadene

•

Jun 3, 2025

• 348

upvoted a collection over 1 year ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated Mar 2 • 90

upvoted 2 papers almost 3 years ago

3D-LLM: Injecting the 3D World into Large Language Models

Paper • 2307.12981 • Published Jul 24, 2023 • 40

ViNT: A Foundation Model for Visual Navigation

Paper • 2306.14846 • Published Jun 26, 2023 • 7