Kevin

kvnptl

·

kvnptl

AI & ML interests

Robot perception

Recent Activity

liked a Space about 1 month ago

lerobot/visualize_dataset

updated a dataset about 1 month ago

kvnptl/so101-teleop-vials-to-rack-real

published a dataset about 1 month ago

kvnptl/so101-teleop-vials-to-rack-real

View all activity

Organizations

None yet

liked a Space about 1 month ago

Visualize Dataset (v2.0+ latest dataset format)

Explore and visualize LeRobot datasets

updated a dataset about 1 month ago

kvnptl/so101-teleop-vials-to-rack-real

Viewer • Updated Jun 17 • 106k • 130

published a dataset about 1 month ago

kvnptl/so101-teleop-vials-to-rack-real

Viewer • Updated Jun 17 • 106k • 130

liked a model about 1 month ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated Jun 12 • 1.34M • 2.78k

updated a dataset about 2 months ago

kvnptl/so101-teleop-ballpen-to-rack-real

Viewer • Updated Jun 5 • 54.8k • 41

published a dataset about 2 months ago

kvnptl/so101-teleop-ballpen-to-rack-real

Viewer • Updated Jun 5 • 54.8k • 41

liked 2 models 4 months ago

nvidia/Cosmos-Predict2.5-2B

Updated 27 days ago • 26.2k • 148

nvidia/Cosmos-Guardrail1

Updated Apr 1, 2025 • 1.22k • 29

liked a model 5 months ago

nvidia/Cosmos-Embed1-448p

1B • Updated Mar 13 • 4.76k • 12

upvoted 3 articles 5 months ago

Article

Deploying Open Source Vision Language Models (VLM) on Jetson

nvidia

•

Feb 24

• 37

Article

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

nvidia

•

Jan 5

• 64

Article

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

nvidia

•

Jan 29

• 49

liked a model 9 months ago

facebook/EdgeTAM

Updated Apr 30, 2025 • 4 • 31

liked a Space 9 months ago

HunyuanWorld-Mirror

Universal 3D World Reconstruction with Any Prior Prompting

liked a Space over 1 year ago

RF-DETR

SOTA real-time object detection model

upvoted an article over 1 year ago

Article

SmolVLM - small yet mighty Vision Language Model

+3

andito, merve, mfarre, eliebak, pcuenq

•

Nov 26, 2024

• 426

reacted to maxiw's post with 👍 over 1 year ago

Post

3931

The new Qwen-2 VL models seem to perform quite well in object detection. You can prompt them to respond with bounding boxes in a reference frame of 1k x 1k pixels and scale those boxes to the original image size.

You can try it out with my space maxiw/Qwen2-VL-Detection

6 replies

·

upvoted an article over 1 year ago

Article

Welcome PaliGemma 2 – New vision language models by Google

+2

merve, andsteing, pcuenq, ariG23498

•

Dec 5, 2024

• 168

liked 2 datasets over 1 year ago

uoft-cs/cifar10

Viewer • Updated Jan 4, 2024 • 60k • 453k • 115

Francesco/animals-ij5d2

Viewer • Updated Mar 30, 2023 • 1k • 104 • 16