Yansong Shi

nanamma

https://huggingface.co/nanamma

AI & ML interests

Multimodality, video understanding, robotics

Recent Activity

new activity about 15 hours ago

nanamma/RIVER: Add task categories and link to paper

updated a dataset about 15 hours ago

OpenGVLab/RIVER

published a dataset 1 day ago

OpenGVLab/RIVER

Organizations

New activity in nanamma/RIVER about 15 hours ago

Add task categories and link to paper

#2 opened about 15 hours ago by

nielsr

updated a dataset about 15 hours ago

OpenGVLab/RIVER

Updated about 15 hours ago • 12

published a dataset 1 day ago

OpenGVLab/RIVER

Updated about 15 hours ago • 12

authored a paper 1 day ago

RIVER: A Real-Time Interaction Benchmark for Video LLMs

Paper • 2603.03985 • Published 2 days ago • 4

submitted a paper to Daily Papers 1 day ago

RIVER: A Real-Time Interaction Benchmark for Video LLMs

Paper • 2603.03985 • Published 2 days ago • 4

upvoted a paper 1 day ago

RIVER: A Real-Time Interaction Benchmark for Video LLMs

Paper • 2603.03985 • Published 2 days ago • 4

updated a dataset 1 day ago

nanamma/RIVER

Updated about 15 hours ago • 34

upvoted a paper 3 months ago

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

Paper • 2512.01342 • Published Dec 1, 2025 • 18

authored 2 papers 5 months ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Paper • 2403.15377 • Published Mar 22, 2024 • 29

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Paper • 2410.19702 • Published Oct 25, 2024 • 1

New activity in qiukingballball/RoboCerebra 7 months ago

how to test

#4 opened 7 months ago by

nanamma

published a dataset 10 months ago

nanamma/RIVER

Updated about 15 hours ago • 34

liked a dataset about 1 year ago

Mutonix/Vript

Viewer • Updated Jun 11, 2024 • 409k • 3.98k • 24

New activity in Enxin/MovieChat-1K_train over 1 year ago

so many quote '"' in captions in json files

#2 opened over 1 year ago by

nanamma

updated a collection over 1 year ago

VideoChat

Collection

Chat-Centric Video Understanding • 8 items • Updated Sep 28, 2025 • 3

updated 2 models over 1 year ago

OpenGVLab/ViCLIP-L-14-hf

0.4B • Updated Sep 17, 2024 • 5.03k • 1

OpenGVLab/ViCLIP-B-16-hf

0.1B • Updated Sep 17, 2024 • 133 • 1

updated a collection over 1 year ago

InternVid

Collection

A Large-Scale Video-Text Dataset • 7 items • Updated Sep 28, 2025
