Submitted by
Yansong Shi
AI & ML interests
Computer Vision
Recent Activity
View all activity
Papers
RIVER: A Real-Time Interaction Benchmark for Video LLMs
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision