OpenGVLab

community

https://github.com/opengvlab

Activity Feed Request to join this org

AI & ML interests

Computer Vision

Recent Activity

qishisuren submitted a paper 6 days ago

Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO

Eurayka authored a paper 10 days ago

InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning

linghan199 authored a paper 10 days ago

InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning

View all activity

Papers

Imagine Before You Predict: Interleaved Latent Visual Reasoning for Video Event Prediction

RIVER: A Real-Time Interaction Benchmark for Video LLMs

View all Papers

OpenGVLab 's papers 9

Submitted by

Tianxiang Jiang

Imagine Before You Predict: Interleaved Latent Visual Reasoning for Video Event Prediction

OpenGVLab

Submitted by

Yansong Shi

RIVER: A Real-Time Interaction Benchmark for Video LLMs

OpenGVLab

Submitted by

yinanhe

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

OpenGVLab

VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs

OpenGVLab

Submitted by

Long Cui

ViCO: A Training Strategy towards Semantic Aware Dynamic High-Resolution

OpenGVLab

Submitted by

Yicheng Xu

ExpVid: A Benchmark for Experiment Video Understanding & Reasoning

OpenGVLab

Submitted by

Changyao Tian

NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints

OpenGVLab

Submitted by

Songze Li

Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale

OpenGVLab

Submitted by

Cao Yue

Sequential Diffusion Language Models

OpenGVLab