2 6 2

Yunheng Li

lyhisme

AI & ML interests

None yet

Recent Activity

upvoted a paper about 24 hours ago

Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions

submitted a paper 1 day ago

Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions

updated a dataset 1 day ago

AudioVisual-Caption/ASID-1M

View all activity

Organizations

upvoted a paper about 24 hours ago

Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions

Paper • 2602.13013 • Published 4 days ago • 7

submitted a paper to Daily Papers 1 day ago

Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions

Paper • 2602.13013 • Published 4 days ago • 7

updated a dataset 1 day ago

AudioVisual-Caption/ASID-1M

Viewer • Updated 1 day ago • 241k • 67 • 3

updated 2 models 1 day ago

AudioVisual-Caption/ASID-Captioner-7B

Image-Text-to-Text • 9B • Updated 1 day ago • 13 • 1

AudioVisual-Caption/ASID-Captioner-3B

Image-Text-to-Text • 5B • Updated 1 day ago • 17 • 1

published 2 models 3 days ago

AudioVisual-Caption/ASID-Captioner-3B

Image-Text-to-Text • 5B • Updated 1 day ago • 17 • 1

AudioVisual-Caption/ASID-Captioner-7B

Image-Text-to-Text • 9B • Updated 1 day ago • 13 • 1

updated a Space 6 days ago

ASID-Caption

🦉

published a Space 6 days ago

ASID-Caption

🦉

liked a dataset 6 days ago

AudioVisual-Caption/ASID-1M

Viewer • Updated 1 day ago • 241k • 67 • 3

published a dataset 7 days ago

AudioVisual-Caption/ASID-1M

Viewer • Updated 1 day ago • 241k • 67 • 3

upvoted a paper 4 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 109

authored a paper 5 months ago

TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs

Paper • 2509.18056 • Published Sep 22, 2025 • 27

upvoted a paper 5 months ago

TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs

Paper • 2509.18056 • Published Sep 22, 2025 • 27

commented a paper 5 months ago

TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs

Paper • 2509.18056 • Published Sep 22, 2025 • 27 •

authored 4 papers 5 months ago

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Paper • 2406.00670 • Published Jun 2, 2024

Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction

Paper • 2412.06244 • Published Dec 9, 2024

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Paper • 2508.01548 • Published Aug 3, 2025 • 14

Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment

Paper • 2508.08811 • Published Aug 12, 2025 • 2

upvoted a paper 5 months ago

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Paper • 2508.01548 • Published Aug 3, 2025 • 14

Yunheng Li

AI & ML interests

Recent Activity

Organizations

lyhisme's activity

ASID-Caption

ASID-Caption