Guan Yiran

Catalan258

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

upvoted a paper 15 days ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

published a dataset 18 days ago

Catalan258/thinkomni_eval

Organizations

None yet

upvoted a paper 11 days ago

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 45

upvoted a paper 15 days ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published 20 days ago • 107

published a dataset 18 days ago

Catalan258/thinkomni_eval

Viewer • Updated 18 days ago • 1k • 1.1k

updated a dataset 18 days ago

Catalan258/thinkomni_eval

Viewer • Updated 18 days ago • 1k • 1.1k

upvoted a paper 18 days ago

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Paper • 2603.19235 • Published 18 days ago • 94

upvoted 2 papers 20 days ago

Towards Generalizable Robotic Manipulation in Dynamic Environments

Paper • 2603.15620 • Published 21 days ago • 3

Mixture-of-Depths Attention

Paper • 2603.15619 • Published 21 days ago • 79

liked a dataset 21 days ago

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated 23 days ago • 1.62M • 56k • 298

submitted a paper to Daily Papers 22 days ago

Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously

Paper • 2603.12262 • Published 25 days ago • 30

authored a paper 24 days ago

Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously

Paper • 2603.12262 • Published 25 days ago • 30

upvoted a paper 24 days ago

Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously

Paper • 2603.12262 • Published 25 days ago • 30

published a model 26 days ago

Catalan258/VST-7B

8B • Updated 26 days ago • 27

updated a model 26 days ago

Catalan258/VST-7B

8B • Updated 26 days ago • 27

upvoted a paper about 2 months ago

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Paper • 2602.09849 • Published Feb 10 • 16

upvoted 2 papers 5 months ago

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Paper • 2511.13026 • Published Nov 17, 2025 • 26

HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration

Paper • 2510.27266 • Published Oct 31, 2025 • 21

liked a dataset 5 months ago

nvidia/Nemotron-VLM-Dataset-v2

Viewer • Updated Dec 18, 2025 • 4.58M • 3.89k • 87

upvoted a paper 7 months ago

Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle

Paper • 2508.05612 • Published Aug 7, 2025 • 2

