JackShu's picture

JackShu

Shuhuhuhu

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Vero: An Open RL Recipe for General Visual Reasoning

upvoted a paper 4 days ago

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

commentedon a paper 5 months ago

SAIL-RL: Guiding MLLMs in When and How to Think via Dual-Reward RL Tuning

View all activity

Organizations

authored a paper 7 months ago

SAIL-VL2 Technical Report

Paper • 2509.14033 • Published Sep 17, 2025 • 44

authored 4 papers over 1 year ago

Compress & Align: Curating Image-Text Data with Human Knowledge

Paper • 2312.06726 • Published Dec 11, 2023 • 1

Audio-Visual LLM for Video Understanding

Paper • 2312.06720 • Published Dec 11, 2023

HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models

Paper • 2403.13447 • Published Mar 20, 2024 • 19

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Paper • 2408.15881 • Published Aug 28, 2024 • 21