JackShu

Shuhuhuhu

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
Vero: An Open RL Recipe for General Visual Reasoning
upvoted a paper 4 days ago
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning
commented on a paper 5 months ago
SAIL-RL: Guiding MLLMs in When and How to Think via Dual-Reward RL Tuning

Organizations

BytedanceDouyinContent

authored a paper 7 months ago

SAIL-VL2 Technical Report

Paper • 2509.14033 • Published Sep 17, 2025 • 44
authored 4 papers over 1 year ago

Compress & Align: Curating Image-Text Data with Human Knowledge

Paper • 2312.06726 • Published Dec 11, 2023 • 1

Audio-Visual LLM for Video Understanding

Paper • 2312.06720 • Published Dec 11, 2023

HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models

Paper • 2403.13447 • Published Mar 20, 2024 • 19

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Paper • 2408.15881 • Published Aug 28, 2024 • 21