Zhenheng Yang

zhenheny

https://zhenheny.github.io/

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 20 hours ago

ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations

Paper • 2606.11188 • Published 3 days ago • 23

upvoted a paper about 1 year ago

DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling

Paper • 2505.11196 • Published May 16, 2025 • 14

authored a paper about 1 year ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11, 2025 • 130

upvoted 2 papers about 1 year ago

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19, 2024 • 50

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11, 2025 • 130

authored a paper about 1 year ago

Long Context Tuning for Video Generation

Paper • 2503.10589 • Published Mar 13, 2025 • 14

authored 4 papers over 1 year ago

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published Jan 6, 2025 • 56

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 53

InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption

Paper • 2412.09283 • Published Dec 12, 2024 • 19

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19, 2024 • 50

authored a paper almost 2 years ago

OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Paper • 2407.02371 • Published Jul 2, 2024 • 55

upvoted a paper almost 2 years ago

OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Paper • 2407.02371 • Published Jul 2, 2024 • 55

liked a Space almost 3 years ago

Open LLM Leaderboard

🏆

14k

Track, rank and evaluate open LLMs and chatbots
