1 17 22

Yang Lin

Yang18

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

upvoted a paper 1 day ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

upvoted a paper 4 months ago

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 2 days ago • 42

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 9 days ago • 189

upvoted a paper 4 months ago

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Paper • 2602.12205 • Published Feb 13 • 83

New activity in Tongyi-MAI/Z-Image-Turbo 7 months ago

about the model size compared to flux

#12 opened 7 months ago by

Yang18

liked a model 7 months ago

Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated Jan 30 • 823k • • 4.83k

liked a Space 9 months ago

HunyuanImage-3.0

📊

182

Generate images from text prompts (PRO users only)

liked a Space 11 months ago

3DGen Leaderboard

😻

Display 3D model evaluation leaderboard

upvoted a paper 11 months ago

Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity

Paper • 2508.05609 • Published Aug 7, 2025 • 29

liked a Space 11 months ago

Qwen Image

💻

QwenImage

liked 2 Spaces about 1 year ago

Qwen3 Demo

📊

857

Chat with an AI assistant that thinks before answering

LBM Relighting

✨

421

Fast image relighting using Latent Bridge Matching

liked 2 models about 1 year ago

city96/FLUX.1-dev-gguf

Text-to-Image • 12B • Updated Aug 18, 2024 • 132k • 1.36k

mikeyandfriends/PixelWave_FLUX.1-dev_03

Text-to-Image • 12B • Updated Nov 5, 2024 • 895 • 194

upvoted 4 papers over 1 year ago

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9, 2025 • 44

liked a model over 1 year ago

tencent/HunyuanVideo

Text-to-Video • Updated Mar 6, 2025 • 1k • • 2.2k

upvoted 2 papers over 1 year ago

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published Jan 6, 2025 • 43

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 97