Yihan Bian's picture

Yihan Bian

ybian-umd

·

AI & ML interests

None yet

Recent Activity

authored a paper about 17 hours ago

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

authored a paper 8 months ago

SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation

authored a paper 8 months ago

SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation

View all activity

Organizations

authored a paper about 17 hours ago

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

Paper • 2606.16700 • Published 13 days ago • 14

authored 3 papers 8 months ago

SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation

Paper • 2510.06303 • Published Oct 7, 2025 • 15

SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation

Paper • 2510.06303 • Published Oct 7, 2025 • 15

SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation

Paper • 2510.06303 • Published Oct 7, 2025 • 15

upvoted a paper 8 months ago

SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation

Paper • 2510.06303 • Published Oct 7, 2025 • 15

updated 5 models 8 months ago

JetLM/SDAR-30B-A3B-Sci

Text Generation • 31B • Updated Oct 21, 2025 • 6 • 1

JetLM/SDAR-30B-A3B-Chat

Text Generation • 31B • Updated Oct 21, 2025 • 120 • 2

JetLM/SDAR-8B-Chat

Text Generation • 8B • Updated Oct 21, 2025 • 2.02k • 4

JetLM/SDAR-4B-Chat

Text Generation • 4B • Updated Feb 13 • 46 • 2

JetLM/SDAR-1.7B-Chat

Text Generation • 2B • Updated Feb 13 • 3.9k • 7

upvoted 2 articles about 1 year ago

Article

The Annotated Diffusion Model

nielsr, kashif

•

Jun 7, 2022

• 362

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 295

New activity in ybian-umd/Qwen2.5-7B-Instruct-gsm8k-1 about 1 year ago

Improve language tag

#1 opened about 1 year ago by

New activity in ybian-umd/Qwen2.5-7B-Instruct-gsm8k-2 about 1 year ago

Improve language tag

#1 opened about 1 year ago by

New activity in ybian-umd/Qwen2.5-7B-Instruct-gsm8k-3 about 1 year ago

Improve language tag

#1 opened about 1 year ago by

New activity in ybian-umd/Qwen2.5-7B-Instruct-gsm8k-4 about 1 year ago

Improve language tag

#1 opened about 1 year ago by

New activity in ybian-umd/Qwen2.5-3B-Instruct-gsm8k-3 about 1 year ago

Improve language tag

#1 opened about 1 year ago by

New activity in ybian-umd/Qwen2.5-3B-Instruct-gsm8k-4 about 1 year ago

Improve language tag

#1 opened about 1 year ago by

New activity in ybian-umd/Qwen2.5-3B-Instruct-gsm8k-6 about 1 year ago

Improve language tag

#1 opened about 1 year ago by

updated a model over 1 year ago

ybian-umd/gemma-2-2b-it-gsm8k-1

Updated Oct 31, 2024 • 1