zhjiebin's picture

3 3

zhjiebin

zihao123

·

AI & ML interests

nlp

Recent Activity

liked a model 10 days ago

AngelSlim/Qwen3-8b-dflare

upvoted a paper 11 days ago

DFlare: Scaling Up Draft Capacity for Block Diffusion Speculative Decoding

liked a model 12 months ago

Qwen/Qwen3-30B-A3B

View all activity

Organizations

None yet

liked a model 10 days ago

AngelSlim/Qwen3-8b-dflare

Text Generation • 1B • Updated 15 days ago • 52 • 1

upvoted a paper 11 days ago

DFlare: Scaling Up Draft Capacity for Block Diffusion Speculative Decoding

Paper • 2606.02091 • Published 17 days ago • 1

liked a model 12 months ago

Qwen/Qwen3-30B-A3B

Text Generation • 31B • Updated Jul 26, 2025 • 2.11M • 900

upvoted a paper about 1 year ago

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20, 2025 • 49

upvoted a paper over 1 year ago

MPO: Boosting LLM Agents with Meta Plan Optimization

Paper • 2503.02682 • Published Mar 4, 2025 • 29

liked a Space over 1 year ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters