Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Weilin Zhao's picture
4 7 21

Weilin Zhao

Achazwl
Lynncc6's profile picture BryantMcGill's profile picture shuyuej's profile picture
·
https://weilin-zhao.com
  • acha_William_
  • Achazwl

AI & ML interests

Efficient LLM

Organizations

OpenBMB's profile picture

upvoted a paper 4 months ago

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Paper • 2601.15165 • Published Jan 21 • 74
upvoted a paper 8 months ago

InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation

Paper • 2509.24663 • Published Sep 29, 2025 • 18
upvoted a collection 11 months ago

FR-Spec

Collection
Released ckpt for arxiv.org/abs/2502.14856 • 6 items • Updated Jul 2, 2025 • 1
upvoted a collection 12 months ago

MiniCPM4

Collection
MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated 6 days ago • 85
upvoted a paper 12 months ago

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9, 2025 • 98
upvoted 2 papers about 1 year ago

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

Paper • 2502.12085 • Published Feb 17, 2025 • 4

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Paper • 2502.14856 • Published Feb 20, 2025 • 8
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs