Yuchen Cheng

rudeigerc

https://rudeigerc.dev

AI & ML interests

Kubernetes / LLMOps

Recent Activity

liked a model 3 days ago

zai-org/GLM-5

liked a model 3 months ago

deepseek-ai/DeepSeek-V3.2

upvoted a paper 3 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

View all activity

Organizations

None yet

liked a model 3 days ago

zai-org/GLM-5

Text Generation • 754B • Updated 1 day ago • 66.8k • • 1.15k

liked a model 3 months ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 329k • • 1.24k

upvoted a paper 3 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 123

liked a Space 4 months ago

The Smol Training Playbook

📚

2.98k

The secrets to building world-class LLMs

liked 2 models 4 months ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • Updated Nov 4, 2025 • 2.99M • 3.15k

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated Dec 23, 2025 • 450k • • 1.48k

liked a model 5 months ago

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • Updated Nov 18, 2025 • 31.4k • • 952

liked 8 models 6 months ago

liked a model 7 months ago

moonshotai/Kimi-K2-Instruct

Text Generation • Updated 16 days ago • 262k • • 2.32k

upvoted a paper 8 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

liked 2 models 8 months ago

MiniMaxAI/MiniMax-M1-80k

Text Generation • Updated Jul 7, 2025 • 24.7k • • 689

mistralai/Magistral-Small-2506

24B • Updated Jul 28, 2025 • 28.9k • 609

upvoted a paper 8 months ago

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28, 2025 • 45

Yuchen Cheng

AI & ML interests

Recent Activity

Organizations

rudeigerc's activity

The Smol Training Playbook