Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes — Paper • 2603.25562 • Published 7 days ago
The Ultra-Scale Playbook 🌌 — Space • The ultimate guide to training LLMs on large GPU clusters
MemEvolve: Meta-Evolution of Agent Memory Systems — Paper • 2512.18746 • Published Dec 21, 2025
Scaling Latent Reasoning via Looped Language Models — Paper • 2510.25741 • Published Oct 29, 2025
The Smol Training Playbook 📚 — Space • The secrets to building world-class LLMs
⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch — Article • Published Jun 28, 2025
Inference-Time Scaling for Generalist Reward Modeling — Paper • 2504.02495 • Published Apr 3, 2025