Running on CPU Upgrade Featured 3.04k The Smol Training Playbook 📚 3.04k The secrets to building world-class LLMs
moonshotai/Kimi-Linear-48B-A3B-Instruct Text Generation • 49B • Updated Dec 16, 2025 • 42.3k • 549
view article Article You could have designed state of the art positional encoding Nov 25, 2024 • 454
view article Article A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons Feb 4, 2025 • 31
Running 3.74k The Ultra-Scale Playbook 🌌 3.74k The ultimate guide to training LLM on large GPU Clusters
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 252