🚀 Qwen-MTP Collection ⚡ MTP (Multi-Token Prediction) speculative decoding lets models like Qwen3.6 generate ~1.4-2.2x faster with no change in accuracy. • 7 items • Updated 1 day ago • 25
💻 Qwopus-Coder Collection Reasoning-distilled coding models optimized for specialized domains like agentic workflows. • 7 items • Updated 1 day ago • 17
tvall43/Qwen3.5-14B-A3B-Claude-4.6-Opus-Reasoning-Distilled-reap-gguf Text Generation • 14B • Updated Mar 9 • 5.38k • 46
Jackrong/MLX-Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-6bit Text Generation • 9B • Updated Mar 7 • 401 • 8
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 95.9k • 2.88k
Jackrong/MLX-Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-4bit Text Generation • 1B • Updated Mar 19 • 1.65k • 18
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models Paper • 2407.09025 • Published Jul 12, 2024 • 140
The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 Explore synthetic data benchmarks via an interactive bookshelf • Running on CPU Upgrade • 248
mlx-community/Huihui-Qwen3.5-35B-A3B-abliterated-6bit Text Generation • 35B • Updated Mar 4 • 240 • 4