view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 18 days ago • 108
view article Article The Open Source Community is backing OpenEnv for Agentic RL +15 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego • 8 days ago • 80
Intelligence per Watt: Measuring Intelligence Efficiency of Local AI Paper • 2511.07885 • Published Nov 11, 2025 • 16
view article Article Designing the hf CLI as an agent-optimized way to work with the Hub celinah, Wauplin • 12 days ago • 56
view article Article Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains JetBrains • 14 days ago • 31
Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 43 items • Updated 24 days ago • 46
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published Feb 13 • 36
view article Article Harness, Scaffold, and the AI Agent Terms Worth Getting Right sergiopaniego, ariG23498 • 22 days ago • 108
view article Article Personal Copilot: Train Your Own Coding Assistant smangrul, sayakpaul • Oct 27, 2023 • 79
🍎 Qwopus3.6 Collection This collection features the advanced Qwopus3.6 series of multimodal large models, which are fine-tuned from the Qwen3.6 base models with a focus on e • 10 items • Updated 23 days ago • 65
Qwopus3.5-v3.5/v3 Collection 🌟Qwopus3.5-v3.5 is the latest model in the Claude series. • 14 items • Updated 24 days ago • 106
Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction Paper • 2508.03613 • Published Aug 5, 2025 • 16
Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published Feb 10 • 201
view article Article We Got Claude to Build CUDA Kernels and teach open models! +2 burtenshaw, evalstate, merve, pcuenq • Jan 28 • 157
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone Paper • 2512.22615 • Published Dec 27, 2025 • 51