When Models Manipulate Manifolds: The Geometry of a Counting Task Paper • 2601.04480 • Published Jan 8 • 4
unsloth/Qwen3-Coder-Next-GGUF Text Generation • 80B • Updated about 18 hours ago • 502k • 397
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 Text Generation • 18B • Updated 3 days ago • 259k • 101
mlx-community/Jan-v3-4B-base-instruct-4bit Text Generation • 0.6B • Updated 28 days ago • 389 • 2
mlx-community/Jan-v3-4B-base-instruct-8bit Text Generation • 1B • Updated 28 days ago • 194 • 3
view article Article Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models Jan 6 • 23
meituan-longcat/LongCat-Flash-Thinking-2601 Text Generation • 562B • Updated Jan 23 • 4.37k • 102
unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF Text Generation • 80B • Updated Jan 14 • 44.3k • 165
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 181
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 120
view article Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face Dec 5, 2025 • 43