RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation β’ 8B β’ Updated 8 days ago β’ 25.2k β’ 9
view article Article Building Tensors from Scratch in Rust (Part 1.2): View Operations Jun 18, 2025 β’ 4
Running 593 Scaling test-time compute π 593 Run advanced search strategies to boost LLM problem solving
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B Text Generation β’ 8B β’ Updated May 29, 2025 β’ 146k β’ β’ 1.04k
Search-R1 Collection Preliminary checkpoints with outcome-only RL. β’ 15 items β’ Updated Aug 12, 2025 β’ 17
Skywork/Skywork-Reward-Llama-3.1-8B-v0.2 Text Classification β’ 8B β’ Updated Oct 25, 2024 β’ 29k β’ 39
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper β’ 2502.11089 β’ Published Feb 16, 2025 β’ 169
meta-llama/Llama-3.3-70B-Instruct Text Generation β’ 71B β’ Updated Dec 21, 2024 β’ 580k β’ β’ 2.67k