SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published 26 days ago • 60
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 3 days ago • 123
REAP the Experts: Why Pruning Prevails for One-Shot MoE compression Paper • 2510.13999 • Published Oct 15, 2025 • 14
Ming-V2 Collection Ming is the multi-modal series of any-to-any models developed by Ant Ling team. • 14 items • Updated 14 days ago • 35
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30, 2025 • 547
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 509
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 268
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26, 2025 • 188
Democratizing AI scientists using ToolUniverse Paper • 2509.23426 • Published Sep 27, 2025 • 40
Bark Collection Bark is a transformer-based text-to-audio model created by Suno. Currently, two checkpoints are supported: a small and a large version. • 3 items • Updated Sep 14, 2023 • 20