view article Article Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models 11 days ago โข 13
view article Article ๐๏ธ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do 30 days ago โข 38