From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 166
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 48
How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models Paper • 2604.21106 • Published Apr 27 • 9