Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights Paper • 2603.12228 • Published 6 days ago • 10
Meta-Reinforcement Learning with Self-Reflection for Agentic Search Paper • 2603.11327 • Published 7 days ago • 8
Attention Sinks Are Provably Necessary in Softmax Transformers: Evidence from Trigger-Conditional Tasks Paper • 2603.11487 • Published 6 days ago • 2