view article Article Prompt Phrasing Shifts Model Performance More Than Tier — and Which LLMs Will Get Your Jokes Wayfinder6 • 3 days ago • 2
Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents Paper • 2606.06036 • Published 26 days ago • 75
FastContext: Training Efficient Repository Explorer for Coding Agents Paper • 2606.14066 • Published 18 days ago • 93
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement Paper • 2606.11926 • Published 20 days ago • 121
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 19 days ago • 142
Measuring Epistemic Resilience of LLMs Under Misleading Medical Context Paper • 2606.12291 • Published 20 days ago • 60
Learning from the Self-future: On-policy Self-distillation for dLLMs Paper • 2606.18195 • Published 14 days ago • 76
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding? Paper • 2606.08063 • Published 24 days ago • 81
Redesign Mixture-of-Experts Routers with Manifold Power Iteration Paper • 2606.12397 • Published 20 days ago • 89
VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models Paper • 2606.16140 • Published 15 days ago • 119
Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings Paper • 2606.07502 • Published 25 days ago • 99
SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research Paper • 2606.09730 • Published 22 days ago • 54
Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts Paper • 2606.05922 • Published 25 days ago • 69
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 26 days ago • 66
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks Paper • 2606.03458 • Published 28 days ago • 67