colabfit/Halide_Perovskite_Ion_Migration_MLFF_neutral_iodide_interstitial Viewer • Updated 8 days ago • 2.01k • 22 • 1
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 23 days ago • 204
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid? Paper • 2605.06527 • Published May 7 • 45
Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models Paper • 2605.09681 • Published May 10 • 10
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 166
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 506
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 632