SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories Paper • 2606.01311 • Published 3 days ago • 20
Exploring Autonomous Agentic Data Engineering for Model Specialization Paper • 2605.30407 • Published 6 days ago • 19
LongDS-Bench: On the Failure of Long-Horizon Agentic Data Analysis Paper • 2605.30434 • Published 6 days ago • 17
When Should Models Change Their Minds? Contextual Belief Management in Large Language Models Paper • 2605.30219 • Published 6 days ago • 22
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning Paper • 2605.30260 • Published 6 days ago • 37
Rethinking Memory as Continuously Evolving Connectivity Paper • 2605.28773 • Published 7 days ago • 32
MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems Paper • 2605.28732 • Published 7 days ago • 39
SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research Paper • 2605.22878 • Published 14 days ago • 58
OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models Paper • 2605.00877 • Published Apr 25 • 15
Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis Paper • 2604.24198 • Published Apr 27 • 22
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language Paper • 2604.19667 • Published Apr 21 • 22
OceanPile Collection A Large-Scale Multimodal Ocean Corpus for Foundation Models • 8 items • Updated 15 days ago • 2
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published Apr 4 • 38
SkillX: Automatically Constructing Skill Knowledge Bases for Agents Paper • 2604.04804 • Published Apr 6 • 35
How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities Paper • 2603.02578 • Published Mar 3 • 25