ReFreeKV: Towards Threshold-Free KV Cache Compression Paper • 2502.16886 • Published 8 days ago • 47
BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding Paper • 2606.31315 • Published 4 days ago • 71
Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs Paper • 2606.32032 • Published 4 days ago • 21
AsyncOPD: How Stale Can On-Policy Distillation Be? Paper • 2606.24143 • Published 11 days ago • 29
MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training Paper • 2606.30406 • Published 5 days ago • 11
JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting Paper • 2606.18394 • Published 9 days ago • 35
Information-Aware KV Cache Compression for Long Reasoning Paper • 2606.26875 • Published 9 days ago • 11
The Tatoxa System for Text Detoxification in Low-Resource Languages: The Case of Tatar Paper • 2606.26015 • Published 10 days ago • 10
Formalizing Latent Thoughts: Four Axioms of Thought Representation in LLMs Paper • 2606.27378 • Published May 7 • 58
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation Paper • 2502.21074 • Published Feb 28, 2025 • 5
Demystifying Training-Time Augmentation for Data-Constrained Language Model Pretraining Paper • 2606.16246 • Published 15 days ago • 4
Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation Paper • 2606.18844 • Published 17 days ago • 18
Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models Paper • 2606.16700 • Published 19 days ago • 14
RepSelect: Robust LLM Unlearning via Representation Selectivity Paper • 2606.17168 • Published 19 days ago • 5