Measuring Maximum Activations in Open Large Language Models Paper • 2605.15572 • Published 5 days ago • 16
Measuring Maximum Activations in Open Large Language Models Paper • 2605.15572 • Published 5 days ago • 16
EndPrompt: Efficient Long-Context Extension via Terminal Anchoring Paper • 2605.14589 • Published 6 days ago • 12
Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security Paper • 2401.05459 • Published May 8, 2024
SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget Paper • 2308.15030 • Published May 29, 2024
EndPrompt: Efficient Long-Context Extension via Terminal Anchoring Paper • 2605.14589 • Published 6 days ago • 12
LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design Paper • 2405.17741 • Published May 28, 2024
PatchBackdoor: Backdoor Attack against Deep Neural Networks without Model Modification Paper • 2308.11822 • Published Aug 22, 2023
FlexSpec: Frozen Drafts Meet Evolving Targets in Edge-Cloud Collaborative LLM Speculative Decoding Paper • 2601.00644 • Published Jan 2
Measuring Maximum Activations in Open Large Language Models Paper • 2605.15572 • Published 5 days ago • 16
EndPrompt: Efficient Long-Context Extension via Terminal Anchoring Paper • 2605.14589 • Published 6 days ago • 12
AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning Paper • 2605.00425 • Published 12 days ago • 22