Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation Paper β’ 2606.06428 β’ Published 16 days ago β’ 25
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper β’ 2603.21065 β’ Published Mar 22 β’ 78
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation Paper β’ 2602.03619 β’ Published Feb 3 β’ 28
meituan-longcat/LongCat-Flash-Thinking-ZigZag Text Generation β’ 562B β’ Updated Feb 2 β’ 40 β’ 32
AMO-Bench: Large Language Models Still Struggle in High School Math Competitions Paper β’ 2510.26768 β’ Published Oct 30, 2025 β’ 36
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion Paper β’ 2509.01215 β’ Published Sep 1, 2025 β’ 52
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper β’ 2508.14460 β’ Published Aug 20, 2025 β’ 86
view article Article π€ππ¬π₯οΈπ Kimi-VL-A3B-Thinking-2506: A Quick Navigation moonshotai β’ Jun 21, 2025 β’ 77
A Survey of Context Engineering for Large Language Models Paper β’ 2507.13334 β’ Published Jul 17, 2025 β’ 264
RegMix: Data Mixture as Regression for Language Model Pre-training Paper β’ 2407.01492 β’ Published Jul 1, 2024 β’ 41