ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published 12 days ago • 38
rubricreward/mR3-Qwen3-14B-tgt-prompt-tgt-thinking-translated Text Generation • 15B • Updated Oct 2, 2025 • 7
rubricreward/mR3-Qwen3-14B-tgt-prompt-tgt-thinking-translated Text Generation • 15B • Updated Oct 2, 2025 • 7