Reasoning
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper
• 2408.06195
• Published
• 73
Thinking LLMs: General Instruction Following with Thought Generation
Paper
• 2410.10630
• Published
• 20
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Paper
• 2310.13332
• Published
• 16
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
Paper
• 2412.16849
• Published
• 9
o1-Coder: an o1 Replication for Coding
Paper
• 2412.00154
• Published
• 44
SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation
Paper
• 2411.11053
• Published
• 4
Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay
Paper
• 2410.12236
• Published
• 1
Efficiently Serving LLM Reasoning Programs with Certaindex
Paper
• 2412.20993
• Published
• 36
• Updated
• 753k • 3.75k
• 526
nvidia/OpenCodeReasoning-2
• Updated
• 2.16M • 1.81k
• 50