Cosmos 3: Omnimodal World Models for Physical AI Paper • 2606.02800 • Published 10 days ago • 110
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Paper • 2507.10532 • Published Jul 14, 2025 • 90
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble Paper • 2401.16635 • Published Jan 30, 2024 • 1
Planning with Large Language Models for Code Generation Paper • 2303.05510 • Published Mar 9, 2023
codellama/CodeLlama-7b-Python-hf Text Generation • 7B • Updated Apr 12, 2024 • 4.49k • 146