BaseReward: A Strong Baseline for Multimodal Reward Model Paper • 2509.16127 • Published Sep 19, 2025 • 21
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs Paper • 2602.12705 • Published about 1 month ago • 66
The Smol Training Playbook: The secrets to building world-class LLMs • 3.05k