CompassioninMachineLearning/PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch Text Generation • 8B • Updated Mar 15 • 35
CompassioninMachineLearning/PretrainingBasellama3kv3_plus3kcodingGRPO1epoch Text Generation • 8B • Updated Mar 12 • 34