CompassioninMachineLearning/PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch Text Generation • 8B • Updated 29 days ago • 141
CompassioninMachineLearning/PretrainingBasellama3kv3_plus3kcodingGRPO1epoch Text Generation • 8B • Updated Mar 12 • 38
CompassioninMachineLearning/Instruct8b_constitutitutionfinetune_step200 Text Generation • 8B • Updated Mar 2 • 3