whaleL's Collections

MedicalGPT

Post-trained models for the medical domain: SFT, reward model, GRPO