duygunuryldz/GRPO_Qwen3-4B_aegis
duygunuryldz/GRPON_Qwen3-4B_csqa
duygunuryldz/GRPON_Qwen3-4B_aegis
duygunuryldz/myGRPO_minus_adv_norm_Qwen3-4B_aegis
duygunuryldz/GRPON_Qwen3-1.7B_aegis
duygunuryldz/myGRPO_norm_adv_Qwen3-4B_aegis
duygunuryldz/GRPO_Qwen3-8B_aegis
duygunuryldz/myGRPON_Qwen3-4B_aegis
duygunuryldz/myGRPO_Qwen3-4B_aegis
duygunuryldz/myGRPO2_Qwen3-4B_aegis
duygunuryldz/trainer_output
duygunuryldz/5GRPO_Qwen3-4B_aegis
duygunuryldz/5GRPO_Qwen3-4B_csqa
duygunuryldz/5GRPO_Qwen3-1.7B_csqa
duygunuryldz/5GRPO_Qwen3-1.7B_aegis
duygunuryldz/1GRPO_Qwen3-1.7B_aegis
duygunuryldz/Qwen3-0.6B-GRPO-test
duygunuryldz/0GRPO_Qwen3-1.7B_aegis
duygunuryldz/Qwen2-0.5B-GRPO-test