AI & ML interests
None yet
Organizations
Echoandland/olmo3-7b-physics-grpo-purerl-step9
Reinforcement Learning
• 7B • Updated • 2
Echoandland/olmo3-7b-physics-grpo-purerl-step7
Reinforcement Learning
• 7B • Updated • 4
Echoandland/qwen3-8b-dapo-high-entropy-step2
Reinforcement Learning
• 8B • Updated Echoandland/qwen3-8b-dapo-high-entropy-step8
Reinforcement Learning
• 8B • Updated • 5
Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step6
Reinforcement Learning
• 7B • Updated Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step7
Reinforcement Learning
• 7B • Updated Echoandland/olmo3-7b-grpo-purerl-creativity-step28
Reinforcement Learning
• 7B • Updated • 3
Echoandland/olmo3-7b-grpo-purerl-creativity-step5
Reinforcement Learning
• 7B • Updated Echoandland/qwen3-8b-grpo-purerl-creativity-step21
Reinforcement Learning
• 8B • Updated Echoandland/qwen3-8b-grpo-purerl-creativity-step9
Reinforcement Learning
• 8B • Updated • 2
Echoandland/qwen3-8b-dapo-fulltokens-creativity-step8
Reinforcement Learning
• 8B • Updated Echoandland/qwen3-8b-dapo-fulltokens-creativity-step11
Reinforcement Learning
• 8B • Updated • 3
Echoandland/qwen3-8b-creativity-grpo-step15-no-use
8B • Updated • 1
Echoandland/qwen3-8b-creativity-grpo-step9-no-use
8B • Updated Echoandland/qwen3-8b-creativity-grpo-step13-no-use
8B • Updated Echoandland/qwen3-8b-creativity-grpo-step7-no-use
8B • Updated Echoandland/qwen3-8b-creativity-grpo-step4-no-use
8B • Updated Echoandland/qwen3-8b-creativity-grpo-step2
8B • Updated Echoandland/17Nov_testing_physics_pure_rl_step160
8B • Updated Echoandland/17Nov_testing_medcase_reasoning_pure_rl
8B • Updated Echoandland/qwen2.5-7b-instruct-medcasereasoning-sft-full-params-step150
8B • Updated Echoandland/qwen2.5-7b-instruct-medcasereasoning-sft-full-step150
8B • Updated