ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_act_tok_2500 Text Generation • 8B • Updated Dec 2, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_act_tok_2000 Text Generation • 8B • Updated Dec 2, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_act_tok_1500 Text Generation • 8B • Updated Dec 2, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_act_tok_1000 Text Generation • 8B • Updated Dec 2, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_act_tok_500 Text Generation • 8B • Updated Dec 2, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_cont_force_action_fix_term_3800 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_cont_force_action_fix_term_3500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_cont_force_action_fix_term_3000 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_3350 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_cont_force_action_fix_term_2500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_3000 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_cont_force_action_fix_term_2000 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_2500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_cont_force_action_fix_term_1500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_2000 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_cont_force_action_fix_term_1000 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_1500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_cont_force_action_fix_term_500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_1000 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_fix_term_500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_3500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_3000 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_2500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_2000 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_1500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_1000 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/q3_8b_untrained_actor_pos_random_force_action_500 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/qwen3_8b_reasoning_sft_ft_untrained_actor_pos_cont_actual_force_action_390 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/qwen3_8b_reasoning_sft_ft_untrained_actor_pos_cont_actual_force_action_300 Text Generation • 8B • Updated Dec 1, 2025 • 1
ccui46/qwen3_8b_reasoning_sft_ft_untrained_actor_pos_cont_actual_force_action_200 Text Generation • 8B • Updated Dec 1, 2025 • 1