·
AI & ML interests
None yet
Organizations
1231czx/qw_self_corr_math_demo
Viewer
• Updated • 500 • 3
1231czx/packed_self_corr_sft_math_regular_prompt_ep3
Viewer
• Updated • 496 • 3
1231czx/packed_self_corr_sft_math_regular_prompt_ep2
Viewer
• Updated • 496 • 4
1231czx/packed_self_corr_sft_math_ep3
Viewer
• Updated • 496 • 3
1231czx/packed_self_corr_sft_math_ep2
Viewer
• Updated • 496 • 3
1231czx/packed_self_corr_sft_math_ep1
Viewer
• Updated • 496 • 3
1231czx/self_corr_sft_math_ep2
Viewer
• Updated • 496 • 3
1231czx/self_corr_sft_math_ep1
Viewer
• Updated • 496 • 4
Viewer
• Updated • 20k • 2
1231czx/qwq_warm_up_dpo_iter5_gen2_numia_hard
Viewer
• Updated • 10k • 3
1231czx/qwq_warm_up_dpo_iter5_gen1_numia_hard
Viewer
• Updated • 10k • 3
1231czx/raft_no_penlaty_iter3_gen
Viewer
• Updated • 20k • 2
1231czx/qwq2ep_raft_iter1_gen
Viewer
• Updated • 20k • 3
1231czx/new_script_raft_iter4_gen
Viewer
• Updated • 20k • 3
1231czx/new_script_raft_iter3_gen
Viewer
• Updated • 20k • 3
1231czx/new_script_raft_iter2_gen
Viewer
• Updated • 20k • 3
1231czx/new_script_raft1_gen_numia_tmp07
Viewer
• Updated • 496 • 2
1231czx/henry_dong_raft1_gen_numia_tmp07
Viewer
• Updated • 496 • 3
1231czx/henry_dong_raft1_gen_tmp07
Viewer
• Updated • 496 • 3
1231czx/self_corr_sft_gen_tmp07
Viewer
• Updated • 160 • 3
1231czx/self_corr_first_wrong_qwenbase_prompt2_gen2
Viewer
• Updated • 6.47k • 3
1231czx/qwq_warmup_dpo_iter2_gen_data
Viewer
• Updated • 20k • 3
1231czx/self_corr_first_wrong_qwenbase_prompt2_gen1
Viewer
• Updated • 6.47k • 3
1231czx/self_corr_first_wrong_qwenbase_prompt1_gen2
Viewer
• Updated • 1.75k • 3
1231czx/self_corr_first_wrong_qwenbase_prompt1_gen1
Viewer
• Updated • 1.75k • 3
1231czx/base_gen_iter3_prompt
Viewer
• Updated • 80k • 1
1231czx/qwen_qwq_3ep_iter1_gen
Viewer
• Updated • 20k • 3
1231czx/base_gen_iter2_prompt
Viewer
• Updated • 40k • 3
1231czx/corr_iter2_gen_wrong
Viewer
• Updated • 21.8k • 3
Viewer
• Updated • 45k • 3