TongZheng PRO
TongZheng1999
AI & ML interests
Natural Language Processing
Recent Activity
updated a model about 24 hours ago
TongZheng1999/Final-Reasoning-4B-Iter1-Strong-Init-Filtered-RB-by-Judge published a model 1 day ago
TongZheng1999/Final-Reasoning-4B-Iter1-Strong-Init-Filtered-RB-by-Judge updated a dataset 1 day ago
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_Merge_f_by_judge