Commit History

Upload 060-grpo-3b_nextgqa-ckpt_418/generation_config.json with huggingface_hub
0d238cc
verified

CserDu123 committed on

Upload 060-grpo-3b_nextgqa-ckpt_418/model-00002-of-00002.safetensors with huggingface_hub
1bd5c61
verified

CserDu123 committed on

Upload 060-grpo-3b_nextgqa-ckpt_418/config.json with huggingface_hub
eb103db
verified

CserDu123 committed on

Upload 060-grpo-3b_nextgqa-ckpt_418/model.safetensors.index.json with huggingface_hub
8016452
verified

CserDu123 committed on

Upload 060-grpo-3b_nextgqa-ckpt_418/tokenizer.json with huggingface_hub
82aa732
verified

CserDu123 committed on

Upload 060-grpo-3b_nextgqa-ckpt_418/chat_template.json with huggingface_hub
4e1355f
verified

CserDu123 committed on

Upload 060-grpo-3b_nextgqa-ckpt_418/training_args.bin with huggingface_hub
b0ecd9e
verified

CserDu123 committed on

Upload 060-grpo-3b_nextgqa-ckpt_418/vocab.json with huggingface_hub
37d9bbe
verified

CserDu123 committed on

Upload 060-grpo-3b_nextgqa-ckpt_418/video_preprocessor_config.json with huggingface_hub
56c9dc3
verified

CserDu123 committed on

Upload 060-grpo-3b_nextgqa-ckpt_418/trainer_state.json with huggingface_hub
4d2e6b7
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/merges.txt with huggingface_hub
4e6107c
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/preprocessor_config.json with huggingface_hub
6a57d17
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/special_tokens_map.json with huggingface_hub
9d98902
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/model-00001-of-00004.safetensors with huggingface_hub
233b8b0
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/tokenizer_config.json with huggingface_hub
8544f20
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/added_tokens.json with huggingface_hub
f40a24a
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/generation_config.json with huggingface_hub
b32f44b
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/model-00004-of-00004.safetensors with huggingface_hub
34671c1
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/config.json with huggingface_hub
fcdff08
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/model-00003-of-00004.safetensors with huggingface_hub
67ac420
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/model.safetensors.index.json with huggingface_hub
55e6641
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/tokenizer.json with huggingface_hub
94297d1
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/model-00002-of-00004.safetensors with huggingface_hub
2276f8b
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/chat_template.json with huggingface_hub
7c64e85
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/training_args.bin with huggingface_hub
b3a97f4
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/vocab.json with huggingface_hub
5309fab
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/video_preprocessor_config.json with huggingface_hub
7fbbc40
verified

CserDu123 committed on

Upload 054-dapo-7b_nextgqa-ckpt_418/trainer_state.json with huggingface_hub
19d00bb
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/merges.txt with huggingface_hub
b2894de
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/preprocessor_config.json with huggingface_hub
6f88b84
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/special_tokens_map.json with huggingface_hub
747edf3
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/model-00001-of-00004.safetensors with huggingface_hub
b1c795b
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/tokenizer_config.json with huggingface_hub
eebb2c2
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/added_tokens.json with huggingface_hub
2a45d5b
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/generation_config.json with huggingface_hub
cec407f
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/model-00004-of-00004.safetensors with huggingface_hub
970e216
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/config.json with huggingface_hub
7e38197
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/model-00003-of-00004.safetensors with huggingface_hub
bc7b8f1
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/model.safetensors.index.json with huggingface_hub
bbd6391
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/tokenizer.json with huggingface_hub
6e45ffa
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/model-00002-of-00004.safetensors with huggingface_hub
2443524
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/chat_template.json with huggingface_hub
2100c9b
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/training_args.bin with huggingface_hub
8ac6a05
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/vocab.json with huggingface_hub
9b0bdd5
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/video_preprocessor_config.json with huggingface_hub
a8a1b0b
verified

CserDu123 committed on

Upload 055-arpo_kl-7b-ablation_seed_bench_r1-ckpt_376/trainer_state.json with huggingface_hub
8988ba1
verified

CserDu123 committed on

Upload 053-tw_grpo-7b_perception_test-ckpt_396/merges.txt with huggingface_hub
2b7b874
verified

CserDu123 committed on

Upload 053-tw_grpo-7b_perception_test-ckpt_396/preprocessor_config.json with huggingface_hub
73f20e2
verified

CserDu123 committed on

Upload 053-tw_grpo-7b_perception_test-ckpt_396/special_tokens_map.json with huggingface_hub
9590b3a
verified

CserDu123 committed on

Upload 053-tw_grpo-7b_perception_test-ckpt_396/model-00001-of-00004.safetensors with huggingface_hub
0096963
verified

CserDu123 committed on