Commit History

Upload 048-arpo_kl-3b_perception_test-ckpt_396/config.json with huggingface_hub
a150980
verified

CserDu123 committed on

Upload 048-arpo_kl-3b_perception_test-ckpt_396/model.safetensors.index.json with huggingface_hub
e8a662c
verified

CserDu123 committed on

Upload 048-arpo_kl-3b_perception_test-ckpt_396/tokenizer.json with huggingface_hub
488713a
verified

CserDu123 committed on

Upload 048-arpo_kl-3b_perception_test-ckpt_396/chat_template.json with huggingface_hub
67c74b8
verified

CserDu123 committed on

Upload 048-arpo_kl-3b_perception_test-ckpt_396/training_args.bin with huggingface_hub
6b99932
verified

CserDu123 committed on

Upload 048-arpo_kl-3b_perception_test-ckpt_396/vocab.json with huggingface_hub
673032f
verified

CserDu123 committed on

Upload 048-arpo_kl-3b_perception_test-ckpt_396/video_preprocessor_config.json with huggingface_hub
63e3109
verified

CserDu123 committed on

Upload 048-arpo_kl-3b_perception_test-ckpt_396/trainer_state.json with huggingface_hub
1545d03
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/merges.txt with huggingface_hub
1fe392c
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/preprocessor_config.json with huggingface_hub
c13da7d
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/special_tokens_map.json with huggingface_hub
554e6cc
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/model-00001-of-00004.safetensors with huggingface_hub
2b4f378
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/tokenizer_config.json with huggingface_hub
0620816
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/added_tokens.json with huggingface_hub
2a41196
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/generation_config.json with huggingface_hub
0be1de6
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/model-00004-of-00004.safetensors with huggingface_hub
efc9b3c
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/config.json with huggingface_hub
df23db8
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/model-00003-of-00004.safetensors with huggingface_hub
1564805
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/model.safetensors.index.json with huggingface_hub
3649cfa
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/tokenizer.json with huggingface_hub
8453638
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/model-00002-of-00004.safetensors with huggingface_hub
f67a047
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/chat_template.json with huggingface_hub
8d5c824
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/training_args.bin with huggingface_hub
29d6d5d
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/vocab.json with huggingface_hub
f40388b
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/video_preprocessor_config.json with huggingface_hub
25fde0d
verified

CserDu123 committed on

Upload 038-arpo_kl-7b_nextqa-ckpt_2133/trainer_state.json with huggingface_hub
cbca841
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/merges.txt with huggingface_hub
d0865dd
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/preprocessor_config.json with huggingface_hub
aedfcc7
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/special_tokens_map.json with huggingface_hub
b2abd44
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/model-00001-of-00002.safetensors with huggingface_hub
b41a9df
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/tokenizer_config.json with huggingface_hub
c3bd328
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/added_tokens.json with huggingface_hub
b052940
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/generation_config.json with huggingface_hub
e4f7080
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/model-00002-of-00002.safetensors with huggingface_hub
c2ff9c8
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/config.json with huggingface_hub
d82c401
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/model.safetensors.index.json with huggingface_hub
59fdd8a
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/tokenizer.json with huggingface_hub
21ab03e
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/chat_template.json with huggingface_hub
aa01d7c
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/training_args.bin with huggingface_hub
ca65584
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/vocab.json with huggingface_hub
b1be8f1
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/video_preprocessor_config.json with huggingface_hub
44b891d
verified

CserDu123 committed on

Upload 036-arpo_kl-3b_nextqa-ckpt_2133/trainer_state.json with huggingface_hub
da2fadd
verified

CserDu123 committed on

Upload 039-arpo_kl-7b_seed_bench_r1-ckpt_376/merges.txt with huggingface_hub
8edb589
verified

CserDu123 committed on

Upload 039-arpo_kl-7b_seed_bench_r1-ckpt_376/preprocessor_config.json with huggingface_hub
33a16ed
verified

CserDu123 committed on

Upload 039-arpo_kl-7b_seed_bench_r1-ckpt_376/special_tokens_map.json with huggingface_hub
a25974f
verified

CserDu123 committed on

Upload 039-arpo_kl-7b_seed_bench_r1-ckpt_376/model-00001-of-00004.safetensors with huggingface_hub
82ccc2d
verified

CserDu123 committed on

Upload 039-arpo_kl-7b_seed_bench_r1-ckpt_376/tokenizer_config.json with huggingface_hub
0a864c2
verified

CserDu123 committed on

Upload 039-arpo_kl-7b_seed_bench_r1-ckpt_376/added_tokens.json with huggingface_hub
e6daac8
verified

CserDu123 committed on

Upload 039-arpo_kl-7b_seed_bench_r1-ckpt_376/generation_config.json with huggingface_hub
4bcaf28
verified

CserDu123 committed on

Upload 039-arpo_kl-7b_seed_bench_r1-ckpt_376/model-00004-of-00004.safetensors with huggingface_hub
984838e
verified

CserDu123 committed on