Commit History

Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/config.json with huggingface_hub
cd2194b
verified

CserDu123 commited on

Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/model.safetensors.index.json with huggingface_hub
00e67e0
verified

CserDu123 commited on

Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/tokenizer.json with huggingface_hub
2efd320
verified

CserDu123 commited on

Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/chat_template.json with huggingface_hub
a3e17b8
verified

CserDu123 commited on

Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/training_args.bin with huggingface_hub
fe5093a
verified

CserDu123 commited on

Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/vocab.json with huggingface_hub
5b6c77a
verified

CserDu123 commited on

Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/video_preprocessor_config.json with huggingface_hub
9de2e63
verified

CserDu123 commited on

Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/trainer_state.json with huggingface_hub
86f7c97
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/merges.txt with huggingface_hub
1b3fbcd
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/preprocessor_config.json with huggingface_hub
fa77c99
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/special_tokens_map.json with huggingface_hub
365f1bf
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/model-00001-of-00002.safetensors with huggingface_hub
6042c85
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/tokenizer_config.json with huggingface_hub
b7f68d0
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/added_tokens.json with huggingface_hub
fa7c4dc
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/generation_config.json with huggingface_hub
5649d5d
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/model-00002-of-00002.safetensors with huggingface_hub
1f94920
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/config.json with huggingface_hub
bb9b65a
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/model.safetensors.index.json with huggingface_hub
02cb8d2
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/tokenizer.json with huggingface_hub
a438c6e
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/chat_template.json with huggingface_hub
1d6dc84
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/training_args.bin with huggingface_hub
eae7814
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/vocab.json with huggingface_hub
e12ea4c
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/video_preprocessor_config.json with huggingface_hub
07272cc
verified

CserDu123 commited on

Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/trainer_state.json with huggingface_hub
ccaeac1
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/merges.txt with huggingface_hub
d758d0b
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/preprocessor_config.json with huggingface_hub
fd45db0
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/special_tokens_map.json with huggingface_hub
691c1a3
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/model-00001-of-00002.safetensors with huggingface_hub
50d3377
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/tokenizer_config.json with huggingface_hub
df6d597
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/added_tokens.json with huggingface_hub
4e5ccfe
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/generation_config.json with huggingface_hub
6a9a924
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/model-00002-of-00002.safetensors with huggingface_hub
2ce242f
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/config.json with huggingface_hub
c046437
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/model.safetensors.index.json with huggingface_hub
02a0957
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/tokenizer.json with huggingface_hub
def6676
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/chat_template.json with huggingface_hub
5863163
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/training_args.bin with huggingface_hub
c521b96
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/vocab.json with huggingface_hub
e0f6148
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/video_preprocessor_config.json with huggingface_hub
23a4151
verified

CserDu123 commited on

Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/trainer_state.json with huggingface_hub
33e9b36
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/merges.txt with huggingface_hub
862138b
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/preprocessor_config.json with huggingface_hub
08234e3
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/special_tokens_map.json with huggingface_hub
ee35027
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/model-00001-of-00002.safetensors with huggingface_hub
870d311
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/tokenizer_config.json with huggingface_hub
b191fa5
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/added_tokens.json with huggingface_hub
392143a
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/generation_config.json with huggingface_hub
69337a6
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/model-00002-of-00002.safetensors with huggingface_hub
88b72cd
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/config.json with huggingface_hub
ae665c5
verified

CserDu123 commited on

Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/model.safetensors.index.json with huggingface_hub
a7d71ac
verified

CserDu123 commited on