Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/config.json with huggingface_hub cd2194b verified CserDu123 committed on Nov 2, 2025
Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/model.safetensors.index.json with huggingface_hub 00e67e0 verified CserDu123 committed on Nov 2, 2025
Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/tokenizer.json with huggingface_hub 2efd320 verified CserDu123 committed on Nov 2, 2025
Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/chat_template.json with huggingface_hub a3e17b8 verified CserDu123 committed on Nov 2, 2025
Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/training_args.bin with huggingface_hub fe5093a verified CserDu123 committed on Nov 2, 2025
Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/vocab.json with huggingface_hub 5b6c77a verified CserDu123 committed on Nov 2, 2025
Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/video_preprocessor_config.json with huggingface_hub 9de2e63 verified CserDu123 committed on Nov 2, 2025
Upload 027-tw_grpo-3b-seed_bench_r1_6k-ckpt-376/trainer_state.json with huggingface_hub 86f7c97 verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/merges.txt with huggingface_hub 1b3fbcd verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/preprocessor_config.json with huggingface_hub fa77c99 verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/special_tokens_map.json with huggingface_hub 365f1bf verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/model-00001-of-00002.safetensors with huggingface_hub 6042c85 verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/tokenizer_config.json with huggingface_hub b7f68d0 verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/added_tokens.json with huggingface_hub fa7c4dc verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/generation_config.json with huggingface_hub 5649d5d verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/model-00002-of-00002.safetensors with huggingface_hub 1f94920 verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/config.json with huggingface_hub bb9b65a verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/model.safetensors.index.json with huggingface_hub 02cb8d2 verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/tokenizer.json with huggingface_hub a438c6e verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/chat_template.json with huggingface_hub 1d6dc84 verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/training_args.bin with huggingface_hub eae7814 verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/vocab.json with huggingface_hub e12ea4c verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/video_preprocessor_config.json with huggingface_hub 07272cc verified CserDu123 committed on Nov 2, 2025
Upload 025-arpo_attn_base_grpo-kl_1e-2-3b-seed_bench_r1_6k-ckpt-376/trainer_state.json with huggingface_hub ccaeac1 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/merges.txt with huggingface_hub d758d0b verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/preprocessor_config.json with huggingface_hub fd45db0 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/special_tokens_map.json with huggingface_hub 691c1a3 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/model-00001-of-00002.safetensors with huggingface_hub 50d3377 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/tokenizer_config.json with huggingface_hub df6d597 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/added_tokens.json with huggingface_hub 4e5ccfe verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/generation_config.json with huggingface_hub 6a9a924 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/model-00002-of-00002.safetensors with huggingface_hub 2ce242f verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/config.json with huggingface_hub c046437 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/model.safetensors.index.json with huggingface_hub 02a0957 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/tokenizer.json with huggingface_hub def6676 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/chat_template.json with huggingface_hub 5863163 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/training_args.bin with huggingface_hub c521b96 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/vocab.json with huggingface_hub e0f6148 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/video_preprocessor_config.json with huggingface_hub 23a4151 verified CserDu123 committed on Nov 2, 2025
Upload 024-arpo_attn-3b-seed_bench_r1_6k-ckpt-376/trainer_state.json with huggingface_hub 33e9b36 verified CserDu123 committed on Nov 2, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/merges.txt with huggingface_hub 862138b verified CserDu123 committed on Nov 1, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/preprocessor_config.json with huggingface_hub 08234e3 verified CserDu123 committed on Nov 1, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/special_tokens_map.json with huggingface_hub ee35027 verified CserDu123 committed on Nov 1, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/model-00001-of-00002.safetensors with huggingface_hub 870d311 verified CserDu123 committed on Nov 1, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/tokenizer_config.json with huggingface_hub b191fa5 verified CserDu123 committed on Nov 1, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/added_tokens.json with huggingface_hub 392143a verified CserDu123 committed on Nov 1, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/generation_config.json with huggingface_hub 69337a6 verified CserDu123 committed on Nov 1, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/model-00002-of-00002.safetensors with huggingface_hub 88b72cd verified CserDu123 committed on Nov 1, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/config.json with huggingface_hub ae665c5 verified CserDu123 committed on Nov 1, 2025
Upload 022-grpo-3b-kl_1e-2-seed_bench_r1_6k-ckpt-376/model.safetensors.index.json with huggingface_hub a7d71ac verified CserDu123 committed on Nov 1, 2025