Llama-3.2-1B RLVR - Summarization
Updated Jan 26
rosieyzh/rlvr_llama1_rouge_xsum_rbz_{32,64}_ckpt_{i}_of_10
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_1_of_10 • 1B • Updated Jan 26 • 2
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_2_of_10 • 1B • Updated Jan 26 • 1
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_3_of_10 • 1B • Updated Jan 26 • 3
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_4_of_10 • 1B • Updated Jan 26 • 1
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_5_of_10 • 1B • Updated Jan 26 • 2
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_6_of_10 • 1B • Updated Jan 26 • 1
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_7_of_10 • 1B • Updated Jan 26 • 1
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_8_of_10 • 1B • Updated Jan 26 • 1
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_9_of_10 • 1B • Updated Jan 26 • 1
rosieyzh/rlvr_llama1_rouge_xsum_rbz_64_ckpt_10_of_10 • 1B • Updated Jan 26 • 1
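
The naming pattern in the collection description, rlvr_llama1_rouge_xsum_rbz_{32,64}_ckpt_{i}_of_10, can be expanded into individual repository IDs. The following is a minimal sketch, assuming that i runs from 1 to 10 and that the rbz_32 variants follow the same scheme (only the rbz_64 checkpoints are listed on this page). The function name `checkpoint_repo_ids` and its parameters are hypothetical, and the meaning of `rbz` is not stated on the page.

```python
def checkpoint_repo_ids(rbz_values=(32, 64), num_checkpoints=10, owner="rosieyzh"):
    """Expand the collection's naming pattern into a list of repo IDs.

    Assumption: checkpoint indices run from 1 to num_checkpoints inclusive.
    Assumption: every value in rbz_values has a full set of checkpoints;
    the collection page itself lists only the rbz_64 repositories.
    """
    return [
        f"{owner}/rlvr_llama1_rouge_xsum_rbz_{rbz}_ckpt_{i}_of_{num_checkpoints}"
        for rbz in rbz_values
        for i in range(1, num_checkpoints + 1)
    ]


if __name__ == "__main__":
    # Print only the rbz_64 IDs, which are the ones shown in the listing.
    for repo_id in checkpoint_repo_ids(rbz_values=(64,)):
        print(repo_id)
```

Restricting `rbz_values` to `(64,)` reproduces the ten IDs listed above; whether each rbz_32 ID resolves to an existing repository would need to be checked on the Hub.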