·
AI & ML interests
Machine learning, RLHF
Organizations
weqweasdas/preference_dataset_mixture2_and_safe_pku30k_and_argilla_math_and_ultra_code_for_preference_model
Viewer
• Updated • 606k • 3
weqweasdas/preference_dataset_mixture2_and_safe_pku30k_for_preference_model
Viewer
• Updated • 554k • 3
weqweasdas/ultra_feedback_binarized_for_preference_no_chat_all
Viewer
• Updated • 60.9k • 3
weqweasdas/ultra_feedback_binarized_for_preference_no_chat_40k
Viewer
• Updated • 40k • 2
weqweasdas/gemma_ultra_feedback_binarized_for_preference_15k
Viewer
• Updated • 15k • 3
weqweasdas/zephyr_ultra_feedback_binarized_for_preference
Viewer
• Updated • 60.9k • 3
weqweasdas/ultra_feedback_binarized_for_preference_no_chat
Viewer
• Updated • 60.9k • 3
weqweasdas/ultra_feedback_binarized_for_preference
Viewer
• Updated • 60.9k • 3
weqweasdas/zephyr_ultra_feedback_model1
Viewer
• Updated • 7.5k • 3
weqweasdas/zephyr_ultra_feedback_n32
Viewer
• Updated • 15k • 3
weqweasdas/open_chat_0106_ultra_feedback_n32
Viewer
• Updated • 60k • 3
weqweasdas/openchat_model0_data_with_rewards
Viewer
• Updated • 1 • 2
weqweasdas/rsf_pi0_iter1_with_len
Viewer
• Updated • 1 • 2
weqweasdas/rsf_pi0_mistrav_02_prompt0
Viewer
• Updated • 1 • 3
weqweasdas/rsf_gemma_2b_iter1
Viewer
• Updated • 1 • 3
Viewer
• Updated • 1 • 3
weqweasdas/preference_dataset_mix2
Viewer
• Updated • 528k • 19
• 3
Viewer
• Updated • 116k • 3
weqweasdas/preference_dataset_mixture2_and_safe_pku150k
Viewer
• Updated • 678k • 3
weqweasdas/ultra_prompt_split
Viewer
• Updated • 60k • 3
• 2
weqweasdas/preference_dataset_mixture
Viewer
• Updated • 256k • 5