·
AI & ML interests
None yet
Organizations
shirwu/rm_debug_unfreeze-last_quant_Skywork-Reward-Llama-3.1-8B-v0.2
Updated
shirwu/rm_freeze_last_Skywork-Reward-Llama-3.1-8B-v0.2
Updated
shirwu/rm_unfreeze_old_template_last_Llama-3.1-8B-Instruct
Updated
shirwu/rm_freeze_1e-4_last_Skywork-Reward-Llama-3.1-8B-v0.2
Updated
shirwu/rm_unfreeze_last_Skywork-Reward-Llama-3.1-8B-v0.2
Updated
shirwu/rm_train_Skywork-Reward-Llama-3.1-8B-v0.2
Updated
shirwu/rm_train_Llama-3.1-8B-Instruct
Updated
shirwu/preference_iterative_hard-answer_generator-iter0
Text Classification
• 8B • Updated • 2
shirwu/dpo-personal-preference-llama3.2-1b-tokenizer
Updated
shirwu/dpo-personal-preference-llama3.2-1b-model
Updated
shirwu/Meta-Llama-3-8B-Instruct_epoch-300_lr-2e-05
Updated
shirwu/Meta-Llama-3-8B-Instruct_epoch-3_lr-2e-05
Updated