·
AI & ML interests
None yet
Organizations
None yet
unfair221/fdu_deeplearning_pj2
Updated
unfair221/llama_fcs_wo_on_policy
8B • Updated • 1
unfair221/qwen_original_sol_sft
8B • Updated • 1
unfair221/qwen_fcs_wo_on_policy
8B • Updated unfair221/llama_original_sol_sft
8B • Updated unfair221/qwen_binary_4096_10000_dpo_wo_sftloss
unfair221/qwen_binary_4096_10000_dpo_only_sft01
Text Generation
• 8B • Updated unfair221/qwen_binary_4096_5000_dpo_only_sft01
Text Generation
• 8B • Updated • 4
unfair221/llama_binary_4096_5000_dpo_only_sft01
Text Generation
• 8B • Updated • 5
unfair221/llama_binary_4096_10000_dpo_wo_sftloss
8B • Updated • 1
unfair221/llama_binary_4096_10000_dpo_only_sft01
unfair221/llama_binary_4096_5000_dpo_wo_sftloss
unfair221/llama_binary_4096_20000_dpo_sft_01
unfair221/qwen_binary_4096_20000_dpo_sft01_new
unfair221/train_qwen_w_llama_binary_4096_5000_dpo_sft01
unfair221/train_llama_w_qwen_binary_4096_5000_dpo_sft01
unfair221/qwen_binary_4096_20000_dpo_sft01
8B • Updated • 3
unfair221/qwen_binary_4096_10000_dpo_sft01
unfair221/llama_binary_4096_10000_dpo_sft005
unfair221/llama_binary_4096_10000_dpo_sft02
unfair221/llama_binary_4096_10000_dpo_sft03
unfair221/llama_binary_4096_10000_dpo_sft001
unfair221/llama_binary_4096_10000_dpo_sft01
unfair221/llama_binary_4096_5000_dpo_sft03
unfair221/llama_binary_4096_5000_dpo_sft02
unfair221/llama_binary_4096_5000_dpo_sft001
unfair221/llama_binary_4096_5000_dpo_sft01
unfair221/train_qwen_w_llama_binary_sft
Updated
unfair221/train_llama_w_qwen_binary_sft
Updated
unfair221/qwen_random_wo_error_25000_sft-lora_None-ckpt_None-25-04-26-17_09_38