AI & ML interests
None yet
Organizations
princeton-nlp/warm-start__ppo__think__Qwen2.5-7B
8B • Updated • 2
princeton-nlp/warm-start__dpo__nothink__Qwen2.5-7B-Instruct
8B • Updated • 1
princeton-nlp/warm-start__dpo__nothink__Llama-3.1-8B-Instruct
8B • Updated • 1
princeton-nlp/warm-start__dpo__nothink__Qwen2.5-7B
8B • Updated • 3
princeton-nlp/warm-start__dpo__nothink__Llama-3.1-8B
8B • Updated • 4
princeton-nlp/warm-start__dpo__think__Qwen2.5-7B-Instruct
8B • Updated
princeton-nlp/warm-start__dpo__think__Llama-3.1-8B-Instruct
8B • Updated • 1
princeton-nlp/warm-start__dpo__think__Qwen2.5-7B
8B • Updated • 1
princeton-nlp/warm-start__dpo__think__Llama-3.1-8B
8B • Updated • 1
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B
8B • Updated • 3
princeton-nlp/warm-start__sft__nothink__Llama-3.1-8B
8B • Updated • 2
princeton-nlp/warm-start__sft__think__Qwen2.5-7B-Instruct
8B • Updated • 2 • 1
princeton-nlp/warm-start__sft__nothink__Llama-3.1-8B-Instruct
8B • Updated • 1
princeton-nlp/warm-start__sft__think__Qwen2.5-7B
8B • Updated • 7
princeton-nlp/warm-start__sft__think__Llama-3.1-8B
8B • Updated • 2
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B-Instruct
8B • Updated • 5
princeton-nlp/warm-start__sft__think__Llama-3.1-8B-Instruct
8B • Updated • 2
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct
8B • Updated • 9.08k • 26
princeton-nlp/Llama-3-8B-ProLong-512k-Base
8B • Updated • 7.72k • 9
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct
Text Generation • 8B • Updated • 7.83k • 13
princeton-nlp/Llama-3-8B-ProLong-64k-Base
Text Generation • 8B • Updated • 7.82k • 6
princeton-nlp/Mistral-7B-Base-SFT-CPO
Text Generation • 7B • Updated • 63 • 1
princeton-nlp/Mistral-7B-Base-SFT-RRHF
Text Generation • 7B • Updated • 82
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 482 • 172
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 36 • 9
princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2
Text Generation • 8B • Updated • 104 • 8
princeton-nlp/Llama-3-Instruct-8B-RDPO-v0.2
Text Generation • 8B • Updated • 46
princeton-nlp/Llama-3-Instruct-8B-ORPO-v0.2
Text Generation • 8B • Updated • 50
princeton-nlp/Llama-3-Instruct-8B-KTO-v0.2
Text Generation • 8B • Updated • 50
princeton-nlp/Llama-3-Instruct-8B-CPO-v0.2
Text Generation • 8B • Updated • 52