Inference Providers
Active filters: simpo
radm/forerunner-qwen32b-simpo-awq
Text Generation
• 33B • Updated • 9
Text Generation
• 8B • Updated • 3
AIR-hl/Qwen2.5-1.5B-SimPO
Text Generation
• 2B • Updated • 1
yakazimir/simpo-exps_qwen05b
Text Generation
• 0.5B • Updated • 194
Sean13/mistral-7b-instruct-v0.2-rsimpo-full
Text Generation
• 7B • Updated • 1
Boko99/llama3-instruct-simpo
Text Generation
• 266k • Updated • 3
Text Generation
• 266k • Updated • 3
Sean13/mistral-7b-instruct-v0.2-simpo-full
Text Generation
• 7B • Updated • 1
Sean13/llama-8b-instruct-simpo-full
Text Generation
• 8B • Updated • 1
Sean13/llama-8b-instruct-rsimpo-full
Text Generation
• 8B • Updated • 4
Text Generation
• 9B • Updated • 1
jz666/simpo-train-large-correct
Text Generation
• 9B • Updated • 1
jz666/simpo-train-largest-30-ppl-rejected
Text Generation
• 9B • Updated • 2
jz666/simpo-train-largest-30-ppl-chosen
Text Generation
• 9B • Updated • 2
jz666/simpo-train-largest-30-abs-diff
Text Generation
• 9B • Updated • 2
jz666/simpo-train-smallest-30-abs-diff
Text Generation
• 9B • Updated • 3
jz666/simpo-train-small-correct
Text Generation
• 9B • Updated • 1
jz666/simpo-train-small-wrong
Text Generation
• 9B • Updated • 2
jz666/simpo-train-filtered-full
Text Generation
• 9B • Updated • 1
jz666/simpo-train-large-wrong
Text Generation
• 9B • Updated • 1
jz666/gemma-2-9b-it-simpo-split-10-train_filtered_full
Text Generation
• 9B • Updated • 1
jz666/gemma-2-9b-it-dpo-train_filtered_full
Text Generation
• 9B • Updated • 1
Sean13/mistral-7b-instruct-v0.2-simpo-full-label_smoothing-0.1
Text Generation
• 266k • Updated • 2
Sean13/llama-8b-instruct-simpo-full-label_smoothing-0.1
Text Generation
• 266k • Updated • 2
Text Generation
• 3B • Updated • 8
• 1
mradermacher/Quanta-X-3B-GGUF
3B • Updated • 81
• 1
mradermacher/Quanta-X-3B-i1-GGUF
3B • Updated • 143
• 1
Any-to-Any
• Updated tomofusa/exp020-simpo-merged
Text Generation
• 4B • Updated • 2