Models

33

Full-text search

Active filters: simpo

aiyets/test

Updated Oct 13, 2024 • 2

radm/forerunner-qwen32b-simpo-awq

Text Generation • 33B • Updated May 1, 2025 • 9

yakazimir/simpo-exps

Text Generation • 8B • Updated Nov 11, 2024 • 3

AIR-hl/Qwen2.5-1.5B-SimPO

Text Generation • 2B • Updated Jan 3, 2025 • 1

yakazimir/simpo-exps_qwen05b

Text Generation • 0.5B • Updated Apr 10, 2025 • 194

Sean13/mistral-7b-instruct-v0.2-rsimpo-full

Text Generation • 7B • Updated Sep 6, 2025 • 1

Boko99/llama3-instruct-simpo

Text Generation • 266k • Updated Jul 27, 2025 • 3

Boko99/llama3-base-simpo

Text Generation • 266k • Updated Jul 27, 2025 • 3

Sean13/mistral-7b-instruct-v0.2-simpo-full

Text Generation • 7B • Updated Sep 6, 2025 • 1

Sean13/llama-8b-instruct-simpo-full

Text Generation • 8B • Updated Sep 24, 2025 • 1

Sean13/llama-8b-instruct-rsimpo-full

Text Generation • 8B • Updated Sep 24, 2025 • 4

jz666/simpo

Text Generation • 9B • Updated Sep 29, 2025 • 1

jz666/simpo-train-large-correct

Text Generation • 9B • Updated Oct 14, 2025 • 1

jz666/simpo-train-largest-30-ppl-rejected

Text Generation • 9B • Updated Oct 14, 2025 • 2

jz666/simpo-train-largest-30-ppl-chosen

Text Generation • 9B • Updated Oct 14, 2025 • 2

jz666/simpo-train-largest-30-abs-diff

Text Generation • 9B • Updated Oct 14, 2025 • 2

jz666/simpo-train-smallest-30-abs-diff

Text Generation • 9B • Updated Oct 14, 2025 • 3

jz666/simpo-train-small-correct

Text Generation • 9B • Updated Oct 14, 2025 • 1

jz666/simpo-train-small-wrong

Text Generation • 9B • Updated Oct 14, 2025 • 2

jz666/simpo-train-filtered-full

Text Generation • 9B • Updated Oct 14, 2025 • 1

jz666/simpo-train-large-wrong

Text Generation • 9B • Updated Oct 16, 2025 • 1

jz666/gemma-2-9b-it-simpo-split-10-train_filtered_full

Text Generation • 9B • Updated Oct 17, 2025 • 1

jz666/gemma-2-9b-it-dpo-train_filtered_full

Text Generation • 9B • Updated Oct 20, 2025 • 1

Sean13/mistral-7b-instruct-v0.2-simpo-full-label_smoothing-0.1

Text Generation • 266k • Updated Nov 21, 2025 • 2

Sean13/llama-8b-instruct-simpo-full-label_smoothing-0.1

Text Generation • 266k • Updated Nov 21, 2025 • 2

szili2011/Quanta-X-3B

Text Generation • 3B • Updated Jan 18 • 8 • 1

mradermacher/Quanta-X-3B-GGUF

3B • Updated Jan 1 • 81 • 1

mradermacher/Quanta-X-3B-i1-GGUF

3B • Updated Jan 1 • 143 • 1

mr3haque/OmniAgent

Any-to-Any • Updated 4 days ago

tomofusa/exp020-simpo-merged

Text Generation • 4B • Updated Feb 26 • 2