Models

181

Full-text search

Active filters: cpo

NBA55/Experiment_with_trained_model_Final_CPO_for_all_3_issues-epoch-2

Updated May 12, 2024

smohammadi/llama2-lora-aligned-cpo

Updated Jul 20, 2024 • 1

NBA55/Final_Experiment_with_trained_model_Final_CPO_for_all_3_issues-epoch-2

Updated Aug 24, 2024

Siddartha10/outputs_cpo

Text Generation • 0.1B • Updated Sep 14, 2024 • 3

ravithejads/test_model_sft

Text Generation • 0.1B • Updated Sep 15, 2024 • 2

maxmyn/c4ai-takehome-model-simpo

Text Generation • 0.1B • Updated Sep 15, 2024 • 1

twigs/smolm-cposimpo

Text Generation • 0.1B • Updated Sep 16, 2024 • 1

sarthakrw/cpo_model

Text Generation • 0.1B • Updated Sep 16, 2024 • 3

CharlesLi/OpenELM-1_1B-SimPO

Text Generation • 1B • Updated Sep 20, 2024 • 2

CharlesLi/OpenELM-1_1B-CPO

Text Generation • 1B • Updated Sep 20, 2024 • 2

NBA55/CPO_with_baseline_modalh

Text Generation • 7B • Updated Oct 1, 2024 • 2

NBA55/CPO_with_trained_model_for_all_3_issues-epoch-2

Updated Oct 1, 2024

rawsh/mirrorqwen2.5-0.5b-SimPO

Text Generation • 0.5B • Updated Nov 10, 2024 • 6

rawsh/simpo-math-model

Text Generation • 0.5B • Updated Nov 10, 2024 • 1

rawsh/mirrorqwen2.5-0.5b-SimPO-0

Text Generation • 0.5B • Updated Nov 10, 2024 • 6

mradermacher/mirrorqwen2.5-0.5b-SimPO-GGUF

0.5B • Updated Nov 10, 2024 • 94

mradermacher/mirrorqwen2.5-0.5b-SimPO-0-GGUF

0.5B • Updated Nov 10, 2024 • 363

rawsh/mirrorqwen2.5-0.5b-SimPO-1

Text Generation • 0.5B • Updated Nov 11, 2024 • 7

rawsh/mirrorqwen2.5-0.5b-SimPO-2

Text Generation • 0.5B • Updated Nov 11, 2024 • 7

rawsh/mirrorqwen2.5-0.5b-SimPO-3

Text Generation • 0.5B • Updated Nov 11, 2024 • 5

mradermacher/mirrorqwen2.5-0.5b-SimPO-1-GGUF

0.5B • Updated Nov 12, 2024 • 44

mradermacher/mirrorqwen2.5-0.5b-SimPO-2-GGUF

0.5B • Updated Nov 12, 2024 • 137

mradermacher/mirrorqwen2.5-0.5b-SimPO-3-GGUF

0.5B • Updated Nov 12, 2024 • 74

botways/llama-CPO

Updated Nov 26, 2024

Aratako/Llama-Gemma-2-27b-CPO_SimPO-iter1

Text Generation • 27B • Updated Dec 15, 2024 • 3 • 1

Aratako/Llama-Gemma-2-27b-CPO_SimPO-iter2

Text Generation • 27B • Updated Dec 16, 2024 • 10 • 1

Aratako/gemma-2-2b-axolotl-simpo-v1.0

Text Generation • Updated Dec 10, 2024 • 2

Aratako/gemma-2-2b-axolotl-simpo-v1.0-merged

Text Generation • Updated Dec 10, 2024 • 9 • 1

mradermacher/gemma-2-2b-axolotl-simpo-v1.0-merged-GGUF

3B • Updated Dec 11, 2024 • 174

mjhamar/Meta-Llama-3.1-8B-Instruct-cpo-beir

Text Generation • 8B • Updated Dec 12, 2024 • 5