FuseChat: Knowledge Fusion of Chat Models
Paper • 2408.07990 • Published • 15
This is a merge of pre-trained language models created using mergekit.
This model was merged using the SCE merge method using sthenno-com/miscii-14b-1225 as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
merge_method: sce
models:
- model: sthenno/tempesthenno-nuslerp-0124
- model: Qwen/Qwen2.5-14B-Instruct-1M
- model: sthenno/tempesthenno-0126-ckpt150
- model: arcee-ai/Virtuoso-Small
- model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
- model: SicariusSicariiStuff/Impish_QWEN_14B-1M
- model: ToastyPigeon/Qwen2.5-14B-Instruct-1M-Unalign
base_model: sthenno-com/miscii-14b-1225
parameters:
select_topk: 1.0
dtype: bfloat16
normalize: true