Editing Models with Task Arithmetic
Paper • 2212.04089 • Published • 8
This is a merge of pre-trained language models created using mergekit.
This model was merged using the Task Arithmetic merge method using Undi95/Llama-3-Unholy-8B as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
slices:
- sources:
- model: Undi95/Llama-3-Unholy-8B
layer_range: [0, 32]
parameters:
weight: 0.55
- model: Casual-Autopsy/L3-Umbral-Mind-RP-v3.0-8B
layer_range: [0, 32]
parameters:
weight: 0.5
- model: SicariusSicariiStuff/Dusk_Rainbow
layer_range: [0, 32]
parameters:
weight: 0.25
- model: Dampfinchen/Llama-3-8B-Ultra-Instruct
layer_range: [0, 32]
parameters:
weight: 0.15
- model: R136a1/Bungo-L3-8B
layer_range: [0, 32]
parameters:
weight: 0.1
merge_method: task_arithmetic
base_model: Undi95/Llama-3-Unholy-8B
normalize: False
dtype: bfloat16