ComfyUI_temp_bhdgz_00006_

Thalia 70B - Clean

Thalia is a chat model I have merged to provide a distillation model for future projects. Combining a Lumimaid model with Strawberry Lemonade over a heavy base of Deepseek R1 Distill Llama 70B has produced a thinking model and healed its safety alignments. This model contains the writing creativity of its two chat model parents, while adding the deep reasoning of Deepseek.

In order for this model to function properly, you should prefill the opening <think> tag. This model's ancestry results in a hybrid thinker that sometimes chooses to think without <think> tags.

Merge Details

This is a merge of pre-trained language models created using mergekit.

Merge Method

This model was merged using the DARE TIES merge method using unsloth/Llama-3.3-70B-Instruct as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: deepseek-ai/DeepSeek-R1-Distill-Llama-70B
    parameters:
      density: 0.8
      weight: 1.0
  - model: sophosympatheia/Strawberrylemonade-L3-70B-v1.1
    parameters:
      density: 0.5
      weight: 0.4
  - model: NeverSleep/Lumimaid-v0.2-70B
    parameters:
      density: 0.5
      weight: 0.4
merge_method: dare_ties
base_model: unsloth/Llama-3.3-70B-Instruct
parameters:
  normalize: true
  int8_mask: true
dtype: bfloat16
Downloads last month
18
Safetensors
Model size
71B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Nabbers1999/Thalia-70B-0307-Clean

Collection including Nabbers1999/Thalia-70B-0307-Clean

Paper for Nabbers1999/Thalia-70B-0307-Clean