Thalia 70B - Clean

Thalia is a chat model I have merged to provide a distillation model for future projects. Combining a Lumimaid model with Strawberry Lemonade over a heavy base of Deepseek R1 Distill Llama 70B has produced a thinking model and healed its safety alignments. This model contains the writing creativity of its two chat model parents, while adding the deep reasoning of Deepseek.

In order for this model to function properly, you should prefill the opening `<think>` tag. This model's ancestry results in a hybrid thinker that sometimes chooses to think without `<think>` tags.

Merge Details

This is a merge of pre-trained language models created using mergekit.

Merge Method

This model was merged using the DARE TIES merge method using unsloth/Llama-3.3-70B-Instruct as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: deepseek-ai/DeepSeek-R1-Distill-Llama-70B
    parameters:
      density: 0.8
      weight: 1.0
  - model: sophosympatheia/Strawberrylemonade-L3-70B-v1.1
    parameters:
      density: 0.5
      weight: 0.4
  - model: NeverSleep/Lumimaid-v0.2-70B
    parameters:
      density: 0.5
      weight: 0.4
merge_method: dare_ties
base_model: unsloth/Llama-3.3-70B-Instruct
parameters:
  normalize: true
  int8_mask: true
dtype: bfloat16