llama-stampede-64x101m

llama-stampede-64x101m is a merge of the following models using mergekit:

🧩 Configuration

base_model: BEE-spoke-data/smol_llama-101M-GQA
gate_mode: random
dtype: bfloat16
experts:
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
Downloads last month
7
Safetensors
Model size
3B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support