Broken Output - Frankenstein Patch Attempt

#2
by OrobasVault - opened

To test this "Frankenstein patch," you actually want to use the linear merge method with strict 1.0 and 0.0 filters, rather than passthrough.

Here is why: passthrough expects exactly one model to supply any given tensor. Trying to route the pre/post weights (like lm_head) alongside the layer weights through passthrough in the YAML can crash tensor routing. Using linear with normalize: false and binary weights achieves the exact same "copy-paste" result and is completely stable.
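The copy-paste arithmetic this relies on is easy to sketch with dummy tensors (this illustrates the math only, not mergekit's actual internals):

```python
import numpy as np

def linear_merge(tensors, weights, normalize=False):
    # Weighted sum of the candidate tensors for one parameter.
    out = sum(w * t for w, t in zip(weights, tensors))
    if normalize:
        out = out / sum(weights)  # rescales when weights don't sum to 1
    return out

rng = np.random.default_rng(0)
body = rng.standard_normal((4, 4))  # stands in for a TensorGuard tensor
head = rng.standard_normal((4, 4))  # the same tensor from Cydonia

# With weights 1.0 / 0.0 the "merge" is an exact copy of the first model.
merged = linear_merge([body, head], [1.0, 0.0])
assert np.array_equal(merged, body)
```

With binary complementary weights the sum is already 1.0, so normalize would not change anything here; normalize: false simply guarantees no rescaling can sneak in.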

This patch will take the "Body" (the MLP and Attention layers) of your broken TensorGuard merge, and surgically attach the "Mouth and Ears" (lm_head and embed_tokens) of Cydonia.

The "Frankenstein Patch" YAML

Create a new file called patch.yaml:

merge_method: linear
parameters:
  normalize: false # CRITICAL: Ensures weights stay exactly at 1.0 and 0.0
models:
  # 1. The "Body" (Your broken TensorGuard merge)
  - model: /workspace/merges/TENSORGUARD-prototype 
    parameters:
      weight:
        - filter: embed_tokens
          value: 0.0  # Delete the broken embeddings
        - filter: lm_head
          value: 0.0  # Delete the broken head
        - value: 1.0  # Keep 100% of the TensorGuard MLP/Attention layers

  # 2. The "Head" (Cydonia)
  - model: /workspace/models/Cydonia-24B-v4.3
    parameters:
      weight:
        - filter: embed_tokens
          value: 1.0  # Inject Cydonia's embeddings
        - filter: lm_head
          value: 1.0  # Inject Cydonia's head
        - value: 0.0  # Ignore all of Cydonia's MLP/Attention layers

tokenizer:
  # CRITICAL: The tokenizer must exactly match the model you stole the embeddings from
  source: /workspace/models/Cydonia-24B-v4.3 
dtype: bfloat16
name: TensorGuard-Cydonia-Patch
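For reference, the per-tensor weight selection the YAML above relies on can be sketched like this (an assumption-laden illustration: it mimics first-matching-filter, substring-on-tensor-name semantics, and resolve_weight is a hypothetical helper, not mergekit code):

```python
def resolve_weight(tensor_name, weight_entries):
    # First entry whose filter string appears in the tensor name wins;
    # an entry with no filter acts as the catch-all default.
    for entry in weight_entries:
        filt = entry.get("filter")
        if filt is None or filt in tensor_name:
            return entry["value"]
    return 0.0  # no entry matched (shouldn't happen with a default entry)

# The TensorGuard side of the patch config:
tensorguard = [
    {"filter": "embed_tokens", "value": 0.0},
    {"filter": "lm_head", "value": 0.0},
    {"value": 1.0},
]

assert resolve_weight("model.layers.10.mlp.down_proj.weight", tensorguard) == 1.0
assert resolve_weight("model.embed_tokens.weight", tensorguard) == 0.0
assert resolve_weight("lm_head.weight", tensorguard) == 0.0
```

So every MLP/attention tensor keeps TensorGuard at 1.0, while both vocab tensors are zeroed out and (per the mirrored Cydonia entries) replaced wholesale.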

How to run it:

Run this just like a normal merge. Because it is a simple linear copy-paste, it will finish in about 2 minutes on your RunPod.

mergekit-yaml /workspace/patch.yaml /workspace/merges/TENSORGUARD-PATCHED \
    --copy-tokenizer \
    --out-shard-size 5B \
    --lazy-unpickle \
    --cuda
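Once the merge finishes, a quick sanity check is to confirm the vocab tensors really were copied verbatim from Cydonia. A minimal sketch (embeddings_match and the dummy dicts are illustrative; in practice load the real shards into name-to-array dicts, e.g. with safetensors):

```python
import numpy as np

def embeddings_match(patched, donor,
                     names=("model.embed_tokens.weight", "lm_head.weight")):
    # True if the patched state dict carries the donor's vocab tensors verbatim.
    # patched / donor are dicts mapping tensor name -> array.
    return all(np.array_equal(patched[n], donor[n]) for n in names)

# Dummy stand-ins for the real checkpoints:
donor = {"model.embed_tokens.weight": np.ones((8, 4)),
         "lm_head.weight": np.zeros((8, 4))}
patched = dict(donor)  # a correct patch copies these tensors exactly
assert embeddings_match(patched, donor)
```

If this check fails on the real checkpoints, the filters never fired and the patch is a no-op, which is worth ruling out before drawing any diagnostic conclusions.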

Why this is a brilliant diagnostic test:

If you run this patched model and the looping/early termination is completely gone, you have successfully proven that the TensorGuard dense averaging destroyed the vocabulary embeddings.

If the model still loops after this patch, it means the TensorGuard averaging actually destroyed the internal MLP (knowledge) layers, and the model is mathematically incapable of forming a coherent thought regardless of its vocabulary.

This doesn't work; the model is BROKEN

OrobasVault changed discussion status to closed
