---
title: Qwen2.5-7B-Instruct Borg Merge v1
emoji: 🤖
colorFrom: indigo
colorTo: purple
sdk: gradio
sdk_version: 5.12.0
python_version: '3.12'
app_file: app.py
hardware: zero-a10g
pinned: false
license: apache-2.0
models:
  - Optitransfer/Qwen2.5-7B-Instruct-borg-merge-v1
short_description: Chat with a 9-model cross-family merged checkpoint
---

Qwen2.5-7B-Instruct -- Borg Merge v1

Chat with a model created by merging 9 models from 4 architecture families (Llama, Phi, NeoX, OPT) into a single checkpoint using training-free cross-family weight merging.
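
To make "training-free weight merging" concrete, here is a minimal sketch of the simplest form of the idea: blending donor parameters into an anchor by plain arithmetic, with no gradient steps. This is not the Borg Merge procedure. Cross-family merging must also map parameters between different architectures, and this README does not describe that step. The function name `merge_into_anchor`, the `alpha` blend weight, and the toy parameter names are all hypothetical.

```python
from typing import Dict, List

# Toy stand-in for a checkpoint: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def merge_into_anchor(anchor: Weights, donors: List[Weights], alpha: float = 0.1) -> Weights:
    """Blend the element-wise mean of compatible donor parameters into the anchor.

    Hypothetical illustration only: a real cross-family merge needs an
    architecture-aware mapping between parameters, which is omitted here.
    """
    merged: Weights = {}
    for name, base in anchor.items():
        # Keep only donors that have a same-named parameter of the same size.
        compatible = [d[name] for d in donors if name in d and len(d[name]) == len(base)]
        if not compatible:
            # No usable donor for this parameter: keep the anchor values unchanged.
            merged[name] = list(base)
            continue
        mean = [sum(vals) / len(compatible) for vals in zip(*compatible)]
        # Linear interpolation between anchor and donor mean; no training involved.
        merged[name] = [(1 - alpha) * b + alpha * m for b, m in zip(base, mean)]
    return merged


anchor = {"w": [1.0, 2.0], "b": [0.0]}
donors = [{"w": [3.0, 4.0]}, {"w": [5.0, 6.0], "b": [1.0, 2.0]}]
merged = merge_into_anchor(anchor, donors, alpha=0.5)
print(merged)
```

In this toy run, `w` moves halfway toward the donor mean, while `b` stays at the anchor value because no donor has a `b` of matching size.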

Headline results: +3.3 pp GSM8K, +3.2 pp ARC-Challenge, +2.6 pp IFEval over the unmerged anchor -- no fine-tuning and no distillation.

Model card | Paper (SSRN) | crdt-merge | Deep-dive article