Instructions to use Anserwise/AWAXIS-Think-28B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Anserwise/AWAXIS-Think-28B with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Anserwise/AWAXIS-Think-28B")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("Anserwise/AWAXIS-Think-28B")
model = AutoModelForImageTextToText.from_pretrained("Anserwise/AWAXIS-Think-28B")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use Anserwise/AWAXIS-Think-28B with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Anserwise/AWAXIS-Think-28B"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Anserwise/AWAXIS-Think-28B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Anserwise/AWAXIS-Think-28B

SGLang

How to use Anserwise/AWAXIS-Think-28B with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Anserwise/AWAXIS-Think-28B" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Anserwise/AWAXIS-Think-28B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Anserwise/AWAXIS-Think-28B" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Anserwise/AWAXIS-Think-28B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use Anserwise/AWAXIS-Think-28B with Docker Model Runner:
```
docker model run hf.co/Anserwise/AWAXIS-Think-28B
```

Anserwise commited on 23 days ago

Commit

70749e5

verified ·

1 Parent(s): f3409de

README: Mother model is rico03/Qwen3.6-27B-Claude-Opus-Reasoning-Distilled; Son is Darwin-28B-Opus

Browse files

Files changed (1) hide show

README.md +38 -30

README.md CHANGED Viewed

@@ -25,42 +25,45 @@ library_name: transformers
 base_model:
 - FINAL-Bench/Darwin-28B-Opus
 - Qwen/Qwen3.6-27B
-- FINAL-Bench/Darwin-28B-Opus
 ---
 # AWAXIS-Think-28B
 **Anserwise가 공개하는 한국어 특화 28B 추론 LLM**
-Qwen3.6 세대 백본 위에 Darwin V7 진화적 병합으로 탄생한 Darwin-28B-Opus(자), 그 자 모델 위에 한국어 LoRA를 적용하여 완성된 모델입니다.
 ---
 ## 🧬 모델 계보 (Family Tree)
 ```
-┌─────────────────────────────────────────────────────────────┐
-│  부 (Father):  Qwen/Qwen3.6-27B                             │
-│                └─ Alibaba Qwen 팀의 최신 Qwen3.6 세대 백본     │
-│                                                             │
-│  모 (Mother):  FINAL-Bench/Darwin-28B-Opus                  │
-│                └─ Qwen3.5 기반 한국어·추론 특화 모델           │
-│                                                             │
-│              │                                              │
-│              │  Darwin V7 진화적 병합 (CMA-ES 기반 최적화)     │
-│              ▼                                              │
-│  자 (Son):   FINAL-Bench/Darwin-28B-Opus                    │
-│              └─ 28B 멀티모달, Qwen3.5 하이브리드 아키텍처       │
-│                                                             │
-│              │                                              │
-│              │  한국어 LoRA 미세조정 (r=8, 영향도 최소)          │
-│              ▼                                              │
-│  본 모델:    Anserwise/AWAXIS-Think-28B  ⭐                  │
-│              └─ 한국어 특화 28B (K-AI 리더보드 타겟)            │
-└─────────────────────────────────────────────────────────────┘
 ```
 **3대 가계 구조**:
-- **부·모**가 Darwin V7 시스템에서 **진화적으로 교배**되어 **자(Darwin-28B-Opus)** 탄생
 - 그 **자 모델 위에 한국어 LoRA**를 입혀 **AWAXIS-Think-28B** 완성
 ---
@@ -74,7 +77,7 @@ Qwen3.6 세대 백본 위에 Darwin V7 진화적 병합으로 탄생한 Darwin-2
 | 컨텍스트 | 262,144 tokens |
 | 자(子) 모델 | FINAL-Bench/Darwin-28B-Opus |
 | 부(父) 백본 | Qwen/Qwen3.6-27B |
-| 모(母) 베이스 | FINAL-Bench/Darwin-28B-Opus |
 | 추가 학습 | **한국어 LoRA 미세조정** |
 | 라이선스 | Apache 2.0 |
 | 데이터형 | BF16 |
@@ -84,14 +87,18 @@ Qwen3.6 세대 백본 위에 Darwin V7 진화적 병합으로 탄생한 Darwin-2
 ## ✨ 주요 특징
-### 1. Darwin 진화적 병합 — 자(子) Darwin-28B-Opus
-FINAL-Bench 팀의 **Darwin V7 시스템**이 Qwen3.6-27B(부)와 Darwin-28B-Opus(모)를 **CMA-ES 기반 진화적 탐색**으로 병합.
-레이어별 최적 혼합 비율을 수십 세대 진화시켜 **부모 세대의 강점(신세대 백본 + 한국어·추론력)을 함께 계승**한 28B 자(子) 모델 탄생.
 ### 2. 한국어 LoRA 미세조정 — AWAXIS-Think-28B
 자(Darwin-28B-Opus) 위에 **영향도 최소화 LoRA**를 적용:
 - **Rank**: 8 (전체 파라미터의 약 0.02%만 조정)
-- **Target modules**: q_proj, k_proj, v_proj, o_proj (어텐션만)
 - **Alpha**: 16, Dropout: 0.05
 - **학습 데이터**: KoAlpaca-v1.1a + KOpen-platypus (선별 3,000 샘플)
 - **Epoch**: 1, Learning rate: 1e-5 cosine schedule
@@ -100,7 +107,7 @@ FINAL-Bench 팀의 **Darwin V7 시스템**이 Qwen3.6-27B(부)와 Darwin-28B-Opu
 ### 3. Qwen3.5 하이브리드 어텐션
 - **Linear Attention (Gated Delta Net)** × 48 — 262K 긴 컨텍스트 효율
-- **Full Attention** × 16 — 핵심 토큰 간 강한 관계
 - **4:1 패턴**: `[L, L, L, F]` × 16 반복
 ### 4. K-AI 리더보드 호환 패키지
@@ -176,8 +183,9 @@ Apache 2.0 — 상업·학술 목적 사용 가능. 재배포 시 **베이스
 ## 🙏 감사의 말
-- **Alibaba Qwen 팀** — 부(父) Qwen3.6-27B 기반 백본 제공
-- **FINAL-Bench 팀** — 모(母) Darwin-28B-Opus 및 Darwin V7 진화적 병합 시스템
 - **한국어 SFT 데이터셋 기여자** — KoAlpaca, KOpen-platypus
 - **HuggingFace / Transformers / PEFT** 커뮤니티

 base_model:
 - FINAL-Bench/Darwin-28B-Opus
 - Qwen/Qwen3.6-27B
+- rico03/Qwen3.6-27B-Claude-Opus-Reasoning-Distilled
 ---
 # AWAXIS-Think-28B
 **Anserwise가 공개하는 한국어 특화 28B 추론 LLM**
+Qwen3.6-27B(부)와 rico03의 Claude Opus 증류 추론 모델(모)을 **Darwin V7 진화적 병합**으로 교배하여 탄생한 **Darwin-28B-Opus(자)**, 그 자 모델 위에 **한국어 LoRA**를 입혀 완성된 모델입니다.
 ---
 ## 🧬 모델 계보 (Family Tree)
 ```
+┌───────────────────────────────────────────────────────────────────────┐
+│                                                                       │
+│  부 (Father):  Qwen/Qwen3.6-27B                                       │
+│                └─ Alibaba Qwen 팀의 최신 Qwen3.6 세대 백본               │
+│                                                                       │
+│  모 (Mother):  rico03/Qwen3.6-27B-Claude-Opus-Reasoning-Distilled     │
+│                └─ Claude Opus 추론 능력을 증류한 Qwen3.6-27B 기반 모델    │
+│                                                                       │
+│              │  Darwin V7 진화적 병합 (CMA-ES 기반 최적 혼합 탐색)        │
+│              ▼                                                        │
+│                                                                       │
+│  자 (Son):   FINAL-Bench/Darwin-28B-Opus                              │
+│              └─ 부·모 DNA 결합 28B 모델                                 │
+│                 Qwen3.5 하이브리드 아키텍처 (Linear × 48 + Full × 16)    │
+│                                                                       │
+│              │  한국어 LoRA 미세조정 (r=8, 어텐션 전용, 영향도 최소)      │
+│              ▼                                                        │
+│                                                                       │
+│  본 모델:    Anserwise/AWAXIS-Think-28B  ⭐                            │
+│              └─ 한국어 특화 28B (K-AI 리더보드 타겟)                     │
+│                                                                       │
+└───────────────────────────────────────────────────────────────────────┘
 ```
 **3대 가계 구조**:
+- **부·모**(Qwen3.6-27B × rico03 Claude-Opus-Reasoning-Distilled)가 Darwin V7 시스템에서 **진화적으로 교배**되어 **자(Darwin-28B-Opus)** 탄생
 - 그 **자 모델 위에 한국어 LoRA**를 입혀 **AWAXIS-Think-28B** 완성
 ---
 | 컨텍스트 | 262,144 tokens |
 | 자(子) 모델 | FINAL-Bench/Darwin-28B-Opus |
 | 부(父) 백본 | Qwen/Qwen3.6-27B |
+| 모(母) 베이스 | rico03/Qwen3.6-27B-Claude-Opus-Reasoning-Distilled |
 | 추가 학습 | **한국어 LoRA 미세조정** |
 | 라이선스 | Apache 2.0 |
 | 데이터형 | BF16 |
 ## ✨ 주요 특징
+### 1. Darwin V7 진화적 병합으로 태어난 자(子) Darwin-28B-Opus
+FINAL-Bench 팀의 **Darwin V7 시스템**이:
+- **부(Qwen/Qwen3.6-27B)** — 최신 Qwen3.6 세대 범용 백본
+- **모(rico03/Qwen3.6-27B-Claude-Opus-Reasoning-Distilled)** — Claude Opus급 추론력을 Qwen3.6-27B에 증류한 모델
+두 부모를 **CMA-ES 기반 진화적 탐색**으로 레이어별 최적 혼합 비율을 수십 세대 진화시켜 병합.
+→ **부모의 범용 백본 + 고급 추론력이 동시 계승된 28B 자(子) 모델** 탄생.
 ### 2. 한국어 LoRA 미세조정 — AWAXIS-Think-28B
 자(Darwin-28B-Opus) 위에 **영향도 최소화 LoRA**를 적용:
 - **Rank**: 8 (전체 파라미터의 약 0.02%만 조정)
+- **Target modules**: q_proj, k_proj, v_proj, o_proj (어텐션만, FFN 미개입)
 - **Alpha**: 16, Dropout: 0.05
 - **학습 데이터**: KoAlpaca-v1.1a + KOpen-platypus (선별 3,000 샘플)
 - **Epoch**: 1, Learning rate: 1e-5 cosine schedule
 ### 3. Qwen3.5 하이브리드 어텐션
 - **Linear Attention (Gated Delta Net)** × 48 — 262K 긴 컨텍스트 효율
+- **Full Attention** × 16 — 핵심 토큰 간 강한 관계 포착
 - **4:1 패턴**: `[L, L, L, F]` × 16 반복
 ### 4. K-AI 리더보드 호환 패키지
 ## 🙏 감사의 말
+- **Alibaba Qwen 팀** — 부(父) Qwen/Qwen3.6-27B 기반 백본 제공
+- **rico03** — 모(母) Qwen3.6-27B-Claude-Opus-Reasoning-Distilled (Claude Opus 추론 증류)
+- **FINAL-Bench 팀** — 자(子) Darwin-28B-Opus 및 Darwin V7 진화적 병합 시스템
 - **한국어 SFT 데이터셋 기여자** — KoAlpaca, KOpen-platypus
 - **HuggingFace / Transformers / PEFT** 커뮤니티