KoGemopus-4
Collection
๐ฐ๐ท A curated collection of Korean reasoning models distilled from Opus. โข 1 item โข Updated
Gemma4-26B-A4B ๊ธฐ๋ฐ ํ๊ตญ์ด Reasoning SFT ๋ชจ๋ธ. Claude Opus 4.6 distilled ํ๊ตญ์ด reasoning ๋ฐ์ดํฐ 12K๋ก ํ์ต. LR 5e-5, alpha=2รr.
| ํญ๋ชฉ | ๋ด์ฉ |
|---|---|
| Base Model | unsloth/gemma-4-26B-A4B-it |
| ํ์ต ๋ฐฉ๋ฒ | LoRA SFT (Unsloth + TRL) |
| ํ๋ ์์ํฌ | transformers, peft |
| ๋ผ์ด์ผ์ค | Apache 2.0 |
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("jiwon9703/Gemma4-26B-A4B-Korean-SFT-v7")
tokenizer = AutoTokenizer.from_pretrained("jiwon9703/Gemma4-26B-A4B-Korean-SFT-v7")
vllm serve jiwon9703/Gemma4-26B-A4B-Korean-SFT-v7 --max-model-len 8192 --reasoning-parser gemma4