Model Card for gemma-4-31B-K1

This is an experimental fine-tune trained on a large amount of data from iannicity/KIMI-K2.5-1000000x and iannicity/Hunter-Alpha-SFT.

If you enjoy my work, feel free to support me on Ko-fi with a coffee.

Every bit of your support directly helps me keep creating and spend more time making even better work.

Quick start

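from transformers import pipeline

question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
# Load this repository from the Hub; device="cuda" assumes a CUDA GPU is available.
# This repository is a PEFT adapter, so the peft package likely needs to be installed as well.
generator = pipeline("text-generation", model="win10/gemma-4-31B-K1", device="cuda")
# Pass the prompt as a chat message; return_full_text=False returns only the newly generated reply.
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
print(output["generated_text"])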

Training procedure

This model was trained with supervised fine-tuning (SFT).

Framework versions

PEFT: 0.18.1
TRL: 0.24.0
Transformers: 5.6.0.dev0
PyTorch: 2.11.0+cu128
Datasets: 4.3.0
Tokenizers: 0.22.2
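
To reproduce the environment, it may help to compare locally installed packages against the versions listed above. The following is a minimal sketch using only the Python standard library. The helper names (`installed_version`, `compare_versions`) are hypothetical, and the dictionary keys assume the usual PyPI distribution names (for example `torch` for PyTorch). Development and CUDA-specific builds such as `5.6.0.dev0` or `2.11.0+cu128` may not match exactly on other machines, so a mismatch is a hint to investigate rather than an error.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions copied from the "Framework versions" list in this card.
# Keys are assumed to be the PyPI distribution names.
EXPECTED_VERSIONS = {
    "peft": "0.18.1",
    "trl": "0.24.0",
    "transformers": "5.6.0.dev0",
    "torch": "2.11.0+cu128",
    "datasets": "4.3.0",
    "tokenizers": "0.22.2",
}


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def compare_versions(expected, lookup=installed_version):
    """Map each package name to a tuple (expected, installed, matches)."""
    report = {}
    for package, wanted in expected.items():
        found = lookup(package)
        report[package] = (wanted, found, found == wanted)
    return report


if __name__ == "__main__":
    for package, (wanted, found, ok) in compare_versions(EXPECTED_VERSIONS).items():
        status = "ok" if ok else "differs"
        print(f"{package}: expected {wanted}, installed {found or 'not installed'} ({status})")
```

The `lookup` parameter exists only so the comparison can be exercised without installing anything; in normal use the default reads the versions from the active environment.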

Model tree for win10/gemma-4-31B-K1

Base model: google/gemma-4-31B-it
Finetuned: coder3101/gemma-4-31B-it-heretic
Adapter: this model