Model Card for gemma-4-31B-K1
This is an experimental fine-tuning using a lot of iannicity/KIMI-K2.5-1000000x and iannicity/Hunter-Alpha-SFT.
If you enjoy my work, feel free to support me on Ko-fi with a coffee.
Every bit of your support directly helps me keep creating and spend more time making even better work:
Quick start
from transformers import pipeline
question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
generator = pipeline("text-generation", model="None", device="cuda")
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
print(output["generated_text"])
Training procedure
This model was trained with SFT.
Framework versions
- PEFT 0.18.1
- TRL: 0.24.0
- Transformers: 5.6.0.dev0
- Pytorch: 2.11.0+cu128
- Datasets: 4.3.0
- Tokenizers: 0.22.2
- Downloads last month
- 29