A test checkpoint of an attempt to distill the style of Kimi K2 Instruct, insofar as possible at this size, by fine-tuning Granite 4.0 h 1b.
Long sequences made Granite prone to looping, so I generated the datasets with Kimi under a system prompt requesting brevity, then filtered them to a maximum of 1000 tokens per single assistant response.
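The length filter can be sketched roughly as follows. This is a hypothetical illustration, not the actual preprocessing script: the whitespace-based `count_tokens` is a stand-in for counting with the real tokenizer, and the example data and field names (`messages`, `role`, `content`) assume a standard chat-format dataset.

```python
# Hypothetical sketch of the length filter: keep only examples whose
# single assistant turn is at most 1000 tokens.
MAX_TOKENS = 1000

def count_tokens(text: str) -> int:
    # Placeholder token count; in practice, use the model tokenizer's
    # input_ids length instead of a whitespace split.
    return len(text.split())

def keep_example(example: dict) -> bool:
    # Expect exactly one assistant turn, capped at MAX_TOKENS.
    assistant_turns = [m for m in example["messages"] if m["role"] == "assistant"]
    if len(assistant_turns) != 1:
        return False
    return count_tokens(assistant_turns[0]["content"]) <= MAX_TOKENS

# Toy dataset: the second example's assistant reply is far over the cap.
dataset = [
    {"messages": [{"role": "user", "content": "hi"},
                  {"role": "assistant", "content": "Hello there."}]},
    {"messages": [{"role": "user", "content": "essay"},
                  {"role": "assistant", "content": "word " * 2000}]},
]
filtered = [ex for ex in dataset if keep_example(ex)]
```

With the real tokenizer swapped in, the same predicate can be passed to `datasets.Dataset.filter`.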
This is clearly not the final checkpoint, but I'd appreciate testing to see what needs improvement.