A newer version of this model is available: canbingol/gemma3_1B_base-tr-cpt-2nd_epoch_stage1

Model Card: Gemma3-1B Turkish CPT (50K Subset, 1 Epoch – Stage 1)

Overview

This model is a Turkish Continued Pretraining (CPT) variant of google/gemma-3-1b-pt.

The base model was further trained for 1 epoch on the first 50,000 samples of a Turkish web corpus.
This stage exposes the model to more data than small-subset experiments do and aims to improve:

  • Turkish fluency
  • Vocabulary coverage
  • Domain familiarity
  • Statistical adaptation to the distribution of Turkish web text

This release corresponds to Stage 1 of a multi-stage CPT pipeline and is intended for research and experimental analysis.


Base Model

  • google/gemma-3-1b-pt

Training Setup

  • Dataset: canbingol/vngrs-web-corpus-200k
  • Subset Used: First 50,000 samples
  • Training Objective: Continued Pretraining
  • Epochs: 1
  • Data Regime: Plain text
  • Token Count (approximate): 21.5M tokens
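
The figures above can be turned into a few quick sanity checks. The sketch below is illustrative only and is not the author's training script: the split name "train", the helper names, and the reading of "200k" in the dataset name as roughly 200,000 total samples are all assumptions. The slice syntax is the standard one from the Hugging Face datasets library.

```python
# Figures quoted in the Training Setup list.
SUBSET_SAMPLES = 50_000
APPROX_TOKENS = 21_500_000  # approximate, as stated in the card

# ASSUMPTION: "200k" in the dataset name means about 200,000 samples in total.
ASSUMED_CORPUS_SAMPLES = 200_000


def stage1_split(n_samples: int = SUBSET_SAMPLES, split: str = "train") -> str:
    """Build a `datasets` slice expression selecting the first n samples.

    ASSUMPTION: the corpus exposes a split called "train".
    """
    return f"{split}[:{n_samples}]"


def load_stage1_subset():
    """Load the first 50,000 rows (hypothetical helper; needs `datasets` and network access)."""
    from datasets import load_dataset  # imported lazily so the arithmetic below runs offline

    return load_dataset("canbingol/vngrs-web-corpus-200k", split=stage1_split())


# Average document length implied by the card's numbers.
mean_tokens_per_sample = APPROX_TOKENS / SUBSET_SAMPLES

# Share of the full corpus seen in Stage 1, under the size assumption above.
corpus_fraction = SUBSET_SAMPLES / ASSUMED_CORPUS_SAMPLES

print(f"slice expression: {stage1_split()}")
print(f"mean tokens per sample: {mean_tokens_per_sample:.0f}")
print(f"share of corpus used: {corpus_fraction:.0%}")
```

Under these assumptions, a sample averages about 430 tokens and Stage 1 covers roughly a quarter of the corpus. Both numbers inherit the approximation in the stated token count.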

Usage Example

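import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "canbingol/gemma3_1B_base-tr-cpt-1epoch_stage1"

# Use a GPU when one is available, otherwise fall back to CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"

# Load the weights and the matching tokenizer from the Hub.
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

model = model.to(device)

# Turkish prompt meaning "from now on"; this is a base model, so it continues the text.
prompt = "bundan böyle"
inputs = tokenizer(prompt, return_tensors="pt").to(device)

# Nucleus sampling: output differs between runs unless a seed is set.
outputs = model.generate(
    **inputs,
    max_new_tokens=50,
    do_sample=True,
    temperature=0.8,
    top_p=0.9,
)

# Decode the full sequence (prompt plus continuation) back to text.
generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(generated_text)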
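
Since the card targets research and experimental analysis, one rough way to check whether CPT helped Turkish fluency is to compare the perplexity of the base model and this model on the same Turkish text. The sketch below is not an evaluation used by the author. The helper names and the sample sentence are hypothetical, and a single sentence is far too little text for a meaningful comparison. It relies on the standard transformers behaviour that passing labels to a causal LM returns the mean next-token cross-entropy.

```python
import math


def perplexity_from_loss(mean_nll: float) -> float:
    """Convert a mean per-token negative log-likelihood (in nats) to perplexity."""
    return math.exp(mean_nll)


def text_perplexity(model_id: str, text: str, device: str = "cpu") -> float:
    """Perplexity of `text` under `model_id` (hypothetical helper; downloads the model)."""
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id).to(device).eval()

    inputs = tokenizer(text, return_tensors="pt").to(device)
    with torch.no_grad():
        # With labels set to the input ids, the model shifts them internally
        # and returns the mean cross-entropy over predicted tokens.
        out = model(**inputs, labels=inputs["input_ids"])
    return perplexity_from_loss(out.loss.item())


if __name__ == "__main__":
    # Hypothetical sample text; use held-out Turkish documents for a real comparison.
    sample = "Türkiye'nin başkenti Ankara'dır."
    for model_id in (
        "google/gemma-3-1b-pt",
        "canbingol/gemma3_1B_base-tr-cpt-1epoch_stage1",
    ):
        print(model_id, text_perplexity(model_id, sample))
```

Lower perplexity on held-out Turkish text would be consistent with better adaptation, but it does not by itself establish gains in downstream quality.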