YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

🧠 Myanmar LLM Training

Fine-tune Qwen2.5-0.5B-Instruct with Myanmar language dataset.

⚑ No License Required!

This model is fully open. No Llama license needed!

πŸ“‹ Requirements

  • Python 3.8+
  • GPU with 6GB+ VRAM
  • HuggingFace Account

πŸš€ Quick Start

1. Install dependencies

pip install -r requirements.txt

2. Login to HuggingFace

huggingface-cli login

3. Run training

python train.py

βš™οΈ Configuration

Parameter Default Description
MODEL_NAME Qwen/Qwen2.5-0.5B-Instruct Base model (fully open!)
num_train_epochs 3 Training iterations
per_device_train_batch_size 4 Batch size
gradient_accumulation_steps 4 Effective batch = 16
learning_rate 2e-5 Learning rate

πŸ“Š Features

  • βœ… Fully open model - α€œα€­α€―α€„α€Ία€…α€„α€Ία€™α€œα€­α€―α€•α€«α€žα€Šα€Ία‹
  • βœ… FP16 precision - α€•α€­α€―α€™α€­α€―α€™α€Όα€”α€Ία€•α€«α€žα€Šα€Ία‹
  • βœ… Gradient checkpointing - Memory α€α€»α€½α€±α€α€¬α€•α€«α€žα€Šα€Ία‹
  • βœ… Test/Validation evaluation - α€”α€Ύα€…α€Ία€α€―α€œα€―α€Άα€Έα€‘α€α€½α€€α€Ί α€…α€™α€Ία€Έα€žα€•α€Ία€•α€«α€žα€Šα€Ία‹

πŸ“Š Training Data

Dataset: amkyawdev/AmkyawDev-Dataset

Split Samples
Train ~29,100
Validation ~29,100
Test ~29,100

Note: Each file (train.jsonl, test.jsonl, validation.jsonl) has ~29,100 conversations!

πŸ’Ύ Output

Trained model saved to ./myanmar-qwen-output/

πŸ“€ Upload to HuggingFace

cd myanmar-qwen-output
huggingface-cli upload amkyawdev/my-myanmar-qwen . --repo-type model

πŸ–₯️ Google Colab

# Install
!pip install transformers datasets torch accelerate

# Login
from huggingface_hub import login
login("YOUR_TOKEN")

# Run
%run train.py

Built by amkyawdev

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support