Amkyw-Myanmar-LLMV1

A fine-tuned large language model for Myanmar (Burmese) language processing and generation.

📋 Overview

Amkyw-Myanmar-LLMV1 is a language model fine-tuned for Myanmar (Burmese) language tasks, including:

  • Text generation in Myanmar script
  • Conversational AI for Myanmar speakers
  • Myanmar language understanding and processing

The model is fine-tuned from a base LLM (for example Llama or Gemma) on the Myanmar LLM Dataset, which is hosted on GitHub.

πŸ—οΈ Architecture

  • Base Model: [To be specified - e.g., meta-llama/Llama-2-7b-hf]
  • Fine-tuning Method: LoRA (Low-Rank Adaptation)
  • Tokenization: Custom Myanmar tokenizer
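
As a rough illustration of what LoRA does (this is not the model's actual training code): the base weight matrix W stays frozen, and two small matrices A and B are trained so that the effective weight becomes W + (alpha / r) * B A, where r is the adapter rank. The shapes, rank, and alpha below are made-up illustrative values, and the helper names `matmul`, `lora_merge`, and `count_params` are hypothetical.

```python
# Minimal pure-Python sketch of a LoRA weight update.
# All shapes and values are illustrative assumptions, not this model's settings.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def lora_merge(w, lora_a, lora_b, alpha):
    """Return W + (alpha / r) * (B @ A), where r is the adapter rank."""
    r = len(lora_a)                 # A has shape (r, in_features)
    scale = alpha / r
    delta = matmul(lora_b, lora_a)  # (out, r) @ (r, in) -> (out, in)
    return [[wv + scale * dv for wv, dv in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]

def count_params(out_features, in_features, r):
    """Return (full matrix parameters, LoRA factor parameters)."""
    return out_features * in_features, r * (in_features + out_features)

# Frozen 2x3 base weight and a rank-1 adapter.
W = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0]]
A = [[1.0, 2.0, 3.0]]   # shape (r=1, in=3)
B = [[0.5], [0.0]]      # shape (out=2, r=1)

merged = lora_merge(W, A, B, alpha=2.0)
full, lora = count_params(4096, 4096, r=8)
print(merged)
print(full, lora)
```

The parameter count shows why the method is attractive for fine-tuning: for a hypothetical 4096 x 4096 layer at rank 8, the adapter trains far fewer values than the full matrix.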

📦 Model Files

model/
├── config.json               # Model architecture configuration
├── tokenizer.json            # Custom tokenizer
├── tokenizer_config.json     # Tokenizer settings
├── vocab.json                # Vocabulary file
├── model.safetensors         # Model weights (after training)
├── adapter_config.json       # LoRA adapter configuration
└── adapter_model.safetensors # LoRA adapter weights (after training)
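
Because the weight files only exist after training, it can help to check which of them are present in a local copy before loading anything. The sketch below uses the file names from the layout above; grouping them into "full model" and "LoRA adapter" sets, and the helper name `describe_checkpoint`, are assumptions made for illustration.

```python
import tempfile
from pathlib import Path

# File names come from the layout above; the grouping is an assumption.
FULL_MODEL_FILES = {"config.json", "model.safetensors"}
ADAPTER_FILES = {"adapter_config.json", "adapter_model.safetensors"}

def describe_checkpoint(model_dir):
    """Report whether a directory holds full weights, a LoRA adapter, or both."""
    present = {p.name for p in Path(model_dir).iterdir() if p.is_file()}
    return {
        "full_model": FULL_MODEL_FILES <= present,
        "lora_adapter": ADAPTER_FILES <= present,
        "missing": sorted((FULL_MODEL_FILES | ADAPTER_FILES) - present),
    }

# Demo on a temporary directory that contains only the adapter files.
with tempfile.TemporaryDirectory() as tmp:
    for name in ADAPTER_FILES:
        (Path(tmp) / name).touch()
    report = describe_checkpoint(tmp)

print(report)
```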

🚀 Quick Start

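from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the tokenizer and model from the Hugging Face Hub
model_name = "amkyawdev/Amkyw-Myanmar-LLMV1"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Tokenize a Burmese prompt and generate up to 100 new tokens
prompt = "မြန်မာစာနဲ့ စာရေးပါစိုးပါနော်"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=100)

# Decode the output, dropping special tokens such as BOS/EOS
print(tokenizer.decode(outputs[0], skip_special_tokens=True))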
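
One simple sanity check on generated text is to measure how much of it is actually in Myanmar script. The sketch below is an optional helper and not part of this repository: it counts characters in the main Unicode Myanmar block and the Extended-A and Extended-B blocks. The function names and any threshold you apply to the ratio are assumptions.

```python
# Unicode ranges for Myanmar script: the main block plus Extended-A and Extended-B.
MYANMAR_RANGES = (
    (0x1000, 0x109F),  # Myanmar
    (0xAA60, 0xAA7F),  # Myanmar Extended-A
    (0xA9E0, 0xA9FF),  # Myanmar Extended-B
)

def is_myanmar_char(ch):
    """Return True if the character falls in one of the ranges above."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in MYANMAR_RANGES)

def myanmar_ratio(text):
    """Fraction of non-whitespace characters that are Myanmar script."""
    chars = [ch for ch in text if not ch.isspace()]
    if not chars:
        return 0.0
    return sum(is_myanmar_char(ch) for ch in chars) / len(chars)

# Example: a mixed string with two Latin and two Myanmar characters.
mixed_ratio = myanmar_ratio("ab \u1019\u102c")
print(mixed_ratio)
```

In practice this could be applied to the decoded model output, for instance flagging generations whose ratio is unexpectedly low.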

🧪 Training

Dataset

This model was fine-tuned on the Myanmar LLM Dataset, whose train, validation, and test splits can be loaded directly from GitHub:

from datasets import load_dataset

dataset = load_dataset(
    "json",
    data_files={
        "train": "https://raw.githubusercontent.com/amkyawdev/myanmar-llm-dataset/main/data/processed/train.jsonl",
        "validation": "https://raw.githubusercontent.com/amkyawdev/myanmar-llm-dataset/main/data/processed/validation.jsonl",
        "test": "https://raw.githubusercontent.com/amkyawdev/myanmar-llm-dataset/main/data/processed/test.jsonl"
    }
)
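
The splits are JSON Lines files, so each line is one standalone JSON object. If you have downloaded a split and want to inspect it without the datasets library, a sketch like the following counts records and lists the field names in use. The sample records and their field names (`instruction`, `output`) are hypothetical; the real files may use a different schema.

```python
import json
from collections import Counter

def summarize_jsonl(lines):
    """Count records and the field names they use in JSON Lines input."""
    n_records = 0
    field_counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines
        record = json.loads(line)
        n_records += 1
        field_counts.update(record.keys())
    return n_records, dict(field_counts)

# Hypothetical sample; the real files may use different field names.
sample = [
    '{"instruction": "a", "output": "b"}',
    '',
    '{"instruction": "c", "output": "d"}',
]
count, fields = summarize_jsonl(sample)
print(count, fields)
```

The function accepts any iterable of lines, so an open file handle (opened with `encoding="utf-8"`) can be passed in place of the sample list.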

πŸ“ Project Structure

Amkyw-Myanmar-LLMV1/
├── README.md
├── LICENSE
├── .gitignore
├── .gitattributes
├── model/
├── training/
├── inference/
├── tests/
├── scripts/
└── .github/workflows/

πŸ“ License

MIT License


This model was created by amkyawdev
