Spaces:

rajkr
/

ml-trainer-space

Sleeping

rajkr commited on 12 days ago

Commit

b85a115

verified ·

1 Parent(s): 868ce8f

Upload README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,12 +1,49 @@
 ---
-title: Ml Trainer Space
-emoji: 🌍
-colorFrom: gray
-colorTo: yellow
 sdk: gradio
-sdk_version: 6.13.0
 app_file: app.py
 pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: ML Model Trainer
+emoji: 🤖
+colorFrom: blue
+colorTo: green
 sdk: gradio
+sdk_version: 4.44.0
 app_file: app.py
 pinned: false
 ---
+# 🤖 ML Model Trainer
+Free tool to generate training scripts for fine-tuning open-source LLMs.
+## Features
+- **SFT** (Supervised Fine-Tuning) - Full model fine-tuning
+- **DPO** (Direct Preference Optimization) - Preference alignment
+- **LoRA** - Parameter-efficient fine-tuning
+## Supported Models
+| Size | Models |
+|------|--------|
+| Small (0.5-1.5B) | Qwen2.5-0.5B, Qwen2.5-1.5B, Phi-3-mini, Gemma-2B, Llama-3.2-1B |
+| Medium (7B) | Qwen2.5-7B, Llama-3.2-3B, Mistral-7B |
+| Large (14B+) | Qwen2.5-14B, Mixtral-8x7B |
+## Public Datasets
+- HuggingFaceH4/ultrachat_200k
+- openai/gsm8k
+- meta-math/MATH
+- anthropic/hh-rlhf
+- stanfordnlp/SHP
+## How to Use
+1. Select model, training method, and dataset
+2. Configure hyperparameters (epochs, learning rate, batch size)
+3. Generate training script
+4. Copy and run locally or on Hugging Face Jobs
+## Requirements
+```bash
+pip install transformers trl torch datasets accelerate peft
+```