Spaces:
Sleeping
Sleeping
A newer version of the Streamlit SDK is available: 1.57.0
metadata
title: ML Model Trainer
emoji: 🤖
colorFrom: blue
colorTo: green
sdk: streamlit
sdk_version: 1.28.0
app_file: app.py
pinned: false
🤖 ML Model Trainer
Free tool to generate training scripts for fine-tuning open-source LLMs.
Features
- SFT (Supervised Fine-Tuning) - Full model fine-tuning
- DPO (Direct Preference Optimization) - Preference alignment
- LoRA - Parameter-efficient fine-tuning
Supported Models
| Size | Models |
|---|---|
| Small (0.5-1.5B) | Qwen2.5-0.5B, Qwen2.5-1.5B, Llama-3.2-1B, Phi-3-mini, Gemma-2B |
| Medium (7B) | Qwen2.5-7B, Llama-3.2-3B, Mistral-7B |
Public Datasets
- HuggingFaceH4/ultrachat_200k
- openai/gsm8k
- meta-math/MATH
- anthropic/hh-rlhf
How to Use
- Select model, training method, and dataset
- Configure hyperparameters (epochs, learning rate, batch size)
- Generate training script
- Copy and run locally or on Hugging Face Jobs
Requirements
pip install transformers trl torch datasets accelerate peft