ml-trainer-space / README.md
rajkr's picture
Upload README.md
8892e63 verified

A newer version of the Streamlit SDK is available: 1.57.0

Upgrade
metadata
title: ML Model Trainer
emoji: 🤖
colorFrom: blue
colorTo: green
sdk: streamlit
sdk_version: 1.28.0
app_file: app.py
pinned: false

🤖 ML Model Trainer

Free tool to generate training scripts for fine-tuning open-source LLMs.

Features

  • SFT (Supervised Fine-Tuning) - Full model fine-tuning
  • DPO (Direct Preference Optimization) - Preference alignment
  • LoRA - Parameter-efficient fine-tuning

Supported Models

Size Models
Small (0.5-1.5B) Qwen2.5-0.5B, Qwen2.5-1.5B, Llama-3.2-1B, Phi-3-mini, Gemma-2B
Medium (7B) Qwen2.5-7B, Llama-3.2-3B, Mistral-7B

Public Datasets

  • HuggingFaceH4/ultrachat_200k
  • openai/gsm8k
  • meta-math/MATH
  • anthropic/hh-rlhf

How to Use

  1. Select model, training method, and dataset
  2. Configure hyperparameters (epochs, learning rate, batch size)
  3. Generate training script
  4. Copy and run locally or on Hugging Face Jobs

Requirements

pip install transformers trl torch datasets accelerate peft