Pin / README.md
LH-Tech-AI's picture
Update README.md
f0d2786 verified
metadata
datasets:
  - starhopp3r/TinyChat
language:
  - en
pipeline_tag: text-generation
tags:
  - tiny
  - chat
  - small
  - cpu
  - open
  - open-source
  - gpt2
  - gpt

Welcome to ๐Ÿค Pin Model Series

The models in the LH-Tech AI Pin Series are very small models that were trained on starhopp3r/TinyChat.

Models overview

Model Parameters Training iters Final Train Loss Quality Example Chat
Pin-5M 5.37M 1000 3.170788 Very Poor Yes, a bright day is shining and makes everything have a good day a lot.
Pin-10M 10.06M 1500 2.562048 Very Poor That sounds nice, I agree, it is nice to talk about new ideas.
Pin-15M 14.84M 1500 2.358367 Low It is hard to see your plans when you want to enjoy the day.
Pin-20M 21.03M 1500 2.217588 Medium Yes, sunny days are wonderful! I love hearing about the sunshine and the sun's shining on.
Pin-25M 26.76M 1500 2.139837 Medium Sunny days make everything look brighter, especially with a nice friend who cares.
Pin-Ultra-25M 26.76M 8000 ... ... Coming soon...

* All models were prompted with What is the weather like today?.

๐Ÿš€ We recommend using Pin-Ultra-25M.

Training

Training data

We trained on starhopp3r/TinyChat and used the gpt-2 tokenizer.

Training code/scripts

You can find the full training code for the Pin Model Series in this repo.
Tip: If you want to train one of these models yourself, make sure to adjust the model config like this:

Model n_layer n_head n_embd n_inner
Pin-5M 4 8 96 384
Pin-10M 6 8 160 640
Pin-15M 8 8 208 832
Pin-20M 10 8 256 1024
Pin-25M 12 12 288 1152
Pin-Ultra-25M 12 12 288 1152

Have fun :D

Training details

We trained all these models in ~30 minutes on a single T4 GPU in a Kaggle Session.
So you are able to easily recreate all of the Pin model without having to launch a 8xH100 cluster ๐Ÿ˜‚

How to use the model

You can easily use the favorite model of the Pin series like this:

  1. Download use.py from this repo.
  2. Adjust the subfolder argument here:
    answer = run_pin_inference(user_query, model_id="LH-Tech-AI/Pin", subfolder="Pin-25M") # use your favorite model here, e.g. "Pin-25M" or "Pin-15M"...
    
  3. Adjust the input prompt here:
    user_query = "What is the weather like today?" # insert your prompt here
    
  4. Launch it with Torch installed in Python3.
  5. Have fun :D

Acknowledgements...

...to:

  • HF Transformers
  • Kaggle for the T4 GPU
  • starhopp3r for his TinyChat dataset