LH-Tech-AI commited on
Commit
ba1bd8f
·
verified ·
1 Parent(s): e0ef0ed

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -60,7 +60,7 @@ To use our model, just run this code using HF Transformers to execute the model:
60
 
61
  ## Training guide
62
  We trained Supra Mini 0.1M on a single T4 GPU in ~45 minutes for 2 epochs.<br>
63
- The full training code can be found in this repo as `train_tokenizer.py` (train costum BPE tokenizer with vocab size of 250), `train.py` (train the model) and `inference.py` (test the model).<br>
64
  The model was trained on the first 500 million tokens of Sample-10BT from Fineweb-Edu using streaming tokenization.
65
 
66
  ## Final thoughts
 
60
 
61
  ## Training guide
62
  We trained Supra Mini 0.1M on a single T4 GPU in ~45 minutes for 2 epochs.<br>
63
+ The full training code can be found in this repo as `run.sh` (easily run the complete pipeline), `train_tokenizer.py` (train costum BPE tokenizer with vocab size of 250), `train.py` (train the model) and `inference.py` (test the model).<br>
64
  The model was trained on the first 500 million tokens of Sample-10BT from Fineweb-Edu using streaming tokenization.
65
 
66
  ## Final thoughts