LH-Tech-AI commited on
Commit
8f95b93
·
verified ·
1 Parent(s): aedb965

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -102,5 +102,5 @@ print("\nOutput:\n" + generate_text(test_prompt))
102
 
103
  ## Training guide
104
  We trained Supra Mini v3 0.5M on a single NVIDIA RTX 5060 Ti 16GB in ~1 hour for 2 epochs.<br>
105
- The full training code can be found in this repo as `run.sh` (easily run the complete pipeline), `train_tokenizer.py` (train costum BPE tokenizer with vocab size of 2048), `train.py` (train the model) and `inference.py` (test the model).<br>
106
  The model was trained on the first 1 billion tokens of Sample-10BT from Fineweb-Edu using streaming tokenization.
 
102
 
103
  ## Training guide
104
  We trained Supra Mini v3 0.5M on a single NVIDIA RTX 5060 Ti 16GB in ~1 hour for 2 epochs.<br>
105
+ The full training code can be found in this repo as `train_tokenizer.py` (train costum BPE tokenizer with vocab size of 4096), `train.py` (train the model) and `inference.py` (test the model).<br>
106
  The model was trained on the first 1 billion tokens of Sample-10BT from Fineweb-Edu using streaming tokenization.