LH-Tech-AI committed on
Commit 74d1409 · verified · 1 parent: 6f975a2

Update README.md

Files changed (1)
  1. README.md +2 -1
README.md CHANGED
@@ -15,6 +15,7 @@ tags:
 - mini
 - open
 - open-source
+new_version: SupraLabs/Supra-Mini-v5-8M
 ---
 
 # 🦅 Supra Mini v4 2M
@@ -100,4 +101,4 @@ print("\nOutput:\n" + generate_text(test_prompt))
 ## Training guide
 We trained Supra Mini v4 2M on a single NVIDIA RTX 5060 Ti 16GB in ~3 hours for 2 epochs.<br>
 The full training code can be found in this repo as `train_tokenizer.py` (train costum BPE tokenizer with vocab size of 8192), `train.py` (train the model) and `inference.py` (test the model).<br>
-The model was trained on the first 3 billion tokens of Sample-10BT from Fineweb-Edu using streaming tokenization.
+The model was trained on the first 3 billion tokens of Sample-10BT from Fineweb-Edu using streaming tokenization.
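
The README line above mentions training on the first 3 billion tokens via streaming tokenization. The diff does not show how `train.py` does this, so the following is only a minimal sketch of the general idea, not the repo's implementation: tokenize documents lazily and stop once a token budget is reached. The names `stream_token_budget` and `toy_encode` are hypothetical; in practice `texts` might come from a streaming dataset loader and `encode` from the trained BPE tokenizer.

```python
from typing import Callable, Iterable, Iterator, List


def stream_token_budget(
    texts: Iterable[str],
    encode: Callable[[str], List[int]],
    max_tokens: int,
) -> Iterator[int]:
    """Yield token ids from a document stream, stopping at max_tokens.

    Documents are pulled one at a time, so the full corpus never has to
    fit in memory (hypothetical helper, not taken from the repo).
    """
    emitted = 0
    for text in texts:
        for token_id in encode(text):
            if emitted >= max_tokens:
                return
            yield token_id
            emitted += 1
        # Stop before pulling the next document once the budget is used up.
        if emitted >= max_tokens:
            return


def toy_encode(text: str) -> List[int]:
    # Stand-in for a real BPE tokenizer: one id (the word length) per word.
    return [len(word) for word in text.split()]
```

With the actual budget, `max_tokens` would be `3_000_000_000`; the toy encoder above exists only to keep the sketch runnable without the trained tokenizer.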