LH-Tech-AI commited on
Commit
537b7d7
·
verified ·
1 Parent(s): cd197e3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +52 -0
README.md CHANGED
@@ -1,3 +1,55 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ datasets:
4
+ - HuggingFaceFW/fineweb-edu
5
+ language:
6
+ - en
7
+ pipeline_tag: text-generation
8
+ library_name: transformers
9
+ tags:
10
+ - small
11
+ - cpu
12
+ - supra
13
+ - tiny
14
+ - mini
15
+ - open
16
+ - open-source
17
  ---
18
+
19
+ # 🦅 Supra Mini 0.1M
20
+ Supra Mini 0.1M is a very tiny base model trained on 500 million tokens of Fineweb-Edu for 2 epochs to prove how well very tiny models can perform on world knowledge.
21
+
22
+ ## Benchmarks
23
+
24
+ All benchmarks were executed using `lm-eval`.
25
+
26
+ | Task | Value | Rating |
27
+ | :------------ | :----------: | ---------: |
28
+ | Arc_Easy | 0.2x | RATING IN WORDS HERE |
29
+ | Wikitext | xx | RATING IN WORDS HERE |
30
+ | BLiMP | 5x | RATING IN WORDS HERE |
31
+
32
+ ## Examples
33
+ **Prompt:** PROMPT_HERE<br>
34
+ **Output:**: OUTPUT_HERE
35
+ <br><br>
36
+ **Prompt:** PROMPT_HERE<br>
37
+ **Output:**: OUTPUT_HERE
38
+ <br><br>
39
+ **Prompt:** PROMPT_HERE<br>
40
+ **Output:**: OUTPUT_HERE
41
+
42
+ ## Usage
43
+ To use our model, just run this code using HF Transformers to execute the model:
44
+ ```python3
45
+ ### CODE HERE
46
+ ```
47
+
48
+ ## Training guide
49
+ We trained Supra Mini 0.1M on a single T4 GPU in ~45 minutes for 2 epochs.<br>
50
+ The full training code can be found in this repo as `train_tokenizer.py` (train costum BPE tokenizer with vocab size of 250), `train.py` (train the model) and `inference.py` (test the model).<br>
51
+ The model was trained on the first 500 million tokens of Sample-10BT from Fineweb-Edu using streaming tokenization.
52
+
53
+ ## Final thoughts
54
+ As the new founded organization **SupraLabs**, we are proud the introduce our first Tiny-LLM to prove that our pipeline is running.<br>
55
+ More models will release soon...