LH-Tech-AI commited on
Commit
f16ee11
·
verified ·
1 Parent(s): b129b0b

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -0
README.md ADDED
@@ -0,0 +1,46 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - HuggingFaceFW/fineweb-edu
4
+ language:
5
+ - en
6
+ pipeline_tag: text-generation
7
+ tags:
8
+ - base
9
+ - small
10
+ - cpu
11
+ - open-source
12
+ - open
13
+ - spark
14
+ - lh-tech
15
+ - llm
16
+ - llama
17
+ - tiny
18
+ ---
19
+
20
+ # ✨ Spark v4
21
+ Today, we are introducing Spark v4, a 5M parameter Llama base model trained on 0.7B tokens of the Sample-10BT of Fineweb-Edu.
22
+
23
+ ## Results
24
+ - Final Loss / Val Loss: ~3.1 / 3.108
25
+ - Output quality: 5/10
26
+ - PIQA: 0.5593
27
+ - LAMBADA (PPL): 588.26
28
+ - HellaSwag: 0.2695
29
+
30
+ ## Usage
31
+ You can use the model with `use.py`.
32
+
33
+ ## Example output
34
+ **Input:** The main concept of physics<br>
35
+ **Output:** *is that it gives some unlimited means to think about the universe. It helps us not only to think about how the universe is created but also how we think about the universe. In this way, an inner universe can be made to our own universe. This is because it is not a matter of fact and that is the object of the universe. It can take a lot of time to understand how it is created and why it must be made.
36
+ In the first place, the Universe is a complex and interesting part of it. It can be a kind of a real, creative, and universal part of our universe. It can be just that the universe was created. It could be a kind of universe. It can be a kind of kind of complex concept. That could be something that does something that really needs to be a kind of universe, or something that*
37
+
38
+ ## Training code
39
+ The full training code can be found as `train.ipynb` in this repo.
40
+
41
+ ## Training specs
42
+ - GPU: 2x T4 on Kaggle
43
+ - Time: ~4 hours
44
+ - more detailed info in the notebook :-)
45
+
46
+ ## Have fun :D