tinymemorylm
CompactAI commited on
Commit
b3d54fb
·
verified ·
1 Parent(s): 2d2909c

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -26
README.md CHANGED
@@ -1,52 +1,53 @@
1
  ---
2
- license: mit
3
  new_version: CompactAI-O/Glint-1
4
  ---
5
 
6
- Note: You must use the custom python script to run this model properly, you can download it from [here](https://huggingface.co/spaces/CompactAI-O/Homepage) by going into the downloads option and scrolling down.
7
- # TMLM-Haiku-2.3
 
8
 
9
  **It speaks. It actually speaks. Mostly.**
10
 
11
- We have come so far. From the dark ages of `couldcouldoldbloodblood` to actual, coherent, structured sentences. This is TMLM-Haiku-2.3. It is 1 million parameters. It is small. It is trying its best. And unlike its ancestors, it usually succeeds.
12
 
13
- ## Quick Stats
14
 
15
- - **Parameters:** 1,000,000 (Yes, really. 1M.)
16
  - **Training Tokens:** 10 Billion
17
  - **Context Window:** 2048 tokens
18
- - **Vibe:** Chaotic good, but mostly good.
19
 
20
  ## What Is This?
21
 
22
- Haiku-2.3 is the latest evolution of the TMLM-Haiku series. It builds on Haiku-2 by adding **SPIN (Self-Play Fine-Tuning)** to the training loop. This model represents a **3x improvement** in combined performance score over the original Haiku. Coherence has jumped from 1.99 to 6.03. Relevance is no longer zero. It is a miracle.
23
 
24
- ### The Journey
25
 
26
- | Model | Era | Typical Output | Combined Score |
27
  | :--- | :--- | :--- | :--- |
28
- | **Haiku-1** | The Dark Ages | `couldcouldoldbloodbloodbodybody` | 1.62 |
29
- | **Haiku-1.3** | The Pipe Character Incident | `\|fdish\|\|\|\|\|!@\|` | 1.21 |
30
- | **Haiku-2** | The Awakening | `It is about **competent development**...` | 3.87 |
31
- | **Haiku-2.3 (SPIN)** | **Current Era** | `The artificial intelligence is a problem...` | **4.84** |
32
 
33
- **Expected Output:**
34
  > "The simple terms arrived in simulant explorers and honey are specific or forecasters. They allow the structure of their similar..."
35
 
36
  ## Disclaimer
37
 
38
- This is a **1 million parameter model**.
39
  - It is not GPT-5.
40
  - It is not GPT-2.
41
  - It is a tiny neural network running on a prayer and a GPU.
42
- - It might still output `chuamliamce` occasionally. If it does, just try again. It is shy.
43
- - For best results, use temperature around 0.7. If you crank it to 2.0, you are on your own.
44
 
45
  ## Benchmarks
46
 
47
- We benchmarked Haiku-2.3 against all previous versions using a standard 7-question suite.
48
 
49
- | Metric | Haiku-1 | Haiku-1.3 | Haiku-2 | **Haiku-2.3 (SPIN)** |
50
  | :--- | :---: | :---: | :---: | :---: |
51
  | **Fluency** | 0.50 | 1.69 | 8.35 | **8.78** |
52
  | **Coherence** | 1.99 | 1.56 | 5.72 | **6.03** |
@@ -56,17 +57,16 @@ We benchmarked Haiku-2.3 against all previous versions using a standard 7-questi
56
 
57
  ## Related Models
58
 
59
- Check out the rest of the family:
60
- - [TMLM-Haiku-1](https://huggingface.co/CompactAI-O/TMLM-Haiku-1) (The ancestor)
61
- - [TMLM-Haiku-1.3](https://huggingface.co/CompactAI-O/TMLM-Haiku-1.3) (The pipe character one)
62
- - [TMLM-Haiku-2](https://huggingface.co/CompactAI-O/TMLM-Haiku-2) (The breakthrough)
63
 
64
  ## Acknowledgments
65
 
66
- Built with curiosity over compute. Trained on FineWeb-Edu. SPIN optimized. And a lot of hope.
67
 
68
  ---
69
 
70
  **Built by [CompactAI](https://huggingface.co/CompactAI-O).**
71
  *If you like tiny models that try their best, give us a follow.*
72
- ```
 
1
  ---
2
+ license: gpl-3.0
3
  new_version: CompactAI-O/Glint-1
4
  ---
5
 
6
+ Note: You need the custom python script to run this model. Grab it from [the model runner Space](https://huggingface.co/spaces/CompactAI-O/CompactAIModelRunner).
7
+
8
+ # Glint-0.4
9
 
10
  **It speaks. It actually speaks. Mostly.**
11
 
12
+ We came so far. From the dark ages of `couldcouldoldbloodblood` to actual, coherent sentences. This is Glint-0.4. 1 million parameters. Small. Trying its best. Unlike its ancestors, it usually succeeds.
13
 
14
+ ## Quick Stats
15
 
16
+ - **Parameters:** 1,000,000 (yes, really)
17
  - **Training Tokens:** 10 Billion
18
  - **Context Window:** 2048 tokens
19
+ - **Vibe:** Chaotic good, but mostly good
20
 
21
  ## What Is This?
22
 
23
+ Glint-0.4 is the latest of the old guard before we started the Glint line proper. It builds on Glint-0.3 by adding SPIN (Self-Play Fine-Tuning) to the training loop. This model represents a 3x improvement in combined score over the original Glint-0.1. Coherence jumped from 1.99 to 6.03. Relevance is no longer zero. It is a miracle.
24
 
25
+ ### The journey
26
 
27
+ | Model | Era | Typical Output | Combined |
28
  | :--- | :--- | :--- | :--- |
29
+ | **Glint-0.1** | The Dark Ages | `couldcouldoldbloodbloodbodybody` | 1.62 |
30
+ | **Glint-0.2** | Pipe Character Incident | `\|fdish\|\|\|\|\|!@\|` | 1.21 |
31
+ | **Glint-0.3** | The Awakening | `It is about **competent development**...` | 3.87 |
32
+ | **Glint-0.4 (SPIN)** | Current Era | `The artificial intelligence is a problem...` | **4.84** |
33
 
34
+ **Expected output:**
35
  > "The simple terms arrived in simulant explorers and honey are specific or forecasters. They allow the structure of their similar..."
36
 
37
  ## Disclaimer
38
 
39
+ This is a 1 million parameter model.
40
  - It is not GPT-5.
41
  - It is not GPT-2.
42
  - It is a tiny neural network running on a prayer and a GPU.
43
+ - It might still output `chuamliamce`. If it does, try again. It is shy.
44
+ - For best results, use temperature around 0.7. At 2.0, you are on your own.
45
 
46
  ## Benchmarks
47
 
48
+ We benchmarked Glint-0.4 against all previous versions using a standard 7-question suite. Yes, 7 questions. We kept it small because we were running on a laptop.
49
 
50
+ | Metric | Glint-0.1 | Glint-0.2 | Glint-0.3 | **Glint-0.4 (SPIN)** |
51
  | :--- | :---: | :---: | :---: | :---: |
52
  | **Fluency** | 0.50 | 1.69 | 8.35 | **8.78** |
53
  | **Coherence** | 1.99 | 1.56 | 5.72 | **6.03** |
 
57
 
58
  ## Related Models
59
 
60
+ - [Glint-0.1](https://huggingface.co/CompactAI-O/Glint-0.1) (the ancestor)
61
+ - [Glint-0.2](https://huggingface.co/CompactAI-O/Glint-0.2) (the pipe character one)
62
+ - [Glint-0.3](https://huggingface.co/CompactAI-O/Glint-0.3) (the breakthrough)
63
+ - [Glint-1](https://huggingface.co/CompactAI-O/Glint-1) (the current gen)
64
 
65
  ## Acknowledgments
66
 
67
+ Built with curiosity over compute. Trained on FineWeb-Edu. SPIN optimized. A lot of hope.
68
 
69
  ---
70
 
71
  **Built by [CompactAI](https://huggingface.co/CompactAI-O).**
72
  *If you like tiny models that try their best, give us a follow.*