---
license: gpl-3.0
new_version: CompactAI-O/Glint-1
---
Note: You need the custom Python script to run this model. Grab it from [the model runner Space](https://huggingface.co/spaces/CompactAI-O/CompactAIModelRunner).
# Glint-0.4
**It speaks. It actually speaks. Mostly.**
We have come so far. From the dark ages of `couldcouldoldbloodblood` to actual, coherent sentences. This is Glint-0.4. 1 million parameters. Small. Trying its best. Unlike its ancestors, it usually succeeds.
## Quick Stats
- **Parameters:** 1,000,000 (yes, really)
- **Training Tokens:** 10 Billion
- **Context Window:** 2048 tokens
- **Vibe:** Chaotic good, but mostly good
## What Is This?
Glint-0.4 is the latest of the old guard before we started the Glint line proper. It builds on Glint-0.3 by adding SPIN (Self-Play Fine-Tuning) to the training loop. Its combined score is roughly 3x that of the original Glint-0.1 (1.62 to 4.84). Coherence jumped from 1.99 to 6.03. Relevance is no longer zero, which is where Glint-0.2 and Glint-0.3 left it. It is a miracle.
### The journey
| Model | Era | Typical Output | Combined |
| :--- | :--- | :--- | :--- |
| **Glint-0.1** | The Dark Ages | `couldcouldoldbloodbloodbodybody` | 1.62 |
| **Glint-0.2** | Pipe Character Incident | `\|fdish\|\|\|\|\|!@\|` | 1.21 |
| **Glint-0.3** | The Awakening | `It is about **competent development**...` | 3.87 |
| **Glint-0.4 (SPIN)** | Current Era | `The artificial intelligence is a problem...` | **4.84** |
**Expected output:**
> "The simple terms arrived in simulant explorers and honey are specific or forecasters. They allow the structure of their similar..."
## Disclaimer
This is a 1 million parameter model.
- It is not GPT-5.
- It is not GPT-2.
- It is a tiny neural network running on a prayer and a GPU.
- It might still output `chuamliamce`. If it does, try again. It is shy.
- For best results, use temperature around 0.7. At 2.0, you are on your own.
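If you are wondering why 0.7 behaves and 2.0 does not, here is a minimal sketch of what temperature usually does during sampling: the logits are divided by the temperature before the softmax, so low values sharpen the distribution and high values flatten it. This is a generic illustration, not the code from the model runner Space. The function names and the `toy_logits` values are made up for the example.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities, scaled by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, temperature, rng=random):
    """Draw one token index according to the temperature-scaled probabilities."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]


# Hypothetical scores for three tokens; not taken from Glint-0.4.
toy_logits = [2.0, 1.0, 0.1]

for t in (0.7, 1.0, 2.0):
    probs = softmax_with_temperature(toy_logits, t)
    print(t, [round(p, 3) for p in probs])
```

At 0.7 the top token takes a larger share of the probability mass than at 2.0, which is why a tiny model stays closer to its best guesses at the lower setting.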
## Benchmarks
We benchmarked Glint-0.4 against all previous versions using a standard 7-question suite. Yes, 7 questions. We kept it small because we were running on a laptop.
| Metric | Glint-0.1 | Glint-0.2 | Glint-0.3 | **Glint-0.4 (SPIN)** |
| :--- | :---: | :---: | :---: | :---: |
| **Fluency** | 0.50 | 1.69 | 8.35 | **8.78** |
| **Coherence** | 1.99 | 1.56 | 5.72 | **6.03** |
| **Relevance** | 1.22 | 0.00 | 0.00 | **2.25** |
| **Format** | 3.29 | 3.29 | 3.29 | **3.29** |
| **Combined** | 1.62 | 1.21 | 3.87 | **4.84** |
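To check the "roughly 3x" arithmetic yourself, the sketch below copies the scores from the table and divides one model's score by another's. The card does not say how Combined is derived from the other rows, so the snippet treats it as a given number instead of recomputing it. The `improvement` helper is a name invented for this example.

```python
# Scores copied from the benchmark table above.
METRICS = ["Fluency", "Coherence", "Relevance", "Format", "Combined"]
scores = {
    "Glint-0.1": dict(zip(METRICS, [0.50, 1.99, 1.22, 3.29, 1.62])),
    "Glint-0.2": dict(zip(METRICS, [1.69, 1.56, 0.00, 3.29, 1.21])),
    "Glint-0.3": dict(zip(METRICS, [8.35, 5.72, 0.00, 3.29, 3.87])),
    "Glint-0.4": dict(zip(METRICS, [8.78, 6.03, 2.25, 3.29, 4.84])),
}


def improvement(metric, new="Glint-0.4", old="Glint-0.1"):
    """Ratio of new score to old score, or None when the old score is zero."""
    baseline = scores[old][metric]
    if baseline == 0:
        return None  # a ratio against 0.00 is undefined
    return scores[new][metric] / baseline


for metric in METRICS:
    ratio = improvement(metric)
    label = "n/a" if ratio is None else f"{ratio:.2f}x"
    print(f"{metric}: {label}")
```

Comparing against Glint-0.3 instead (`old="Glint-0.3"`) gives no ratio for Relevance, because the baseline there is 0.00.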
## Related Models
- [Glint-0.1](https://huggingface.co/CompactAI-O/Glint-0.1) (the ancestor)
- [Glint-0.2](https://huggingface.co/CompactAI-O/Glint-0.2) (the pipe character one)
- [Glint-0.3](https://huggingface.co/CompactAI-O/Glint-0.3) (the breakthrough)
- [Glint-1](https://huggingface.co/CompactAI-O/Glint-1) (the current gen)
## Acknowledgments
Built with curiosity over compute. Trained on FineWeb-Edu. SPIN optimized. A lot of hope.
---
**Built by [CompactAI](https://huggingface.co/CompactAI-O).**
*If you like tiny models that try their best, give us a follow.*