CompactAI-O
/

Glint-0.4

Model card Files Files and versions

Glint-0.4 / README.md

CompactAI's picture

Upload README.md

b3d54fb verified about 13 hours ago

|

history blame contribute delete

3 kB

	---
	license: gpl-3.0
	new_version: CompactAI-O/Glint-1
	---

	Note: You need the custom python script to run this model. Grab it from [the model runner Space](https://huggingface.co/spaces/CompactAI-O/CompactAIModelRunner).

	# Glint-0.4

	It speaks. It actually speaks. Mostly.

	We came so far. From the dark ages of `couldcouldoldbloodblood` to actual, coherent sentences. This is Glint-0.4. 1 million parameters. Small. Trying its best. Unlike its ancestors, it usually succeeds.

	## Quick Stats

	- Parameters: 1,000,000 (yes, really)
	- Training Tokens: 10 Billion
	- Context Window: 2048 tokens
	- Vibe: Chaotic good, but mostly good

	## What Is This?

	Glint-0.4 is the latest of the old guard before we started the Glint line proper. It builds on Glint-0.3 by adding SPIN (Self-Play Fine-Tuning) to the training loop. This model represents a 3x improvement in combined score over the original Glint-0.1. Coherence jumped from 1.99 to 6.03. Relevance is no longer zero. It is a miracle.

	### The journey

	\| Model \| Era \| Typical Output \| Combined \|
	\| :--- \| :--- \| :--- \| :--- \|
	\| Glint-0.1 \| The Dark Ages \| `couldcouldoldbloodbloodbodybody` \| 1.62 \|
	\| Glint-0.2 \| Pipe Character Incident \| `\\|fdish\\|\\|\\|\\|\\|!@\\|` \| 1.21 \|
	\| Glint-0.3 \| The Awakening \| `It is about competent development...` \| 3.87 \|
	\| Glint-0.4 (SPIN) \| Current Era \| `The artificial intelligence is a problem...` \| 4.84 \|

	Expected output:
	> "The simple terms arrived in simulant explorers and honey are specific or forecasters. They allow the structure of their similar..."

	## Disclaimer

	This is a 1 million parameter model.
	- It is not GPT-5.
	- It is not GPT-2.
	- It is a tiny neural network running on a prayer and a GPU.
	- It might still output `chuamliamce`. If it does, try again. It is shy.
	- For best results, use temperature around 0.7. At 2.0, you are on your own.

	## Benchmarks

	We benchmarked Glint-0.4 against all previous versions using a standard 7-question suite. Yes, 7 questions. We kept it small because we were running on a laptop.

	\| Metric \| Glint-0.1 \| Glint-0.2 \| Glint-0.3 \| Glint-0.4 (SPIN) \|
	\| :--- \| :---: \| :---: \| :---: \| :---: \|
	\| Fluency \| 0.50 \| 1.69 \| 8.35 \| 8.78 \|
	\| Coherence \| 1.99 \| 1.56 \| 5.72 \| 6.03 \|
	\| Relevance \| 1.22 \| 0.00 \| 0.00 \| 2.25 \|
	\| Format \| 3.29 \| 3.29 \| 3.29 \| 3.29 \|
	\| Combined \| 1.62 \| 1.21 \| 3.87 \| 4.84 \|

	## Related Models

	- [Glint-0.1](https://huggingface.co/CompactAI-O/Glint-0.1) (the ancestor)
	- [Glint-0.2](https://huggingface.co/CompactAI-O/Glint-0.2) (the pipe character one)
	- [Glint-0.3](https://huggingface.co/CompactAI-O/Glint-0.3) (the breakthrough)
	- [Glint-1](https://huggingface.co/CompactAI-O/Glint-1) (the current gen)

	## Acknowledgments

	Built with curiosity over compute. Trained on FineWeb-Edu. SPIN optimized. A lot of hope.

	---

	Built by [CompactAI](https://huggingface.co/CompactAI-O).
	If you like tiny models that try their best, give us a follow.