Eeppa / TinyBuddy-30M

Text Generation

built-with-llama

Eeppa committed 6 days ago · Commit c360acf (verified) · 1 parent: 9b9c2ab

Create README.txt

Files changed (1)

README.txt +28 -0

	@@ -0,0 +1,28 @@

+# TinyBuddy-30M
+A 30-million-parameter GPT-style transformer trained on TinyStories.
+## Architecture
+- 6 layers, 8 attention heads, embedding dimension 256
+- Vocabulary size: 50,000 (input and output embeddings untied)
+- Context length: 512 tokens (trained on 128-token sequences for speed)
+## Training
+- Dataset: TinyStories (5,000 stories)
+- Steps: 1,500
+- Hardware: CPU only
+- Loss: ~5.5 (recognizable story patterns, but not good)
+## What It Can Do
+- Generate 2- to 3-word fragments that resemble story patterns
+- Sometimes repeat words from the prompt
+- Produce gibberish that's trying to be English
+## What It Cannot Do
+- Tell a coherent story
+- Answer questions
+- Anything useful
+## Why It Exists
+To demonstrate that even a tiny transformer learns *patterns*, not rules.
+This is a real AI, just a very small one.
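
As a sanity check on the 30M figure, the parameter count can be estimated from the architecture listed in the README. The sketch below assumes a GPT-2-style layout that the README does not spell out: learned positional embeddings, a fused QKV projection, a 4x MLP expansion, biases on all linear layers inside the blocks, two LayerNorms per block plus a final one, and an untied output head without bias. The function name `estimate_gpt_params` is hypothetical.

```python
def estimate_gpt_params(vocab=50_000, d_model=256, n_layers=6,
                        context=512, tied=False):
    """Rough parameter count for a GPT-2-style decoder (assumed layout).

    The number of attention heads does not appear: heads split d_model
    and do not change the size of the projection matrices.
    """
    tok_emb = vocab * d_model            # token embedding table
    pos_emb = context * d_model          # learned positional embeddings (assumption)
    layer_norm = 2 * d_model             # scale + shift

    # Attention: fused QKV projection plus output projection, with biases.
    attn = (d_model * 3 * d_model + 3 * d_model) + (d_model * d_model + d_model)
    # MLP: expand to 4*d_model and project back, with biases (assumption).
    mlp = (d_model * 4 * d_model + 4 * d_model) + (4 * d_model * d_model + d_model)

    block = 2 * layer_norm + attn + mlp
    # Untied weights mean the output head is a second vocab x d_model matrix.
    head = 0 if tied else vocab * d_model

    return tok_emb + pos_emb + n_layers * block + layer_norm + head


print(f"untied: {estimate_gpt_params():,}")
print(f"tied:   {estimate_gpt_params(tied=True):,}")
```

Under these assumptions the estimate comes to roughly 30.5 million parameters, consistent with the model's name. The two 50,000 x 256 embedding matrices account for 25.6 million of that, so most of the parameters sit in the vocabulary tables and not in the six transformer blocks.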