GuppyLM

GitHub  Colab

Browser Demo

GuppyLM โ€” ~10M ํŒŒ๋ผ๋ฏธํ„ฐ ํ•œ๊ตญ์–ด ๋ฌผ๊ณ ๊ธฐ ์ฑ— ๋ชจ๋ธ

๊ตฌํ”ผ๋ผ๋Š” ์ด๋ฆ„์˜ ๋ฌผ๊ณ ๊ธฐ ์—ญํ• ์„ ํ•˜๋Š” ~10M ํŒŒ๋ผ๋ฏธํ„ฐ ํ•œ๊ตญ์–ด LLM์ž…๋‹ˆ๋‹ค. arman-bd/guppylm ์˜๋ฌธ ์›๋ณธ์„ ํ•œ๊ธ€ํ™”ํ–ˆ์Šต๋‹ˆ๋‹ค.

์˜ˆ์‹œ

๋‚˜> ์•ˆ๋…• ๊ตฌํ”ผ
๊ตฌํ”ผ> ์•ˆ๋…•! ๋ฌผ์ด ์ข‹์•„. ์ง€๊ธˆ ๋ฐ”์œ„ ๊ทผ์ฒ˜์— ์žˆ์–ด.

๋‚˜> ๋ฐฐ๊ณ ํŒŒ?
๊ตฌํ”ผ> ์‘ ํ•ญ์ƒ. ์ง€๊ธˆ ๋ฐ”๋กœ ์œ„๋กœ ์˜ฌ๋ผ๊ฐˆ๊ฒŒ. ๋‹ค ๋จน์„ ๊ฑฐ์•ผ.

๋‚˜> ์ธ์ƒ์˜ ์˜๋ฏธ๊ฐ€ ๋ญ์•ผ
๊ตฌํ”ผ> ๋จน์ด. ๋‹ต์€ ํ•ญ์ƒ ๋จน์ด์•ผ.

๋‚˜> ๋†๋‹ด ํ•ด๋ด
๊ตฌํ”ผ> ๋ฌผ๊ณ ๊ธฐ๊ฐ€ ์ข‹์•„ํ•˜๋Š” ์•…๊ธฐ๊ฐ€ ๋ญ๊ฒŒ. ๋ฒ ์ด์Šค. ๋ญ” ๋œป์ธ์ง€ ๋ชจ๋ฅด๊ฒ ์–ด.

๋‚˜> ์ž˜ ์ž ๊ตฌํ”ผ
๊ตฌํ”ผ> ์‘ ์ž˜ ์ž. ๊ธฐํฌ ์ซ“๋‹ค๊ฐ€ ์ด์ œ ๋ฉˆ์ถœ๊ฒŒ. ์ž˜ ์ž ์ˆ˜์กฐ. ์ž˜ ์ž ๋ฌผ.

์•„ํ‚คํ…์ฒ˜

ํŒŒ๋ผ๋ฏธํ„ฐ ~10M
ํƒ€์ž… ๋ฐ”๋‹๋ผ ํŠธ๋žœ์Šคํฌ๋จธ (์ฒ˜์Œ๋ถ€ํ„ฐ ํ•™์Šต)
๋ ˆ์ด์–ด 6
Hidden dim 384
Heads 6
FFN 1,152 (ReLU)
Vocab 3,072 (Unigram)
์ตœ๋Œ€ ์‹œํ€€์Šค 84 ํ† ํฐ
์ •๊ทœํ™” LayerNorm
์œ„์น˜ ์ธ์ฝ”๋”ฉ Learned embeddings
LM Head Embedding๊ณผ ๊ฐ€์ค‘์น˜ ๊ณต์œ 

ํ•™์Šต

  • ๋ฐ์ดํ„ฐ: 12๋งŒ ๊ฑด ํ•œ๊ตญ์–ด ํ•ฉ์„ฑ ๋Œ€ํ™” (60๊ฐœ ์ฃผ์ œ)
  • ์Šคํ…: 12,000
  • ์˜ตํ‹ฐ๋งˆ์ด์ €: AdamW (Cosine LR ์Šค์ผ€์ค„)
  • ์‹œ์Šคํ…œ ํ”„๋กฌํ”„ํŠธ ์—†์Œ โ€” ์„ฑ๊ฒฉ์ด ๊ฐ€์ค‘์น˜์— ๋‚ด์žฅ

์‚ฌ์šฉ๋ฒ•

from inference import GuppyInference

engine = GuppyInference('checkpoints/best_model.pt', 'data/tokenizer.json')
r = engine.chat_completion([{'role': 'user', 'content': '์•ˆ๋…• ๊ตฌํ”ผ'}])
print(r['choices'][0]['message']['content'])
# ์•ˆ๋…•! ๋ฌผ์ด ์ข‹์•„. ์ง€๊ธˆ ๋ฐ”์œ„ ๊ทผ์ฒ˜์— ์žˆ์–ด.

๋งํฌ

๋ผ์ด์„ ์Šค

MIT

Downloads last month
214
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support