Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
euclaise 's Collections
MQA
SuperMC
Small-ish SoTA (<5B), (quasi-)base
Interesting smol pretraining expirements

Interesting smol pretraining expirements

updated Sep 10, 2024
Upvote
-

  • UUFO-Aigis/Pico-OpenLAiNN-250M

    0.3B • Updated Feb 24, 2025 • 4 • 3

  • crumb/distilpythia

    Text Generation • 95.6M • Updated Jul 20, 2023 • 531 • 4

  • crumb/GLORT2

    Text Generation • 0.2B • Updated Aug 26, 2024 • 18

  • pszemraj/jamba-900M-v0.13-KIx2

    Text Generation • 0.9B • Updated Dec 29, 2025 • 28 • 4

  • pszemraj/mega-ar-350m-v0.13

    Text Generation • 0.3B • Updated Dec 29, 2025 • 18

  • BEE-spoke-data/smol_llama-220M-GQA

    Text Generation • 0.2B • Updated Dec 29, 2025 • 1.1k • 13

  • pszemraj/stablelm-4e1t-2b-v0.1

    Text Generation • 2B • Updated Dec 29, 2025 • 2

  • Locutusque/TinyMistral-248M-v2

    Text Generation • 0.2B • Updated Jan 8, 2024 • 1.05k • 17

  • upstage/TinySolar-248m-4k

    Text Generation • 0.2B • Updated Feb 7, 2024 • 185 • 10

  • appvoid/arco

    0.5B • Updated Dec 5, 2024 • 18 • 14
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs