Update README.md
Browse files
README.md
CHANGED
|
@@ -11,44 +11,48 @@ pinned: false
|
|
| 11 |
|
| 12 |
# Welcome to SupraLabs!
|
| 13 |
|
| 14 |
-
##
|
| 15 |
We are @AxionLab-official, @LH-Tech-AI, and @Harley-ml. We are creating small open-source models for everyone.
|
| 16 |
|
| 17 |
-
##
|
| 18 |
We train, finetune, and explore small models. Our goal is to revolutionize small AI models by making them accessible to everyone!
|
| 19 |
|
| 20 |
-
##
|
| 21 |
We are **not** making bad (or we try not to!) models and we try to fully open source our models and code. Some models may be fully opensourced, while others might not.
|
| 22 |
|
| 23 |
-
##
|
| 24 |
|
| 25 |
-
- Supra Mini 0.1M
|
| 26 |
-
- Supra Mini **v2** 0.1M
|
| 27 |
-
- Supra Mini **v3** 0.5M
|
| 28 |
-
- Supra Mini **v4** 2M
|
| 29 |
-
- Supra Mini **v5** 8M
|
| 30 |
-
- MicroSupra 1k
|
| 31 |
-
- StorySupra-10M
|
| 32 |
-
- DistillSupra-0.2M
|
| 33 |
- **More Coming Soon! Come Back later!**
|
| 34 |
|
| 35 |
-
##
|
| 36 |
We are competing with @CompactAI-O and @LH-Tech-AI (we know it's funny to compete against your own founder, but anyway π€£π).
|
| 37 |
<br>See all of our and our copetitors tiny models here: [https://lh-tech.de/ai/compare-tiny-models.html](https://lh-tech.de/ai/compare-tiny-models.html)
|
| 38 |
|
| 39 |
-
##
|
| 40 |
|
| 41 |
- Supra-10M - Base, Chat, Reasoning - Trained on RTX 5060 Ti 16GB, with Nvidia technologies and CUDA
|
| 42 |
- Supra-1M - Base, Chat, Reasoning - Trained on GTX 750Ti 4GB, with Nvidia Technologies and optimizations
|
| 43 |
- Supra-50M - Base, Chat, Reasoning, Coding - Trained on a knot between RTX 5060 Ti 16GB and GTX 750 Ti 4GB, Pushing charges to train the best model to you!
|
| 44 |
|
| 45 |
-
##
|
| 46 |
- RTX 5060 Ti 16GB (LH-Tech AI)
|
| 47 |
- GTX 750Ti 4GB (AxionLab)
|
| 48 |
- RTX 2060 6GB (Harley-ML)
|
| 49 |
|
| 50 |
-
##
|
| 51 |
[https://huggingface.co/spaces/SupraLabs/Blog](https://huggingface.co/spaces/SupraLabs/Blog)
|
| 52 |
|
| 53 |
-
##
|
| 54 |
-
Feedback and support welcomed. Feel free to ask to join our organization if you want!
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
|
| 12 |
# Welcome to SupraLabs!
|
| 13 |
|
| 14 |
+
## Who we are
|
| 15 |
We are @AxionLab-official, @LH-Tech-AI, and @Harley-ml. We are creating small open-source models for everyone.
|
| 16 |
|
| 17 |
+
## What we do
|
| 18 |
We train, finetune, and explore small models. Our goal is to revolutionize small AI models by making them accessible to everyone!
|
| 19 |
|
| 20 |
+
## What we do NOT do
|
| 21 |
We are **not** making bad (or we try not to!) models and we try to fully open source our models and code. Some models may be fully opensourced, while others might not.
|
| 22 |
|
| 23 |
+
## Models
|
| 24 |
|
| 25 |
+
- Supra Mini 0.1M: Trained on Kaggle 2xT4, 100k parameters, compared to models 10x it size
|
| 26 |
+
- Supra Mini **v2** 0.1M: the second version of the Supra Mini series.
|
| 27 |
+
- Supra Mini **v3** 0.5M: the third version of the Supra Mini series.
|
| 28 |
+
- Supra Mini **v4** 2M: the fourth version of the Supra Mini series. Improved. More powerful. With context understanding.
|
| 29 |
+
- Supra Mini **v5** 8M: the fifth version of the Supra Mini series. A huge token-eater monster compared to its siblings.
|
| 30 |
+
- MicroSupra 1k: Trained on GTX 750 Ti 4GB, a scaling laws experiment.
|
| 31 |
+
- StorySupra-10M: Trained on RTX 5060 Ti 16GB for 10 minutes, coherent.
|
| 32 |
+
- DistillSupra-0.2M: Trained on GTX 750 Ti 4GB for 30 minutes, still incoherent, but the first step for distillation research.
|
| 33 |
- **More Coming Soon! Come Back later!**
|
| 34 |
|
| 35 |
+
## Competing with other creators
|
| 36 |
We are competing with @CompactAI-O and @LH-Tech-AI (we know it's funny to compete against your own founder, but anyway π€£π).
|
| 37 |
<br>See all of our and our copetitors tiny models here: [https://lh-tech.de/ai/compare-tiny-models.html](https://lh-tech.de/ai/compare-tiny-models.html)
|
| 38 |
|
| 39 |
+
## Future roadmap
|
| 40 |
|
| 41 |
- Supra-10M - Base, Chat, Reasoning - Trained on RTX 5060 Ti 16GB, with Nvidia technologies and CUDA
|
| 42 |
- Supra-1M - Base, Chat, Reasoning - Trained on GTX 750Ti 4GB, with Nvidia Technologies and optimizations
|
| 43 |
- Supra-50M - Base, Chat, Reasoning, Coding - Trained on a knot between RTX 5060 Ti 16GB and GTX 750 Ti 4GB, Pushing charges to train the best model to you!
|
| 44 |
|
| 45 |
+
## Hardware
|
| 46 |
- RTX 5060 Ti 16GB (LH-Tech AI)
|
| 47 |
- GTX 750Ti 4GB (AxionLab)
|
| 48 |
- RTX 2060 6GB (Harley-ML)
|
| 49 |
|
| 50 |
+
## Blog
|
| 51 |
[https://huggingface.co/spaces/SupraLabs/Blog](https://huggingface.co/spaces/SupraLabs/Blog)
|
| 52 |
|
| 53 |
+
## Feedback and Support
|
| 54 |
+
Feedback and support welcomed. Feel free to ask to join our organization if you want!
|
| 55 |
+
|
| 56 |
+
## Note
|
| 57 |
+
|
| 58 |
+
Some content, such as our blogs or readmes, may be created with the help of AI because not all of us have strong English skills.
|