SupraLabs
/

MicroSupra-1k

Text Generation

text-generation-inference

Model card Files Files and versions

AxionLab-official commited on 10 days ago

Commit

49c5dcd

·

verified ·

1 Parent(s): 015e1c1

Update README.md

Files changed (1) hide show

README.md +48 -2

README.md CHANGED Viewed

@@ -12,12 +12,58 @@ tags:
 - supra
 - SupraLabs
 - gtx
-- gpu
 - rtx
 - nvidia
 - llama
 ---
 ## **MicroSupra-1k**
-This model is being finished yet LH Tech, but here are the weights

 - supra
 - SupraLabs
 - gtx
 - rtx
 - nvidia
 - llama
+- lhtech
+- axionlab
+library_name: transformers
 ---
 ## **MicroSupra-1k**
+So... have you ever seen a model that runs on a 3 dollars hardware? No? If no, Now you're seeing!
+MicroSupra-1k is a 1k parameters model trained on 300M Tokens of FineWeb-Edu for 1 minute(Yes! 59 seconds!) on a GTX 750Ti 4GB(AxionLab Hardware)
+**Some model outputs:**
+[*] Prompt: The main concept of physics is
+[*] Output: The main concept of physics is  a,s and the. thet to, theing.... the,a then,c,i to, thee in b. toed.,,e theyalp the in,er thees- s,el,,,,
+ and, the of ine,,s the of cs of thesss the. f. to. thesining andor dar,,al the,. of p.
+ the.s the.,,s. anded,e. of, ofed, l toinging and themsr the of of. to
+ to thes thes aen,., ofes of a.
+[*] Prompt: My name is
+[*] Output: My name is ed and. as the, to. the, iningt thee the ofingi in
+ the., anda.-eo
+ ofles, b the,er,s fing.ssp the the
+, of of, the,al, d to the m, the, to toed,
+seng,,.y. in the,., in and them the thened.sing to
+ the of of andan the the,, the
+ to..,,sing,,.aring the the. of.al.,s ofcal ar s..e and.sssor of, and and.
+[*] Prompt: Question: What is the capital of France?\nAnswer:
+[*] Output: Question: What is the capital of France?
+Answer:,. and to the. toc. ofs the m,a thee.. the, f ofling. as.,,y bt, the p, in, the,,ees toed ing to.o,
+ thes. the..,s the.ed and andang,,ed the of,,ms. of, thei the, the,ey,,s l.ing toe the the,se the to, the, the,aror, the of-. in the. the. the,e the of ds to,ic the the aal at the..
+ingssy s and and
+**🚫What the model CAN'T do:**
+Think
+Chat
+even predict the next token correctly lol
+**Why SupraLabs created this???**
+Because we are experimenting sizes, experiments(like 1Bit quant, distillation(NEW THINGS ARE COMING WITH DISTILLATION! GET TUNED!), pruning, all to better your experience! We are working to big things!)
+**Final thought**
+Even without any intelligence, it shows that scaling laws are real. This ant model doesn't know how to talk, but we all know it emotions 🤖🫶
+SupraLabs are excited to work at more things to you!