SupraLabs
/

MicroSupra-1k

Text Generation

text-generation-inference

Model card Files Files and versions

AxionLab-official commited on 10 days ago

Commit

cf95d44

·

verified ·

1 Parent(s): af1e16e

Update README.md

Files changed (1) hide show

README.md +42 -3

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ tags:
 - axionlab
 library_name: transformers
 ---
-## **i'm not releasing yet LH**
 ## **MicroSupra-1k**
@@ -38,7 +38,7 @@ MicroSupra-1k is a 1k parameters model trained on 300M Tokens of FineWeb-Edu for
 [*] Prompt: My name is
-[*] Output: My name is ed and. as the, to. the, iningt thee the ofingi in
  the., anda.-eo
  ofles, b the,er,s fing.ssp the the
 , of of, the,al, d to the m, the, to toed,
@@ -54,6 +54,45 @@ Answer:,. and to the. toc. ofs the m,a thee.. the, f ofling. as.,,y bt, the p, i
  thes. the..,s the.ed and andang,,ed the of,,ms. of, thei the, the,ey,,s l.ing toe the the,se the to, the, the,aror, the of-. in the. the. the,e the of ds to,ic the the aal at the..
 ingssy s and and
 **🚫What the model CAN'T do:**
 Think
 Chat
@@ -61,7 +100,7 @@ even predict the next token correctly lol
 **Why SupraLabs created this???**
-Because we are experimenting sizes, experiments(like 1Bit quant, distillation(NEW THINGS ARE COMING WITH DISTILLATION! GET TUNED!), pruning, all to better your experience! We are working to big things!)
 **Final thought**

 - axionlab
 library_name: transformers
 ---
+## **i'm not releasing yet LH**!
 ## **MicroSupra-1k**
 [*] Prompt: My name is
+[*] Output: My name is edie and. as the, to. the, iningt thee the ofingi in
  the., anda.-eo
  ofles, b the,er,s fing.ssp the the
 , of of, the,al, d to the m, the, to toed,
  thes. the..,s the.ed and andang,,ed the of,,ms. of, thei the, the,ey,,s l.ing toe the the,se the to, the, the,aror, the of-. in the. the. the,e the of ds to,ic the the aal at the..
 ingssy s and and
+## Get Started 🚀
+``python
+print("[*] Loading libraries...")
+import torch
+from transformers import LlamaForCausalLM, PreTrainedTokenizerFast
+model_path = "SupraLabs/MicroSupra-1k"
+print("[*] Loading tokenizer...")
+tokenizer = PreTrainedTokenizerFast.from_pretrained(model_path)
+print("[*] Loading model...")
+model = LlamaForCausalLM.from_pretrained(model_path)
+model.eval()
+prompt = "The most intelligent person on the world is "
+print(f"[*] Prompt: {prompt!r}")
+inputs = tokenizer(prompt, return_tensors="pt")
+with torch.no_grad():
+    outputs = model.generate(
+        input_ids=inputs["input_ids"],
+        attention_mask=inputs["attention_mask"],
+        max_new_tokens=150,
+        do_sample=True,
+        temperature=0.35,
+        top_p=0.85,
+        repetition_penalty=1.2,
+        pad_token_id=tokenizer.pad_token_id,
+        eos_token_id=tokenizer.eos_token_id,
+    )
+print("[*] Output:", tokenizer.decode(outputs[0], skip_special_tokens=True))
+``
 **🚫What the model CAN'T do:**
 Think
 Chat
 **Why SupraLabs created this???**
+Because we are experimenting sizes, experiments(like 1Bit quant, distillation(NEW THINGS ARE COMING WITH DISTILLATION! GET TUNED!), pruning) all to better your experience! We are working to big things!
 **Final thought**