AxionLab-official commited on
Commit
cf95d44
·
verified ·
1 Parent(s): af1e16e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -3
README.md CHANGED
@@ -19,7 +19,7 @@ tags:
19
  - axionlab
20
  library_name: transformers
21
  ---
22
- ## **i'm not releasing yet LH**
23
 
24
  ## **MicroSupra-1k**
25
 
@@ -38,7 +38,7 @@ MicroSupra-1k is a 1k parameters model trained on 300M Tokens of FineWeb-Edu for
38
 
39
  [*] Prompt: My name is
40
 
41
- [*] Output: My name is ed and. as the, to. the, iningt thee the ofingi in
42
  the., anda.-eo
43
  ofles, b the,er,s fing.ssp the the
44
  , of of, the,al, d to the m, the, to toed,
@@ -54,6 +54,45 @@ Answer:,. and to the. toc. ofs the m,a thee.. the, f ofling. as.,,y bt, the p, i
54
  thes. the..,s the.ed and andang,,ed the of,,ms. of, thei the, the,ey,,s l.ing toe the the,se the to, the, the,aror, the of-. in the. the. the,e the of ds to,ic the the aal at the..
55
  ingssy s and and
56
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
57
  **🚫What the model CAN'T do:**
58
  Think
59
  Chat
@@ -61,7 +100,7 @@ even predict the next token correctly lol
61
 
62
  **Why SupraLabs created this???**
63
 
64
- Because we are experimenting sizes, experiments(like 1Bit quant, distillation(NEW THINGS ARE COMING WITH DISTILLATION! GET TUNED!), pruning, all to better your experience! We are working to big things!)
65
 
66
  **Final thought**
67
 
 
19
  - axionlab
20
  library_name: transformers
21
  ---
22
+ ## **i'm not releasing yet LH**!
23
 
24
  ## **MicroSupra-1k**
25
 
 
38
 
39
  [*] Prompt: My name is
40
 
41
+ [*] Output: My name is edie and. as the, to. the, iningt thee the ofingi in
42
  the., anda.-eo
43
  ofles, b the,er,s fing.ssp the the
44
  , of of, the,al, d to the m, the, to toed,
 
54
  thes. the..,s the.ed and andang,,ed the of,,ms. of, thei the, the,ey,,s l.ing toe the the,se the to, the, the,aror, the of-. in the. the. the,e the of ds to,ic the the aal at the..
55
  ingssy s and and
56
 
57
+
58
+ ## Get Started 🚀
59
+
60
+ ``python
61
+ print("[*] Loading libraries...")
62
+ import torch
63
+ from transformers import LlamaForCausalLM, PreTrainedTokenizerFast
64
+
65
+ model_path = "SupraLabs/MicroSupra-1k"
66
+
67
+ print("[*] Loading tokenizer...")
68
+ tokenizer = PreTrainedTokenizerFast.from_pretrained(model_path)
69
+
70
+ print("[*] Loading model...")
71
+ model = LlamaForCausalLM.from_pretrained(model_path)
72
+ model.eval()
73
+
74
+ prompt = "The most intelligent person on the world is "
75
+ print(f"[*] Prompt: {prompt!r}")
76
+
77
+ inputs = tokenizer(prompt, return_tensors="pt")
78
+
79
+ with torch.no_grad():
80
+ outputs = model.generate(
81
+ input_ids=inputs["input_ids"],
82
+ attention_mask=inputs["attention_mask"],
83
+ max_new_tokens=150,
84
+ do_sample=True,
85
+ temperature=0.35,
86
+ top_p=0.85,
87
+ repetition_penalty=1.2,
88
+ pad_token_id=tokenizer.pad_token_id,
89
+ eos_token_id=tokenizer.eos_token_id,
90
+ )
91
+
92
+ print("[*] Output:", tokenizer.decode(outputs[0], skip_special_tokens=True))
93
+ ``
94
+
95
+
96
  **🚫What the model CAN'T do:**
97
  Think
98
  Chat
 
100
 
101
  **Why SupraLabs created this???**
102
 
103
+ Because we are experimenting sizes, experiments(like 1Bit quant, distillation(NEW THINGS ARE COMING WITH DISTILLATION! GET TUNED!), pruning) all to better your experience! We are working to big things!
104
 
105
  **Final thought**
106