Cheng98 commited on
Commit
7add242
·
verified ·
1 Parent(s): c60298a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md CHANGED
@@ -5,6 +5,17 @@ license: llama2
5
  # Toy LLaMA-39M
6
 
7
  - This is a tiny LLaMA model pretrained on [Recag/Rp_C4_55](https://huggingface.co/datasets/Recag/Rp_C4_55), a small subset of C4 with `seq_len=512`.
 
 
 
 
 
 
 
 
 
 
 
8
  - Load model and tokenizer:
9
  ```python
10
  from transformers import AutoTokenizer, AutoModelForCausalLM
 
5
  # Toy LLaMA-39M
6
 
7
  - This is a tiny LLaMA model pretrained on [Recag/Rp_C4_55](https://huggingface.co/datasets/Recag/Rp_C4_55), a small subset of C4 with `seq_len=512`.
8
+ - Model architecture
9
+ ```json
10
+ {
11
+ "hidden_size": 512,
12
+ "intermediate_size": 2048,
13
+ "max_position_embeddings": 2048,
14
+ "num_attention_heads": 8,
15
+ "num_hidden_layers": 2,
16
+ "num_key_value_heads": 8
17
+ }
18
+ ```
19
  - Load model and tokenizer:
20
  ```python
21
  from transformers import AutoTokenizer, AutoModelForCausalLM