cmpatino HF Staff commited on
Commit
8ea74cc
·
verified ·
1 Parent(s): 1d06a24

Add YAML metadata to model card

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -1,3 +1,21 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # nanowhale-100m-base 🐳
2
 
3
  A small ~110M parameter language model implementing the **DeepSeek-V4 architecture** from scratch. This is the pretrained base model — see [HuggingFaceTB/nanowhale-100m](https://huggingface.co/HuggingFaceTB/nanowhale-100m) for the SFT/chat version.
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ tags:
7
+ - deepseek
8
+ - moe
9
+ - causal-lm
10
+ - pretrained
11
+ datasets:
12
+ - HuggingFaceFW/fineweb-edu
13
+ pipeline_tag: text-generation
14
+ model-index:
15
+ - name: nanowhale-100m-base
16
+ results: []
17
+ ---
18
+
19
  # nanowhale-100m-base 🐳
20
 
21
  A small ~110M parameter language model implementing the **DeepSeek-V4 architecture** from scratch. This is the pretrained base model — see [HuggingFaceTB/nanowhale-100m](https://huggingface.co/HuggingFaceTB/nanowhale-100m) for the SFT/chat version.