cmpatino HF Staff committed on
Commit 6a3bed5 · verified · 1 Parent(s): f3e29ee

Add YAML metadata to model card

Files changed (1)
  1. README.md +21 -0
README.md CHANGED
@@ -1,3 +1,24 @@
+ ---
+ license: apache-2.0
+ language:
+ - en
+ library_name: transformers
+ tags:
+ - deepseek
+ - moe
+ - causal-lm
+ - sft
+ - chat
+ datasets:
+ - HuggingFaceFW/fineweb-edu
+ - HuggingFaceTB/smol-smoltalk
+ base_model: HuggingFaceTB/nanowhale-100m-base
+ pipeline_tag: text-generation
+ model-index:
+ - name: nanowhale-100m
+   results: []
+ ---
+
  # nanowhale-100m 🐳
 
  A small ~110M parameter language model implementing the **DeepSeek-V4 architecture**, fine-tuned for chat/instruction following. Trained from scratch — no weights from DeepSeek-V4 were used.
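
The metadata added in this commit is a YAML front-matter block: it sits at the very top of README.md, between two `---` lines. As a rough illustration only, the sketch below separates that block from the Markdown body using nothing but the Python standard library. The helper name `split_front_matter` and the shortened sample README are assumptions made for this example, not part of the commit.

```python
def split_front_matter(text: str) -> tuple[str, str]:
    """Split a model card into (front_matter, body).

    Hypothetical helper: returns an empty front matter when the text
    does not start with a '---' line or has no closing '---' line.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return "", text
    for i in range(1, len(lines)):
        if lines[i].strip() == "---":
            front = "\n".join(lines[1:i])
            # Drop the blank line(s) that separate the block from the body.
            body = "\n".join(lines[i + 1:]).lstrip("\n")
            return front, body
    # No closing delimiter found: treat everything as body.
    return "", text


# Shortened sample, modeled on the fields in the diff above.
readme = """---
license: apache-2.0
pipeline_tag: text-generation
---

# nanowhale-100m
"""

front, body = split_front_matter(readme)

# Top-level keys are the unindented lines that are not list items.
top_level_keys = [
    line.split(":", 1)[0]
    for line in front.splitlines()
    if line and not line.startswith((" ", "-"))
]
print(top_level_keys)
```

This only locates the block and lists its top-level keys; it does not interpret YAML. For real validation a proper YAML parser, or the `ModelCard` class in `huggingface_hub`, would likely be a better fit.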