cmpatino HF Staff commited on
Commit
fa4ce8a
·
verified ·
1 Parent(s): 8fecf92

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -2,7 +2,7 @@
2
 
3
  A small ~110M parameter language model implementing the **DeepSeek-V4 architecture**, fine-tuned for chat/instruction following. Trained from scratch — no weights from DeepSeek-V4 were used.
4
 
5
- - **Pretrained base model**: [cmpatino/smol-deepseek-v4-100m-pretrain](https://huggingface.co/cmpatino/smol-deepseek-v4-100m-pretrain)
6
  - **This model**: SFT on [HuggingFaceTB/smol-smoltalk](https://huggingface.co/datasets/HuggingFaceTB/smol-smoltalk)
7
 
8
  ## Architecture
 
2
 
3
  A small ~110M parameter language model implementing the **DeepSeek-V4 architecture**, fine-tuned for chat/instruction following. Trained from scratch — no weights from DeepSeek-V4 were used.
4
 
5
+ - **Pretrained base model**: [HuggingFaceTB/nanowhale-100m-base](https://huggingface.co/HuggingFaceTB/nanowhale-100m-base)
6
  - **This model**: SFT on [HuggingFaceTB/smol-smoltalk](https://huggingface.co/datasets/HuggingFaceTB/smol-smoltalk)
7
 
8
  ## Architecture