Upload folder using huggingface_hub
- README.md +2 -2
- README_cn.md +2 -2
- config.json +1 -1
- model.safetensors +2 -2
README.md
CHANGED
@@ -10,9 +10,9 @@ This project is created using the official **DeepSeek-V4-Pro** model architectur
 [中文说明](./README_cn.md)
 
 ## Purpose
-The purpose of these weights is to provide a
+The purpose of these weights is to provide a lightweight implementation for researchers who want to study the model architecture and run local quickly.
 
-The original **DeepSeek-V4-Pro model** requires significant GPU resources and runs on frameworks like **vLLM/SGLang**
+The original **DeepSeek-V4-Pro model** requires significant GPU resources and runs on frameworks like **vLLM/SGLang**, making it difficult to deploy on standard hardware.
 
 The difference between this model and the original **DeepSeek-V4-Pro** is shown below:
 ```json
README_cn.md
CHANGED
@@ -10,9 +10,9 @@ base_model:
 [English README](./README.md)
 
 ## 目的
-为研究人员提供一个
+为研究人员提供一个轻量级实现,方便在有限硬件资源下研究和快速本地运行。
 
-原始 **DeepSeek-V4-Pro** 需要大量 GPU 资源,基于 **vLLM/SGLang** 框架
+原始 **DeepSeek-V4-Pro** 需要大量 GPU 资源,基于 **vLLM/SGLang** 框架运行,难以在普通硬件上部署。
 
 此模型与原始 **DeepSeek-V4-Pro** 的区别如下:
 ```json
config.json
CHANGED
@@ -103,5 +103,5 @@
 "topk_method": "noaux_tc",
 "transformers_version": "5.8.0.dev0",
 "use_cache": true,
-"vocab_size":
+"vocab_size": 129280
 }
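The `vocab_size` value set in this hunk can be checked after downloading `config.json`. The following is a minimal sketch using only the standard library; the helper name `check_vocab_size` and the sample string are hypothetical, and the sample contains only the keys visible in the hunk above, not the full configuration.

```python
import json

# Value taken from the added line in the config.json hunk.
EXPECTED_VOCAB_SIZE = 129280


def check_vocab_size(config_text: str, expected: int = EXPECTED_VOCAB_SIZE) -> bool:
    """Return True if the JSON text declares the expected vocab_size."""
    config = json.loads(config_text)
    return config.get("vocab_size") == expected


# Hypothetical excerpt: only the keys shown in the diff, for illustration.
sample = (
    '{"topk_method": "noaux_tc", '
    '"transformers_version": "5.8.0.dev0", '
    '"use_cache": true, '
    '"vocab_size": 129280}'
)

print(check_vocab_size(sample))
```

For a real checkout, the same helper could be applied to the contents of the downloaded `config.json` instead of the sample string.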
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:69c7fdc776d036798e6476625c37dff266791b39150d54c2ec5ce22304fcca8d
+size 2640881898
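The new pointer records the SHA-256 digest and byte size of the uploaded `model.safetensors`, so a downloaded copy can be checked against it. The sketch below assumes the three-line pointer layout shown in the hunk above; the helper names `parse_lfs_pointer` and `verify_file` and the local file path are hypothetical, not part of this commit.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    return {
        "version": fields["version"],
        "oid": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Compare a local file's size and SHA-256 digest with the pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["oid"]


# Pointer contents as they appear on the added side of the diff.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:69c7fdc776d036798e6476625c37dff266791b39150d54c2ec5ce22304fcca8d
size 2640881898
"""

pointer = parse_lfs_pointer(POINTER_TEXT)

# Hypothetical local path; the check runs only if the file is present.
local_copy = Path("model.safetensors")
if local_copy.exists():
    print(verify_file(local_copy, pointer))
```

Checking the size before hashing lets a truncated download fail quickly without reading the whole file.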