silence09 committed on
Commit 138513a · verified · 1 Parent(s): 83392c4

Upload folder using huggingface_hub

Files changed (4)
  1. README.md +2 -2
  2. README_cn.md +2 -2
  3. config.json +1 -1
  4. model.safetensors +2 -2
README.md CHANGED
@@ -10,9 +10,9 @@ This project is created using the official **DeepSeek-V4-Pro** model architectur
  [中文说明](./README_cn.md)

  ## Purpose
- The purpose of these weights is to provide a **lightweight** implementation for researchers who want to **study the model architecture and run locally quickly**.
+ The purpose of these weights is to provide a lightweight implementation for researchers who want to study the model architecture and run local quickly.

- The original **DeepSeek-V4-Pro model** requires significant GPU resources and runs on frameworks like **vLLM/SGLang** and custom kernels written by **TileLang**, making it difficult to deploy on standard hardware.
+ The original **DeepSeek-V4-Pro model** requires significant GPU resources and runs on frameworks like **vLLM/SGLang**, making it difficult to deploy on standard hardware.

  The difference between this model and the original **DeepSeek-V4-Pro** is shown below:
  ```json
README_cn.md CHANGED
@@ -10,9 +10,9 @@ base_model:
  [English README](./README.md)

  ## 目的
- 为研究人员提供一个**轻量级实现**,方便在**有限硬件资源下研究和快速本地运行**
+ 为研究人员提供一个轻量级实现,方便在有限硬件资源下研究和快速本地运行。

- 原始 **DeepSeek-V4-Pro** 需要大量 GPU 资源,基于 **vLLM/SGLang** 框架和**TileLang**编写的Kernel运行,难以在普通硬件上部署。
+ 原始 **DeepSeek-V4-Pro** 需要大量 GPU 资源,基于 **vLLM/SGLang** 框架运行,难以在普通硬件上部署。

  此模型与原始 **DeepSeek-V4-Pro** 的区别如下:
  ```json
config.json CHANGED
@@ -103,5 +103,5 @@
  "topk_method": "noaux_tc",
  "transformers_version": "5.8.0.dev0",
  "use_cache": true,
- "vocab_size": 128000
+ "vocab_size": 129280
  }
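
The config.json hunk changes a single key. A minimal sketch of confirming that programmatically, assuming both versions of the file are available as JSON text: the helper name `changed_keys` is hypothetical, and the two fragments below contain only the fields visible in this hunk, not the full config.

```python
import json


def changed_keys(old: dict, new: dict) -> dict:
    """Map each key whose value differs to an (old, new) pair."""
    keys = old.keys() | new.keys()
    return {
        key: (old.get(key), new.get(key))
        for key in sorted(keys)
        if old.get(key) != new.get(key)
    }


# Fragments limited to the fields shown in the hunk above (assumption:
# the rest of the file is unchanged, as the hunk header suggests).
old_cfg = json.loads(
    '{"topk_method": "noaux_tc", "transformers_version": "5.8.0.dev0",'
    ' "use_cache": true, "vocab_size": 128000}'
)
new_cfg = json.loads(
    '{"topk_method": "noaux_tc", "transformers_version": "5.8.0.dev0",'
    ' "use_cache": true, "vocab_size": 129280}'
)

diff = changed_keys(old_cfg, new_cfg)
vocab_delta = new_cfg["vocab_size"] - old_cfg["vocab_size"]
print(diff)         # {'vocab_size': (128000, 129280)}
print(vocab_delta)  # 1280
```

The vocabulary grows by 1280 entries, which fits the model.safetensors size increase in the same commit, although the diff alone does not show which tensors changed.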
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:069e897368db2a599e20d6fc2e3bc9791b7120e0f9fbb112f322941ce4062024
- size 2638580010
+ oid sha256:69c7fdc776d036798e6476625c37dff266791b39150d54c2ec5ce22304fcca8d
+ size 2640881898
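
The model.safetensors entry is a Git LFS pointer, so the diff records only the digest and byte size of the real file. A minimal sketch, using only the Python standard library, of parsing the two pointers and checking a downloaded file against one: the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the short payload at the end is a stand-in for the actual weights.

```python
import hashlib
import io


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])
    return fields


def matches_pointer(stream, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Stream a binary file object and compare its size and SHA-256 to the pointer."""
    algo, _, digest = pointer["oid"].partition(":")
    if algo != "sha256":
        return False
    hasher = hashlib.sha256()
    total = 0
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        hasher.update(chunk)
        total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == digest


old = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:069e897368db2a599e20d6fc2e3bc9791b7120e0f9fbb112f322941ce4062024\n"
    "size 2638580010\n"
)
new = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:69c7fdc776d036798e6476625c37dff266791b39150d54c2ec5ce22304fcca8d\n"
    "size 2640881898\n"
)

size_delta = new["size"] - old["size"]
print(size_delta)  # 2301888

# Self-check with a tiny in-memory payload standing in for the real file.
payload = b"demo"
demo_pointer = {
    "oid": "sha256:" + hashlib.sha256(payload).hexdigest(),
    "size": len(payload),
}
print(matches_pointer(io.BytesIO(payload), demo_pointer))  # True
```

For the real file, pass `open("model.safetensors", "rb")` as the stream. Reading in chunks avoids loading roughly 2.6 GB into memory at once.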