Upload README.md with huggingface_hub
README.md
CHANGED
@@ -237,7 +237,7 @@ gibberish. Do not use that script.
 
 The full pipeline (download → convert → re-quantize → finalize → patch
 → verify) is automated in
-[`build_mlx_q4q8.sh`](https://
+[`build_mlx_q4q8.sh`](https://huggingface.co/Deviad/DeepSeek-V4-Flash-MLX-Q4Q8/blob/main/build_mlx_q4q8.sh) (companion script in the
 project repo). Quick reference of the steps:
 
 ```
@@ -256,7 +256,7 @@ project repo). Quick reference of the steps:
 M3 Ultra: ~75 minutes plus the initial download (~160 GB at ~150 MB/s =
 ~18 minutes on a fast link).
 
-See [`requantization-plan.md`](https://
+See [`requantization-plan.md`](https://huggingface.co/Deviad/DeepSeek-V4-Flash-MLX-Q4Q8/blob/main/requantization-plan.md) for the
 diagnostic write-up of why the requantize step is needed.
 
 ## License & attribution