Sweaterdog/Andy-4-micro


Sweaterdog committed on Jun 26, 2025

Commit 107fa39 · verified · 1 Parent(s): 47c067c

Update README.md

Files changed (1)

README.md +5 -4


@@ -45,10 +45,11 @@ First, you need to choose your quantization, this chart is with the base of `819
 | Quantization | VRAM Required |
 |--------------|---------------|
-| F16          | 6 GB+         |
-| Q5_K_M       | 4 GB+         |
-| Q4_K_M       | 4 GB          |
-| Q3_K_M       | 1.5 GB or CPU |
+|--------------|---------------|
+| F16          | 5 GB       |
+| Q8_0       | 3 GB+         |
+| Q5_K_M       | 2 GB+         |
+| Q3_K_M       | 1GB or CPU   |
 **NOTE:** GPUs made before 2017 will have *significantly slower speeds* than newer GPUs, also, CPU inference will be extremely slow.
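
As a rough illustration of how the updated table might be used, the sketch below picks the highest-precision quantization that fits a given amount of VRAM. The function name `pick_quantization` and the `VRAM_TABLE` list are hypothetical, and reading each table entry as a minimum threshold in GB is an assumption; the figures apply only to the base context length the chart is built around.

```python
# Thresholds copied from the added ("+") rows of the diff above.
# Treating each figure as a minimum VRAM requirement in GB is an assumption.
VRAM_TABLE = [
    ("F16", 5.0),
    ("Q8_0", 3.0),
    ("Q5_K_M", 2.0),
    ("Q3_K_M", 1.0),
]


def pick_quantization(vram_gb: float) -> tuple[str, str]:
    """Return (quantization, device) for the available VRAM in GB."""
    # The table is ordered from highest to lowest precision,
    # so the first entry that fits is the best available choice.
    for name, min_gb in VRAM_TABLE:
        if vram_gb >= min_gb:
            return name, "gpu"
    # Below 1 GB the table's only remaining option is Q3_K_M on CPU,
    # which the note above warns will be extremely slow.
    return "Q3_K_M", "cpu"
```

For example, `pick_quantization(4)` selects `Q8_0` on the GPU, while anything under 1 GB falls back to `Q3_K_M` on the CPU.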