how to deploy AngelSlim/Hy-MT1.5-1.8B-1.25bit

#1
by Aelous - opened

how to deploy AngelSlim/Hy-MT1.5-1.8B-1.25bit

We're putting the finishing touches on our latest llama.cpp kernel for deploying AngelSlim/Hy-MT1.5-1.8B-1.25bit, expect that to release soon! If you want to get started earlier, feel free to try our APK (https://huggingface.co/AngelSlim/Hy-MT1.5-1.8B-2bit-GGUF/resolve/main/Hy-MT-demo.apk?download=true), as it already includes the merged kernel.

AngelSlim org

We have released STQ1_0 kernel for 1.25-bit model and given a PR to llama.cpp PR #22836 ! If you have any questions or suggestions for STQ_0, welcome to comment under the PR !🔥🔥🔥

Sign up or log in to comment