Thank you for your work and support. Is it possible to make a higher quantization variant von that model? Like Q4 or even Q2? I would like to speed up the response speed with that.
· Sign up or log in to comment