Looking forward to seeing what size GGUF you can make
#1
by infinityai
I don't know if you've seen this, but it's supposed to let you quantise weights from 16-bit down to 2-bit with next to no loss. Supposedly it's a way to compress the weights almost losslessly.
Can you have a look at it and let me know what you think?
Official repos:
- QuIP (original): https://github.com/Cornell-RelaxML/QuIP
- QuIP# (improved + CUDA kernels): https://github.com/Cornell-RelaxML/quip-sharp
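From what I understand, the core trick is "incoherence processing": rotate the weight matrix with random orthogonal matrices so outliers get spread out evenly, round in the rotated basis, then rotate back. Here's a rough numpy toy of just that idea (my own sketch, not code from the repos; the real methods also do adaptive rounding with Hessian info in QuIP and lattice codebooks / Hadamard transforms in QuIP#, which this skips):

```python
import numpy as np

rng = np.random.default_rng(0)

def random_orthogonal(n, rng):
    # QR of a Gaussian matrix (with a sign fix) gives a random rotation
    q, r = np.linalg.qr(rng.standard_normal((n, n)))
    return q * np.sign(np.diag(r))

def quantize_2bit(w):
    # Plain symmetric 2-bit rounding: 4 levels at {-1.5, -0.5, 0.5, 1.5} * scale
    scale = np.abs(w).max() / 1.5
    return (np.clip(np.round(w / scale - 0.5), -2, 1) + 0.5) * scale

# Toy weight matrix with a few large outliers, like real LLM weights have
W = rng.standard_normal((64, 64))
W[rng.random(W.shape) < 0.01] *= 20.0

# Naive 2-bit rounding: the outliers blow up the scale, small weights suffer
W_naive = quantize_2bit(W)

# Incoherence processing: rotate, quantize in the rotated basis, rotate back
U = random_orthogonal(64, rng)
V = random_orthogonal(64, rng)
W_incoh = U.T @ quantize_2bit(U @ W @ V.T) @ V

err = lambda A: np.linalg.norm(W - A) / np.linalg.norm(W)
print("naive 2-bit relative error:     ", err(W_naive))
print("incoherent 2-bit relative error:", err(W_incoh))
```

On my machine the rotated version comes out with a much lower reconstruction error, since the rotation smears the outliers so the quantisation scale isn't dominated by a handful of huge weights. The papers get the rest of the way with smarter rounding, so don't read too much into the toy numbers.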
It would be really cool if we could compress some of the top models by applying this QuIP quantisation technique on top of the REAP pruning.