GGUF

Please add IQ4_XS gguf

#1
by BigBeavis - opened

It's the highest quant type for 70B that lets 16+32 GB setups load at 8k context without spilling into the pagefile.
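For a rough sense of the numbers, here is a back-of-the-envelope sketch. Everything in it is an assumption rather than a measurement: it takes the nominal 4.25 bits per weight usually quoted for IQ4_XS, a flat 70e9 parameter count, and a Llama-2-70B-style layout (80 layers, 8 KV heads, head dim 128) with an fp16 KV cache. Real GGUF files mix tensor types, and the OS and compute buffers need their own headroom, so treat the result as a ballpark only.

```python
# Ballpark memory estimate; all constants are assumptions, not measured values.
GIB = 1024 ** 3


def weights_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB."""
    return n_params * bits_per_weight / 8 / GIB


def kv_cache_gib(ctx: int, n_layers: int = 80, n_kv_heads: int = 8,
                 head_dim: int = 128, bytes_per_elem: int = 2) -> float:
    """Approximate KV cache size in GiB (keys + values, fp16 by default)."""
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem
    return per_token * ctx / GIB


if __name__ == "__main__":
    budget_gib = 16 + 32                   # the "16+32 GB" setup from the post
    weights = weights_gib(70e9, 4.25)      # nominal IQ4_XS bits per weight
    cache = kv_cache_gib(8192)             # 8k context
    total = weights + cache
    print(f"weights  ~{weights:.1f} GiB")
    print(f"KV cache ~{cache:.1f} GiB")
    print(f"total    ~{total:.1f} GiB of {budget_gib} GiB "
          f"(~{budget_gib - total:.1f} GiB left for everything else)")
```

Passing a different `bits_per_weight` to `weights_gib` gives the same estimate for other quant types, which shows how quickly the remaining headroom shrinks.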
