A 8bit version of Model
#12
by varun500 - opened
No description provided.
A 8bit version of the model would be helpful which can be loaded in 16GB of GPU VRAM
TheBloke changed pull request status to closed
Please just use load_in_8bit=Truewith an HF model like I've told you!
Sure will do that