33.6 GB
VictorSanh's picture
gpu memory / inference speed - tradeoffs of quantization
1fa1cbd