Hello, could you please share your quantitative methods or scripts? Thanks in advance.

#1
by lzyrapx - opened

Hello, could you please share your quantitative methods or scripts of gemma-3-12b-it-NVFP4? Thanks in advance.

lzyrapx changed discussion title from Hello, could you please share your quantitative methods or scripts? to Hello, could you please share your quantitative methods or scripts? Thanks in advance.

I wrote it according to the llm-compressor's examples.
https://github.com/vllm-project/llm-compressor/blob/main/examples/multimodal_vision/gemma3_example.py
Just change the modifier to NVFP4.

Thank you so much!

Sign up or log in to comment