RedHatAI/gemma-4-31B-it-NVFP4

Image-Text-to-Text
Transformers
Safetensors
gemma4
fp4
vllm
llm-compressor
compressed-tensors
conversational
8-bit precision
Community
5
Error KeyError: 'layers.0.experts.0.down_proj.input_global_scale' when running on vllm

1
#5 opened 2 days ago by
pachePizza

Running Gemma 4 Truthfully at 128K on One RTX 5090

1
#4 opened 5 days ago by
Mosai-Sys

Please update tokenizer config as well

12
#2 opened 9 days ago by
alexcardo

Can you please update the chat template

❤️ 3
1
#1 opened 9 days ago by
alexcardo