Why model dtype in config is fp32?
#1
by Rayzl - opened
I run inference with transformers' Qwen2_5_VLForConditionalGeneration:
model = Qwen2_5_VLForConditionalGeneration.from_pretrained("Xiaomi-MiMo-VL-Miloco-7B", dtype="auto").to("cuda")
When dtype is set to torch.bfloat16, the model does not stop thinking (it generates endlessly). With dtype set to "auto" it works fine.
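For context on why the two dtypes can behave differently: bfloat16 keeps float32's exponent range but only 8 significand bits, so casting fp32 weights down to bf16 loses precision, which can shift logits enough to change sampling behavior. A minimal sketch of that truncation in pure Python (no torch; bfloat16 is the top 16 bits of an IEEE-754 float32):

```python
import struct

def fp32_to_bf16(x: float) -> float:
    """Truncate a float32 to bfloat16 by dropping the low 16 mantissa bits,
    then widen back to float32 so the values can be compared directly."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    bf16_bits = bits & 0xFFFF0000  # keep sign, 8 exponent bits, top 7 mantissa bits
    (y,) = struct.unpack("<f", struct.pack("<I", bf16_bits))
    return y

w = 0.3141592653589793
print(fp32_to_bf16(w))  # 0.3125 — the weight already differs in the 3rd decimal
```

This is only an illustration of the precision gap, not a claim about what exactly goes wrong in this model; but it shows why weights stored as fp32 in the config are not guaranteed to behave identically after a bf16 cast.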