Fix for onnxruntime-gpu slow first run
#5
by ehrrh - opened
With this model, if you set providers=["CUDAExecutionProvider", "CPUExecutionProvider"] to run ONNX inference on the GPU, the first inference call spends a long time doing something on the GPU before finally tagging everything quickly. This happens with both onnxruntime-gpu 1.18.1 and 1.19.0. If the number of images to tag is small, the long wait before the first image makes CPU inference faster overall. Why is that?
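For anyone wanting to reproduce the measurement, here is a minimal sketch that separates the first call's latency from the steady state. The timing helper and the simulated inference function are hypothetical illustrations, not from this thread; the sleep stands in for the one-time GPU work observed before the first real inference.

import time

def time_first_vs_rest(infer, inputs):
    """Return (first-call latency, mean latency of the remaining calls)."""
    timings = []
    for x in inputs:
        start = time.perf_counter()
        infer(x)
        timings.append(time.perf_counter() - start)
    return timings[0], sum(timings[1:]) / max(len(timings) - 1, 1)

# Simulated session: the first call pays a one-time warm-up cost,
# standing in for the long GPU wait described above.
calls = []
def fake_infer(image):
    if not calls:
        time.sleep(0.05)  # hypothetical stand-in for the warm-up work
    calls.append(image)

first, rest = time_first_vs_rest(fake_infer, range(10))
print(f"first call: {first:.3f}s, later calls: {rest:.5f}s")

With a real InferenceSession in place of fake_infer, the gap between the first call and the rest shows the warm-up cost directly.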
ehrrh changed discussion status to closed
ehrrh changed discussion status to open
Ok, found the actual fix: https://github.com/microsoft/onnxruntime/issues/19838
import onnxruntime as rt

cuda_options = {
    # Use the heuristic cuDNN convolution algorithm search instead of
    # the exhaustive default, which causes the slow first inference.
    "cudnn_conv_algo_search": "HEURISTIC"
}
model = rt.InferenceSession(
    model_path,
    providers=[("CUDAExecutionProvider", cuda_options), "CPUExecutionProvider"],
)
ehrrh changed discussion title from Bug with onnxruntime-gpu to Fix onnxruntime-gpu slow first run
ehrrh changed discussion title from Fix onnxruntime-gpu slow first run to Fix for onnxruntime-gpu slow first run
Keeping this open for awareness. I'm assuming you reported this upstream too?
SmilingWolf changed discussion status to closed
SmilingWolf changed discussion status to open