dennny123 Claude Sonnet 4.5 (1M context) committed on
Commit
2cdf689
·
1 Parent(s): e90ff32

Fix ops.py patch - actually move to CPU not just convert dtype

Browse files

Previous patch did input.float() which kept tensors on GPU.
Now does input.cpu().float() to actually run on CPU.
Then moves result back: x.to(device).to(dtype)

Co-Authored-By: Claude Sonnet 4.5 (1M context) <noreply@anthropic.com>

Files changed (1) hide show
  1. app.py +3 -2
app.py CHANGED
@@ -84,8 +84,9 @@ def _patch_qwen_for_mig_gpu():
84
  f'{space} x = torch.nn.functional.linear(input, weight, bias)\n',
85
  f'{space}except RuntimeError as e:\n',
86
  f'{space} if "CUBLAS" in str(e):\n',
87
- f'{space} x = torch.nn.functional.linear(input.float(), weight.float(), bias.float() if bias is not None else None)\n',
88
- f'{space} x = x.to(input.dtype)\n',
 
89
  f'{space} else:\n',
90
  f'{space} raise\n'
91
  ]
 
84
  f'{space} x = torch.nn.functional.linear(input, weight, bias)\n',
85
  f'{space}except RuntimeError as e:\n',
86
  f'{space} if "CUBLAS" in str(e):\n',
87
+ f'{space} device = input.device\n',
88
+ f'{space} x = torch.nn.functional.linear(input.cpu().float(), weight.cpu().float(), bias.cpu().float() if bias is not None else None)\n',
89
+ f'{space} x = x.to(device).to(input.dtype)\n',
90
  f'{space} else:\n',
91
  f'{space} raise\n'
92
  ]