data-archetype committed on
Commit
6a12ad8
·
verified ·
1 Parent(s): e12172b

Fix bf16 standalone RMSNorm precision

Browse files
Files changed (2) hide show
  1. README.md +1 -0
  2. fcdm_diffae/norms.py +4 -4
README.md CHANGED
@@ -17,6 +17,7 @@ library_name: fcdm_diffae
17
 
18
  | Date | Change |
19
  |------|--------|
 
20
  | 2026-04-08 | Initial release |
21
 
22
  **Experimental patch-32 version** of
 
17
 
18
  | Date | Change |
19
  |------|--------|
20
+ | 2026-04-10 | Refresh standalone package: fix bf16 RMSNorm precision path in both encoder and decoder to match training code; local export tooling now preserves fp32 EMA weights for future re-exports |
21
  | 2026-04-08 | Initial release |
22
 
23
  **Experimental patch-32 version** of
fcdm_diffae/norms.py CHANGED
@@ -30,10 +30,10 @@ class ChannelWiseRMSNorm(nn.Module):
30
  # Float32 accumulation for stability
31
  ms = torch.mean(torch.square(x), dim=1, keepdim=True, dtype=torch.float32)
32
  inv_rms = torch.rsqrt(ms + self._eps)
33
- y = x * inv_rms
34
  if self.weight is not None:
35
  shape = (1, -1) + (1,) * (x.dim() - 2)
36
- y = y * self.weight.view(shape).to(dtype=y.dtype)
37
  if self.bias is not None:
38
- y = y + self.bias.view(shape).to(dtype=y.dtype)
39
- return y.to(dtype=x.dtype)
 
30
  # Float32 accumulation for stability
31
  ms = torch.mean(torch.square(x), dim=1, keepdim=True, dtype=torch.float32)
32
  inv_rms = torch.rsqrt(ms + self._eps)
33
+ y = x * inv_rms.to(dtype=x.dtype)
34
  if self.weight is not None:
35
  shape = (1, -1) + (1,) * (x.dim() - 2)
36
+ y = y * self.weight.view(shape).to(dtype=x.dtype)
37
  if self.bias is not None:
38
+ y = y + self.bias.view(shape).to(dtype=x.dtype)
39
+ return y