Diffusers documentation

ErnieImageTransformer2DModel

You are viewing main version, which requires installation from source. If you'd like regular pip install, checkout the latest stable version (v0.37.1).
Hugging Face's logo
Join the Hugging Face community

and get access to the augmented documentation experience

to get started

ErnieImageTransformer2DModel

A Transformer model for image-like data from ERNIE-Image.

A Transformer model for image-like data from ERNIE-Image-Turbo.

ErnieImageTransformer2DModel

class diffusers.ErnieImageTransformer2DModel

< >

( hidden_size: int = 3072 num_attention_heads: int = 24 num_layers: int = 24 ffn_hidden_size: int = 8192 in_channels: int = 128 out_channels: int = 128 patch_size: int = 1 text_in_dim: int = 2560 rope_theta: int = 256 rope_axes_dim: typing.Tuple[int, int, int] = (32, 48, 48) eps: float = 1e-06 qk_layernorm: bool = True )

Update on GitHub