Stable Diffusion v1.5 β€” Pokemon LoRA

LoRA adaptation weights for Stable Diffusion v1.5, fine-tuned on the Pokemon BLIP Captions dataset (833 samples) to generate Pokemon-style images from text prompts.

Video walkthrough: Fine-tune Stable Diffusion with LoRA for as low as $1

Training Details

Detail Value
Method 4-bit QLoRA following the official LoRA blog post
Hardware Amazon EC2 g4dn.xlarge (1x NVIDIA T4 GPU)
Environment AWS Deep Learning AMI for PyTorch
Training time ~6 hours
Cost ~$3 on-demand / ~$1 with EC2 Spot Instances

Usage

from diffusers import StableDiffusionPipeline
import torch

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
)
pipe.unet.load_attn_procs("juliensimon/stable-diffusion-v1-5-pokemon-lora")
pipe.to("cuda")

image = pipe("a pokemon with blue fire wings").images[0]

Resources

  • Training script and training logs are included in the repository.
Downloads last month
28
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for juliensimon/stable-diffusion-v1-5-pokemon-lora

Adapter
(2715)
this model

Dataset used to train juliensimon/stable-diffusion-v1-5-pokemon-lora

Space using juliensimon/stable-diffusion-v1-5-pokemon-lora 1