EllaPriest45 commited on
Commit
26b709b
·
verified ·
1 Parent(s): 17a133f

Upload 6 files

Browse files
.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ Fabled[[:space:]]Infusion[[:space:]]v4.0[[:space:]]FP16[[:space:]]-[[:space:]]Illustrious.png filter=lfs diff=lfs merge=lfs -text
37
+ Frisky[[:space:]]Dingo[[:space:]]FP16[[:space:]]-[[:space:]]Illustrious.png filter=lfs diff=lfs merge=lfs -text
Fabled Infusion v4.0 - Illustrious.txt ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Fabled Infusion v4.0 - Illustrious
2
+
3
+ All v4 example images showcase the base model only, rendered in a single pass at high resolutions, with No detailers, LORAs, embeddings or upscaling, with 2 - 3 exceptions. I'm using the Kohya Deep Srink node to maintain stability at higher resolutions.
4
+
5
+ There's a simplified workflow embedded in each of my v4 gallery images. (just verify the Magic Node settings with the 🗈 notes section at the bottom, as I may have tweaked them slightly). - Standard SDXL VAE is baked.
6
+
7
+ V4 - Cadmium - (a more fantasy / semi-realism aesthetic with a focus on high detail, high contrast & saturation.)
8
+
9
+ Suggested settings:
10
+
11
+ VAE: sdxl_vae (baked in)
12
+
13
+ Clip skip: 2
14
+
15
+ Samplers / Schedulers: DPMpp_2M_SDE_GPU / SGM_Uniform is recommended, but a wide selection are supported
16
+
17
+ Resolution: up to 2048 x 1536, with the Kohya Deep Shrink node, portrait or landscape. all standard 1MP resolutions work well.
18
+
19
+ CFG: 3 - 7.0 (I typically use 3.5, - 4.8)
20
+
21
+ Steps: 30 - 36
22
+
23
+ Prompting:
24
+
25
+ Natural language prompting & Danbooru tags. Generally less is more, for best results try to write clear concise prompts, (look to my sample images for examples and general formatting).
Fabled Infusion v4.0 FP16 - Illustrious.png ADDED

Git LFS Details

  • SHA256: 96f6dcf96ffe1fe0038f8c9b47f62ad00d9cccfe60eccd96b857558bf6d38078
  • Pointer size: 132 Bytes
  • Size of remote file: 3.01 MB
Fabled Infusion v4.0 FP16 - Illustrious.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:168239c3f5b6bcac8f4726ed29d8ee44b3645f529a000b0a5834360886c7efdc
3
+ size 7105376360
Frisky Dingo FP16 - Illustrious.png ADDED

Git LFS Details

  • SHA256: 171a049260720c00195cd5c2e3e5bd862a9c60878a2ad7797eba527cbf617af5
  • Pointer size: 132 Bytes
  • Size of remote file: 4.2 MB
Frisky Dingo FP16 - Illustrious.txt ADDED
@@ -0,0 +1,53 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Frisky Dingo FP16 - Illustrious
2
+
3
+ Karras should generally be avoided with v1. DDPM - DDIM is recommended, with DPM++ 2M / 3M SDE - Heun / SGM_Uniform, as a good 2nd choice. (see below for additional options).
4
+
5
+ This is a hybrid with a heavily IL biased CLIP, (primarily iLustMix v7), & an XL biased UNET. It can use IL, XL & PONY LORAs to varying degrees, though you may need to adjust the weights a bit, (situationally depending). Poses and concepts tend to work well while things like artist styles generally don't translate to realism, but can still be compositionally useful, in addition to pushing the model into a semi-realism or illustration / anime style. (I'll have a section @ the bottom for additional notes on prompting, LORAs & settings which I'll try to expand on it over time).
6
+
7
+ I used upscaling & a detailer in most example images, (4x_RealWebPhoto-v4 and / or 4x_foolhardy_Remacri).
8
+
9
+ V1 Suggested settings:
10
+
11
+ VAE: sdxl_vae (baked in).
12
+
13
+ Clip skip: 2 was used during merge, (setting 1 or 2 should yield same results).
14
+
15
+ Samplers/Schedulers: DDPM / DDIM_Uniform (will yield the best results consistently) For good (close 2nd) options: DPM++ 2M SDE & DPM++ 3M SDE / Heun & SGM_Uniform, or Euler Ancestral/SGM_Uniform.
16
+
17
+ low step options: (LCM/SGM_Uniform or DPM++ 2M/AYS, DPM++2Sa/AYS).
18
+
19
+ Resolution: all standard 1MP resolutions work well in portrait and landscape, (depending on scene complexity). In addition to standard XL resolutions, I often use 1024x1360, 1120x1440, 1232x1584, and occasionally 1344x1728, in my initial generation. Some scenes will benefit from the higher resolution in both composition & detail, but you will loose some prompt adherence and see more errors in the highest resolutions.
20
+
21
+ CFG: 3.8 - 7 (I typically use 4.6, -5.2 in my initial generation). A more stylized look will tend to creep in @ higher CFGs.
22
+
23
+ Steps: 32 - 38 (I use 36 most often, but I find 38 is sometimes required to nail a pose in more complex scenes).
24
+
25
+ DMD2 / LCM: I like to set the LORA strength low (around 0.6) & the CFG high (about 1.6-1.7) with 12-16 steps. This allows neg prompts to work & I find it to be the right balance of speed and output quality. (I'll expand on multi-pass refiner / upscaling once I do a bit more testing).
26
+
27
+ Prompting:
28
+
29
+ Natural language prompting supported by Danbooru tags. Generally less is more, (this model takes things very literally & has a slight learning curve, but I promise it's worth the time). For best results try to write a clear concise prompts summery in natural language, followed by tags to fill out details. A photo centric approach is best when trying to push realism. Too many IL tags associated with anime will start to push you into semi-realism, they are however effective and can be useful in composition. With a little time getting used to the nuance, you'll find the balance point where you can get an IL biased composition with photorealistic output. Quality tags should generally be put last, (look to my sample images for examples and general formatting). I'll add more on this in the notes section over time.
30
+
31
+ positive prompts - responds well to camera related tags: photorealistic, raw photo, amiture photo, depth of field, Fujifilm XF 50mm f/2 R WR lens, 35mm film, bokeh, etc.
32
+
33
+ negative prompts - I generally recommend keeping sepia in your negatives to overcome a sepia bias in lower CFGs. (artificial, anime, illustration, unreal) up to a weight of :1.4 if needed. (always best to keep weights as low as you can to achieve the scene). (board expressionless:1.2), is useful for getting away from the default XL poker-face.
34
+
35
+ Check in from time to time for for additional info: Prompting Insights, LORA Settings, Gen Settings and anything else useful I can think of, I'll be expanding on the Notes section over time. (see bellow).
36
+
37
+ (This was a passion project that I obsessed on for 3 weeks. I hope you dig it).
38
+
39
+ 🗈 Notes & Tips:
40
+
41
+ (to be expanded over time),
42
+
43
+ Notes: Frisky essentially started as a "full- Realism" branch of Fabled Infusion, they have very similar ingredients but in different ratios. While testing the latest Fabled version and discovered that the CLIP had picked up some issues over the iterative versions, so I started from scratch. I followed a very similar process to what I did with FI, but I avoided the mistake of using LORA's to stabilize sub-mixes, before a final mix. (I suspect that was the biggest issue with my process). I used a couples LORAs here, but only subtly in the final step. Additionally, I rebuilt the clip in the final step. I think this is my most cohesive merge to date. It should be quite similar to what you're used to with FI, but way more photorealism biased. It also takes most LORAs better, (at least in my testing so far).
44
+
45
+ LORAs: It's best to split CLIP & Model strength if your UI allows for this. If I'm using an IL or pony pose LORA, I tend to set the CLIP at about 0.92-1.0 & the model much lower, (I test to find the point where I can get the pose, without influencing the visual output of the checkpoint), This tends to be in the 0.60-0.80 range. If you don't have this option you can just set LORA strength a little lower until you find the best balance point, (but it's very nice to have fine control over this).
46
+
47
+ Prompting: Your prompt should be structured in order of importance / what you want to see in the foreground, followed by additional details, and finally, the medium & quality tags.
48
+
49
+ FD handles specificity well, I.E "Gibson Les Paul guitar" will yield much better results than guitar.
50
+
51
+ Be aware that the model can be hypersensitive some tags & phrases. This is true across most SDXL models, but I notice it hear. For example using terms like "hyperdetailed warm skin texture with viable pores" or "freckles" can overcook your output, especially at higher CFGs. I generally use lower weights like (freackles:0.25) to compensate.
52
+
53
+ I was using "peach fuzz" in my list of skin detailed but peaches started appearing everywhere. I've switched to "baby hairs on body" or "baby hairs illuminated by sunlight". You'll likely find examples where you need to find a creative way to reword something.
WAI_Not FP16 - Illustrious.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e585bc2a4aa07cfa99d6f71839ad6b09b71da12936aa57114c79008f9433ead0
3
+ size 7105351992