👍 Please Rate! - Your feedback, rating or a follow are greatly appreciated. - Thankyou! 😊
These models can use both Illustrious and XL Loras, though you may need to adjust the weights a bit.
🗈 Attention: I created a fork of this model for full realism called Frisky Dingo. You can find it here. (I'll be updating it shortly).
All v4 example images showcase the base model only, rendered in a single pass at high resolutions, with No detailers, LORAs, embeddings or upscaling, with only 2 - 3 exceptions. I'm using the Kohya Deep Srink node to maintain stability at higher resolutions.
There's a simplified workflow embedded in each of my v4 gallery images. (just verify the Magic Node settings with the 🗈 notes section at the bottom, as I may have tweaked them slightly). - Standard SDXL VAE is baked.
V4 - Cadmium - (a more fantasy / semi-realism aesthetic with a focus on high detail, high contrast & saturation.)
Suggested settings:
VAE: sdxl_vae (baked in)
Clip skip: 2
Samplers / Schedulers: DPMpp_2M_SDE_GPU / SGM_Uniform is recommended, but a wide selection are supported
Resolution: up to 2048 x 1536, with the Kohya Deep Shrink node, portrait or landscape. all standard 1MP resolutions work well.
CFG: 3 - 7.0 (I typically use 3.5, - 4.8)
Steps: 30 - 36
Prompting:
Natural language prompting & Danbooru tags. Generally less is more, for best results try to write clear concise prompts, (look to my sample images for examples and general formatting).
(added new models, I'll update the credits shortly).
V3 - Canny Mountain - (greater focus on retention of the illustrious knowledge base, camera film and post-processing effects)
Suggested settings:
VAE: sdxl_vae (baked in)
Clip skip: 2 was used during merge, (setting 1 or 2 should yield same results)
Samplers: DPM++ 2M SDE, DPM++ 3M SDE, Euler Ancestral Schedulers: SGM Uniform, Karras, Occasionally I use others for variance. (experimentation is recommended),
Resolution: all standard 1MP resolutions work well in portrait and landscape, (depending on context) (I often use 1024x1360 and 1120x1440 in portrait and landscape) I sometimes go as high as 1344x1728,
CFG: 3.8 - 8.0 (I typically use 3.8, -5.6 for photorealism)
Steps: 32 - 38 (I use 36 most often),
Prompting:
Danbooru tags, & natural language prompting. Generally less is more, for best results try to write clear concise prompts, (look to my sample images for examples and general formatting).
positive prompts - responds well to camera related tags: photorealistic, raw photo, amiture photo, depth of field, bokeh etc.
negative prompts - I generally recommend keeping sepia in your negatives to overcome a sepia bias. (it can be helpful to add a few things like "artificial, anime, illustration, unreal", if you're pushing for greater realism).
V2 - Fully REALized - (greater photorealism while maintaining much of the illustrious knowledge base)
Suggested settings:
VAE: sdxl_vae (baked in)
Clip skip: 2 was used during merge, (setting 1 or 2 should yield same results)
Sampler: DPM++ 2M SDE - SGM Uniform, (good option for photorealism) or Euler Ancestral - SGM Uniform (experimentation is recommended),
Resolution: all standard 1MP resolutions work well in portrait and landscape, (depending on context) (I often use 1024x1360 and 1120x1440 in portrait and landscape)
CFG: 2.8 - 9.0 (I commonly use 3.8, 5 & 7)
Steps: 24 - 38 (I use 36 most often, though I'm starting to use lower values with solutions like CFG rescale & Zero Star).
V1 - Beyond the Valley
Suggested settings:
VAE: sdxl_vae (baked in)
Clip skip: 2 was used during merge, (setting 1 or 2 should yield same results)
Sampler: Euler Ancestral - SGM Uniform (most consistent good results), DPM++ 2M SDE - SGM Uniform, (good option for photorealism)
Resolution: all standard 1MP resolutions work well in portrait and landscape, (depending on context) (I often use 1024x1360 and 1120x1440 in portrait and landscape)
CFG: 2.8 - 8.0 (lower for more photorealism - I commonly use 2.8, 3.8, 5 & 7)
Steps: 24 - 38 (I use 36 most often)
Prompting:
Primarily Danbooru tags, mixed with a bit of natural language prompting. Generally less is more, for best results try to write clear concise prompts, (look to my sample images for examples and general formatting).
positive prompts - Hype4realistic can be added to push realism a bit further.
negative prompts - I generally recommend starting with none and adding tags as needed. (it can be helpful to add a few things like "toon, illustration, unreal", if you're pushing for greater realism).
I started this project to in an attempt to recreated a specific aesthetic created by blinkdotleh using his workflow where 2 models split the steps during image generation. He created a series of images using Uncanny valley for the initial steps & my fabled Illusion model as a refiner. I created a style Lora from those outputs and included it in this merge. This is an attempt to recreate that look in a singe model while preserving as much of the Illustrious knowledge base as possible.
on the image generation side, this is roughly 50% Illustrious & 50% XL (mostly bigASP), while the CLIP more heavily favors Illustrious at about 65%.
🗈 Notes & Tips:
(to be expanded over time),
Preferred Sampler / Scheduler combos: in order preference:
DPM++ 2M_SDE_GPU, / SGM_uniform, Simple, beta,
DPM++ 2M_SDE, / SGM_uniform, Simple, beta,
DPM++ 3M_SDE_GPU, / Simple, SGM_uniform, beta,
Euler_A, / Simple, beta, SGM_uniform, exponential,
DPM++ 3M_SDE, / Simple, SGM_uniform, beta,
heun, / Simple, beta, (Sometimes garbage, sometimes pure gold)
dpmpp_2M, / beta, (same as the last one, hit or miss depending on scenario, but when it behaves, it's a clear winner, after some trial & error it becomes more predictable).
DDPM / ddim_uniform, (High accuracy with clean details, a bit of a washout - overexposed effect, but with higher CFG's & / or charged terms in your prompting, this can become the best option.)
Karras, tends to overcook things and glitch eyes, I almost never use it, (I know people swear by it, it's great for many scenarios / models but generally not with mine.
Exponential, pairs well with LCM to mitigate washout
AYS, works well, but I prefer LCM with this model.
------
Patch Model Add Downscale (Kohya Deep Shrink),
is a default node in Comfy that greatly increases image stability by compressing the Unet model during the 1st compositional stage of image generation then re-expanding it for fine details.
Parameters of concern include:
Block number: (3 is default but 4 can also be useful in some instances).
Downscale Factor: (I use 1.5 as a default but will go as low as 1.25 for resolutions close to standard, & sometimes as high as 2 if pushing the upper limits.
End Percent: (this is the percentage of total steps, the point at witch the unet will decompress. You can generally leave this at 0.35, but I sometimes make slight adjustments if I'm trying to eliminate a pesky error.
-------Resources used / Creator thanks
Checkpoints:
Uncanny valley by meden - (clip only)
Loras:
Hyperrealistic [Pony | Illustrious] by Zoropaton
SPO-SDXL_4k-p_10ep_LoRA_webui by rockeycoss
custom style Lora - (not yet publicly available)
(Additionally, thanks to everyone whos prompts I've pilfered for testing).
Description
🗈 Attention: I created a fork of this model for full realism called Frisky Dingo. You can find it here. (I'll be updating it shortly).
v4 Cadmium moves into a more fantasy / semi-realism aesthetic with a focus on high detail, high contrast & color saturation.
This version is fairly versatile in terms of sample / scheduler options, but I find dpm++_2M_SDE_GPU to be exception for prompt adherence.
Steps: 30 - 36
CFG: 3.2ish - 6.5ish (I tend to stick around 3.6 - 3.8 for a more realistic look & 4.6 - 4.8 for a more stylized look.
See description for further details and tips.



















