This model has previously been named AnimagineXL Realistic or Stylistic and Make Art Not Goon.
What makes this model special? I prioritized prompt following and composition, making it excellent for art. As a consequence it is also excellent at NSFW, rather than getting there by overtraining on pr0n at the cost of general knowledge, as many other models do.
How did I do this? I took the under-appreciated Animagine XL v4.0 because it was trained well, directly on top of SDXL 1.0 Base, with character knowledge up to Jan 2025. The prompt knowledge of the CLIP + model is something really special. With the right tags, it is rather artistic and magical with lighting, shadows and composition. CagliostroLab have a Discord too. I didn't use the "Opt" or "Zero" variants in the mix, as I think they broke the magic with those.
I think the style and prompt following peaked in v5, and I pushed too hard to improve realism in v6, v7, v8. I also focused too much on integrating DMD weights to get faster generations with all samplers - both normal samplers and LCM samplers.
v9_wip update
It's a work in progress, using the Animagine CLIP with Illustrious weights while maintaining prompt following. For general photorealistic generation, end the prompt with the rating tag "explicit" or "safe", followed by:
sharp focus, depth of field, fine hair strands texture, photo \(medium\), photorealistic, realistic, masterpiece, high score, great score, absurdres
And negative:
ugly, blurry, lowres, worst quality, low score, bad score, text, sketch, drawing,
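If you build prompts in a script, the suffix and negative above can be assembled roughly like this. It's only a sketch: the helper name photoreal_prompt and its parameters are made up for illustration and are not part of ComfyUI or any other tool. The tag strings are copied from the text above, minus the trailing comma on the negative.

```python
# Hypothetical helper: appends the rating tag and the photorealistic
# suffix to whatever subject tags you pass in.
# Raw strings keep the backslashes in "photo \(medium\)" intact.
PHOTO_SUFFIX = (
    r"sharp focus, depth of field, fine hair strands texture, photo \(medium\), "
    r"photorealistic, realistic, masterpiece, high score, great score, absurdres"
)
NEGATIVE = (
    "ugly, blurry, lowres, worst quality, low score, bad score, "
    "text, sketch, drawing"
)


def photoreal_prompt(subject: str, rating: str = "safe") -> tuple[str, str]:
    """Return (positive, negative) prompt strings for a photorealistic image."""
    if rating not in ("safe", "explicit"):
        raise ValueError("rating must be 'safe' or 'explicit'")
    # Drop stray whitespace and a trailing comma so tags join cleanly.
    subject = subject.strip().rstrip(",").strip()
    positive = f"{subject}, {rating}, {PHOTO_SUFFIX}"
    return positive, NEGATIVE


if __name__ == "__main__":
    positive, negative = photoreal_prompt("1girl, riding a horse, sunset")
    print(positive)
    print(negative)
```

Paste the two strings into the positive and negative prompt boxes of whatever front end you use.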
Previous versions
Check out the workflow in the "Training Data" download! The v8 model is basically tuned for ddpm at cfg 1, 20 steps. However, Euler a works almost as well, and LCM is particularly good as a first pass for architecture and strong shapes, followed by a ddpm pass to add more detail.
Many of the early versions are designed for low CFG. If you want to use cfg > 1, add the cfg norm node in ComfyUI to prevent burn!
For characters, the prompt format is:
1girl, name, series, rating, stuff, ... , stuff, ..., stuff, ... , masterpiece, high score, great score, absurdres
For example:
1girl, remilia scarlet, touhou, safe, riding a horse, sunset, masterpiece, high score, great score, absurdres
Prompting guide for Animagine: https://cagliostrolab.net/posts/optimizing-animagine-xl-40-in-depth-guideline-and-update
You don't need to add masterpiece, high score, great score, absurdres to the end of the prompt - it is tuned to work without it. Only add it if you want more anime style.
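The character prompt format above can be sketched as a small function. The name character_prompt and its arguments are hypothetical, purely for illustration. The tag order and the optional quality tags follow the text above.

```python
# Quality tags from the prompt format; optional on this model,
# add them only if you want a more anime style.
QUALITY_TAGS = ["masterpiece", "high score", "great score", "absurdres"]


def character_prompt(subject, name, series, rating, details, quality_tags=False):
    """Join tags in the order: subject, name, series, rating, details, quality."""
    parts = [subject, name, series, rating, *details]
    if quality_tags:
        parts.extend(QUALITY_TAGS)
    # Skip empty entries so a missing name or series leaves no ", ," gap.
    return ", ".join(p.strip() for p in parts if p and p.strip())


if __name__ == "__main__":
    print(character_prompt(
        "1girl", "remilia scarlet", "touhou", "safe",
        ["riding a horse", "sunset"],
        quality_tags=True,
    ))
```

With quality_tags=True this reproduces the Remilia example line; leaving it at the default gives the same prompt without the four trailing tags.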
Version History
v9 - ⭐⭐⭐ - It's almost an entirely new model. The clip encoder is the same, and the clip attention layers come from v5, but many of the weights are 50% or more from realistic illustrious models. Photorealism is better, prompt following is ok, hands and eyes are a little better.
v8 - ⭐⭐ - Heavily reworked the original Animagine 4.0 to maintain dynamic range in lighting and shadow, knowledge and prompt following, and vastly improved photo-realism.
v7 - ⭐⭐ - Squeezed a little more out of Animagine by optimizing specific booru tags and darkness.
v6 - ⭐⭐ - Quite a leap in prompt following, realism, and surrealist art.
v5 - ⭐⭐⭐ - Improved prompt following, shadow and light dynamic range, better artist style flexibility, and good anatomy and acceptable hands. It's still a bit more horny than I'd like but it's difficult when all the realism models excel at XXX.
v4 - ⭐ - An improvement in anatomy, realism, highlights and shadows. It also stays close to Animagine XL 4.0. Unfortunately it became more horny, so use "safe" in the positive and (nsfw, explicit) in the negative.
v3 - ⭐ - I focused on improving prompt following over the base model, plus face expressions, with only "good enough" photo-reality. Works with a wider range of samplers, steps and cfg.
v2 - more photorealism but less authentic to Animagine and broke lighting and expressions.
v1 - quite authentic to Animagine, but noisy except with specific samplers.
License
This model adopts the original CreativeML Open RAIL++-M License from Stability AI without any modifications or additional restrictions. The license terms remain exactly as specified in the original SDXL license, which includes:
✅ Permitted: Commercial use, modifications, distributions, private use
❌ Prohibited: Illegal activities, harmful content generation, discrimination, exploitation
⚠️ Requirements: Include license copy, state changes, preserve notices
📝 Warranty: Provided "AS IS" without warranties
Please refer to the original SDXL license for the complete and authoritative terms and conditions.
Description
This is the strongest model yet for details. I think v7 will be the final version in this style. If there is to be a v8, it must drift away from Animagine 4.0.
Comments (14)
The workflow is a gem. Thank you so much for sharing it.
Thanks! I have just updated it again a few seconds ago, in the "training data" download. It's a compromise - if you set the 6 noise seeds for the initial images to increment, you get more variety, but to reproduce, you have to drag the preview back in from the explorer, so it breaks your flow. No easy fix in comfyui, just not designed for that.
What card are you rocking? I only have a 3090 so it is tolerable speed for me!
Sorry, I'm just not getting it: what's the difference between the v7 model showcase images? They all seem to have the same prompt, seed, etc., and yet the styles differ?
CivitAI doesn't automatically read ComfyUI workflows properly. If you're using ComfyUI, the workflow is in the "Training Data" download zip on the model and the workflow explanation is in the description. Basically, the workflow generates random noise swirls of different brightness and colors as seeds, so you see what structure you're going to get instead of choosing a random number. You don't have to use it; the model is a normal model and works like any other.
@ShinyLilly Ok, got it. I wouldn't know, stuck with genning on civitai (for now!).
Great work, thank you!
I'm already used to the joker who follows me and gives a frowny face to most of my posts, but I can't for the life of me figure out why my dancing-in-field post got five frowns?
(it's near the top of this gallery's "most reactions" mode, here)
I'd love if anyone can explain, seems useful to know. Don't worry, I'm very thick skinned.
I would blame the fingercounting police!
@nemo474 Oh, I should have realized that! Thanks for pointing it out, useful to know.
Personally, I don't get them -- they think it was easy getting precisely seven fingers on each hand -- and for two characters simultaneously ?! I modestly assert 'twas a work of brilliance.
@ravemry9 I don't get it either, I only count toes.
@nemo474 lololol.
A frowny would be a negative reaction.
CivitAI does not offer any negative reactions. Thumbs down or disgust would be examples of negative reactions.
The tear next to the eye indicates this is not a frowny face.
Those who used that reaction may have found the image/scene touching.
@Words_of_Light Oh, sorry, I didn't notice your reply here; civit's unreliable notifications hit yet again. Thanks for the help!