CivArchive
    Preview 51916528
    Preview 50747522
    Preview 50909996
    Preview 50925402
    Preview 50752676
    Preview 50760051
    Preview 50760133
    Preview 51319297
    Preview 51342407
    Preview 51777880
    Preview 51988350
    Preview 52552845

    Stabilizer

    Name is misleading. This is NOT a LoRA that can magically fix your overfitted merged base model.

    This is a finetuned model from pretrained model. But trained as a LoRA.

    • 11k images. No bias, no default style.

    • Zero smooth plastic glossy AI image in dataset. Glossy Al images are polluting the world, but not on my watch, I handpicked every single image.

    • Natural language captions from Gemini, rather than tags in random order with high FPR.

    • Standard noise scheduler. Inpainting/img2img friendly.

    Improvements:

    You can get the style exactly as it should be (as long as the model knows).

    Comparisons with pretrained base model:

    See more xy plots in cover images.

    Why LoRA?

    • This LoRA is a DoRA (from Nvidia), which is more efficient than traditional LoRA.

    • Enough for thousands of training images.

    • What? You prefer to download and store a 7GiB checkpoint than a 80Mib LoRA?

    Share merges using this model is prohibited. FYI, there are hidden trigger words to print invisible watermark. I coded the watermark and detector myself. I don't want to use it, but I can.

    This model only published on Civitiai and TensorArt. If you see "me" and this sentence in other platforms, all those are fake and the platform you are using is a pirate platform.


    How to use

    It is highly recommended that you use pretrained base model.

    And load this LoRA with strength 1.

    Versions:

    Load this LoRA first

    This LoRA uses a new arch called DoRA from Nvidia, more efficient than traditional LoRA. But the patch weight is dynamically calculated based on the currently loaded base model weights (which will be changed when you loading other LoRAs). To avoid unexpected changes, load this LoRA first.

    Specify styles in prompt

    This model does not have an strong default style and is very creative. You must specify the style you want in the prompt.


    If you want to use it on finetuned/merged base models:

    I personally disagree this. This is not the model's original intention. But this is a LoRA after all.

    Be aware:

    This LoRA can't remove glossy shiny plastic AI style. Although the dataset is "AI image free". It can't make AI image polluted 1girl overfitted 50 versions of Nova furry 3D anime WAI or whatever look better. What the model learned is learned. Use a pretrained base model if you want to get rid of overfitted AI style.

    What is "overfitted AI style"? This is what Craft Lawrence (from spice and wolf) should be, if you've seen the anime: img. This is what those AI style polluted 1girl overfitted model generated: img

    Some base models already merged this model. If you got deformed images with this LoRA even at low strength (e.g. <0.5). Your base model has already merged this LoRA (and you merged it twice). And the model weights got multiplied (because how the DoRA works) and collapsed.

    Beware of fake base model creators, aka. thieves. Some "creators" never do the training, they only grab other people's models, merge them, wipe all metadata and credits, and sell it as their own base model.


    Update log

    Moved here

    Description

    FAQ

    Comments (5)

    solid_snekJan 14, 2025
    CivitAI

    It really does 'stabilize' the pictures. But it seems to increase the contrast a little bit too much. The smooth lighting especially suffers and is replaced by a strong light.

    reakaakasky
    Author
    Jan 15, 2025· 2 reactions

    I'm assuming you are using v-pred + v0.114. Yeah I noticed this issue. The training noise overshot a little bit. Will fix in next version.

    solid_snekJan 15, 2025

    @reakaakasky I'm really looking forward to it. Thank you so much!

    Dead_Internet_TheoryJan 17, 2025
    CivitAI

    Ok, I don't get it. How exactly it is a LoRA like this can even work? It's not like checkpoints are trained to be half-assed and you can just train the AI to "do better". Like, wouldn't a checkpoint maker just do what you're doing here?

    (I'm not criticizing your work, just wondering how can a LoRA have such a "special sauce", what's the tradeoff?)

    reakaakasky
    Author
    Jan 17, 2025· 5 reactions

    The tradeoff is less "creativity", less "crazy" things.

    The base model, especailly noobai, is so creative, because the huge amount of training data.

    An inappropriate metaphor:

    You prompt is "an apple". But what kind of apple? it's not in the prompt. The model will never know. It saw many apples, green, red,big,small. All fit your prompt. So the best option for the model is just generating something that is the average of what it saw. Which is, the best mathematically optimal solution, but also a really bad image. Same for the styles and many other small aspects, there are just too many things in the model.

    One way to solve this: use tons of prompt. e.g. "a big red shiny ... apple" Which is not user friendly, and the old CLIP text encoder in SDXL also cannot handle this very well.

    Other way: train a lora with only good normal thing, e.g. only big red apples, and let the model forget other uncommon things.