SRPO by Tencent: In the world of AI art, a groundbreaking new model, Direct-Align, is changing the game by teaching diffusion models to paint with human-like flair, while sidestepping two major creative roadblocks. Instead of the usual slow and expensive process of painstaking, step-by-step corrections, Direct-Align leaps ahead with a clever shortcut, using a predefined noise prior to instantly "interpolate" stunning visuals from any point in the creative process. Even more revolutionary is its ability to learn on the fly. By introducing Semantic Relative Preference Optimization (SRPO), the model can listen to text-based feedback - like a master artist adjusting to a client's whims - and make real-time changes to its style. This eliminates the need for endless, repetitive training sessions, making it remarkably efficient. The results speak for themselves: in a dazzling display, Direct-Align fine-tuned the FLUX.1.dev model, boosting its realism and aesthetic appeal by over three times.
馃憞
Resources:
1) Model GGUF: https://huggingface.co/befox/SRPO-GGUF/tree/main
or the refined version: https://civarchive.com/models/1953067?modelVersionId=2210446
2) VAE GGUF: https://huggingface.co/calcuis/pig-vae/blob/main/pig_flux_vae_fp32-f16.gguf
3) Flan GGUF: https://huggingface.co/silveroxides/flan-t5-xxl-encoder-only-GGUF/tree/main
3) UltraSharp v2: https://huggingface.co/Kim2091/UltraSharpV2/tree/main
4) for NSFW (here is what you can do: https://civarchive.com/posts/22174353) use this lora: https://civarchive.com/models/1295758/nsfw-fluxorwan-22orqwen-mystic-xxx?modelVersionId=2009929 or this one: https://civarchive.com/models/754919/photorealistic-nude ... or any other one you like ;)
Description
works as supposed ;)
FAQ
Comments (17)
Censored?
here is what you can do: https://civitai.com/posts/22174353
using this lora: https://civitai.com/models/1295758/nsfw-fluxorwan-22orqwen-mystic-xxx?modelVersionId=2009929
I only get black images
sometimes it is because of some kind of "attention" turned ON
Disable the ReFlux patcher
i am also getting black images help
@vivekkarumudi300聽you should not, it is a pure "Flux" WF
@0l1v1aR0551聽how to fix the black images then? I'm using the flow as written here. I don't see any attention.
@WarmKnowledge6820692聽you have some kind of technical issue with Comfy, and the "devil is in the details" somewhere, the updated regular Comfy just "works"...
since I was not fixing this problem for myself - I can't help with it, for now
This is based on flux? And would all Flux lora's work similar to chroma?
it is based on flux - NSFW samples of this model were made with LORAs made for Flux
its give nice render thanks, only downside that its very long ( 600s ) for my 4080 12gb vram even with Teacache, any tips for speed up the gen x2 without decrease too much the quality ? Thanks!!!!
first sampler - set smaller amount of steps - like 34
upscaler - set smaller final resolution
@0l1v1aR0551聽thanks!
how does it fare in comparison to flux? don't see any real improvements in the examples just blurry and almost like they need way more steps? Such a waste of potential they should have done Chroma, Qwen image, new Hunyuan Image or even a Wan 2.2 instead of wasting time on flux with its very restrictive license
in comparison to pure Flux and even Krea - this model does ONLY realistic photos, mostly of people (I was directly comparing all 3 simply by loading them into the workflow with the same seed / prompt) - so if you need humans - use this one or maybe Krea
wow. thanks for best workflow







