GGUF: Flux Unchained - CivArchive (CivitAI Archive)

GGUF: Flux Unchained - Q5_1_v2

NSFW

Preview undefined

[Note: Unzip the download to get the GGUF. Civit doesn't support it natively, hence this workaround]

GGUF version of FluxUnchained by socalguitarist . Credit goes to him for tuning this model. I converted it to GGUF by a modified version of this script.

It can be used in ComfyUI with this custom node or with Forge UI.

See https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1050 to learn more about Forge UI GGUF support and also where to download the VAE, clip_l and t5xxl models.

Which model should I download?

[Current situation: Using the updated Forge UI and Comfy UI (GGUF node) I can run Q8_0 on my 11GB 1080ti.]

Download the one that fits in your VRAM. The additional inference cost is quite small if the model fits in the GPU. Size order is Q4_0 < Q4_1 < Q5_0 < Q5_1 < Q8_0.

Q4_0 and Q4_1 should fit in 8 GB VRAM
Q5_0 and Q5_1 should fit in 11 GB VRAM
Q8_0 if you have more!

Note: With CPU offloading, you will be able to run a model even if doesn't fit in your VRAM.

Updates

V2: I created the original (v1) from an fp8 checkpoint. Due to double quantization, it accumulated more errors. So I found that v1 couldn't produce sharp images. For v2 I manually merged the bf16 Dev checkpoint and then made the GGUF. This version can produce more details and much crisper results.

All the license terms associated with Flux.1 Dev apply.

Description

Bigger than Q5_0, should be better

FAQ

Comments (95)

sarcastictofuAug 21, 2024· 3 reactions

CivitAI

I have one question.. why is this model a zip file not a Safetensor file?

tsujioai740Aug 21, 2024· 5 reactions

It's because it's a GGUF quant and Civitai doesn't support this format yet. So this is a workaround.

Author

Aug 21, 2024· 2 reactions

Just unzip it, you'd get the GGUF. Civit doesn't support this format, hence this workaround.

rfdomingos363Aug 21, 2024· 25 reactions

@nakif0968 (sigh) unzips....

bombomooAug 22, 2024· 2 reactions

CivitAI

Q8 has 4gb file inside. Looks like an error.

Author

Aug 22, 2024

What's the size of the zip file? Did you download the correct version https://civitai.com/models/662112?modelVersionId=748232 ?

bombomooAug 22, 2024

Zip file is 12gb, but after unzipping it's 4gb for some reason

ishadowxxAug 22, 2024· 5 reactions

CivitAI

Perfect🔥🔥🔥.

Now we only need to work on the Ace Hole not looking like an Hole-less meat whirlpool and the Vanjana not looking like a smashed hotdog.

wang1818Aug 22, 2024· 6 reactions

CivitAI

Foge works perfectly, thanks to the author for the good model, under dpm++2m, 8steps has very good results, 2070s 8g, 36 seconds

yondaimesaAug 24, 2024

Hi, could you tell how you got it working? Where did you download vae, and which T5 model you downloaded? Could you give a link or name?

leclettico912Aug 26, 2024

I can't see the gguf model in the Forge checkpoints list. Yes, the model is in webui\models\Stable-Diffusion

wang1818Aug 29, 2024

@leclettico912 Is your foge up to date? https://github.com/lllyasviel/stable-diffusion-webui-forge

wang1818Aug 29, 2024

@leclettico912 [GGUF and Flux full fp16 Model] loading T5, CLIP + new VAE UI · lllyasviel/stable-diffusion-webui-forge · Discussion #1050 · GitHub

wang1818Aug 29, 2024

@yondaimesa [GGUF and Flux full fp16 Model] loading T5, CLIP + new VAE UI · lllyasviel/stable-diffusion-webui-forge · Discussion #1050 · GitHub

RogueLikelyAug 22, 2024

CivitAI

Running ComfyUI on linux, have ran the pip upgrade and made sure ComfyUI is updated, but the GGUF nodes keep failing to import. Does anyone know of a fix?

Light_SaberAug 22, 2024· 1 reaction

CivitAI

Unable to run in Forge UI. Give me the followin error "AssertionError: You do not have CLIP state dict!"

Author

Aug 22, 2024· 4 reactions

You need to download T5, CLIP-L and VAE separately and put in the corresponding folders. See here for more info https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1050

Author

Aug 22, 2024· 4 reactions

Also, be sure to load them in the UI (along with setting the checkpoint). [How: See the pic Illyasviel shared in the above link.]

Light_SaberAug 22, 2024· 1 reaction

@nakif0968 Thanks a lot mate. This is really helpful.

Author

Aug 22, 2024· 1 reaction

@Light_Saber No problem.

velantegAug 23, 2024

@nakif0968 Can you provide more details? There no any mention of clip_l on this page at all.

leclettico912Aug 23, 2024

@velanteg from that page: "Put clip-l and t5 in models\text_encoder"

fallen_programmerSep 17, 2024

I've downloaded t5, clip and VAE but after this I have error : ValueError: Failed to recognize model type!

joanfrances1996Aug 22, 2024· 1 reaction

CivitAI

Any chance for a Q2 or Q3 version?

Author

Aug 22, 2024· 1 reaction

If the gguf-py library adds support for them I can give it a try. But for now, I don't know how to do that.

joanfrances1996Aug 22, 2024· 1 reaction

@nakif0968 Thanks for the consideration. I'm just getting into gguf as well. I'll take a look as well.

velantegAug 23, 2024· 3 reactions

CivitAI

Unpacked imto models folder but Forge not see gguf checkpoint.

88850Aug 24, 2024

CivitAI

can you turn on on site generation? i really wanna try this one!

Author

Aug 24, 2024

Civit doesn't support any Flux base model yet. That being said, this one's meant to speed up local generations anyway.

88850Aug 24, 2024· 2 reactions

@nakif0968 ah gotcha. that explains why i can only find 2 checkpoint flux models that support on site generation. thanks for the info 💪

schschAug 24, 2024· 2 reactions

CivitAI

Thanks for your work! Any recommendation of number of STEPS and sampler? I got incredible results in Comfyui with steps 8 and EULER/Normal, 768x1024, on my 6G VRAM card, using Q3. I can load Q4 (my NVIDIA driver supports offloading VRAM to RAM), but outputs are a bit pixelated. It only gets better with 15 or more steps, but not so sharp as Q3. Is there a way to use PASEER or those lightning loras? Thanks!

Author

Aug 24, 2024· 1 reaction

Euler + Simple schedule. I haven't tried anything else.

zelgadisexeAug 25, 2024

CivitAI

For some reason, your models is downloaded in the archive and, probably, that's why after it is unzipped, my hash does not match what is listed on the site. Because of this, the model is not displayed in the resources when uploading the image to CivitAI. Of course, i can manually insert the correct hash of the model into each the png, but it's boring((

Author

Aug 26, 2024

Maybe because it's a zip file. If Civit supported .gguf it wouldn't be a problem.

zzzeenAug 27, 2024· 3 reactions

CivitAI

One of the best. clear and sharp while some other FLux version give me some noise. But i have problem. Yesterday, its work seamlessly with flux lora. But today, it's broken. image generated but without any lora effect. Anyone can exlpain this case?

tedbivAug 27, 2024· 3 reactions

CivitAI

nice. used q8 v2 version. on rtx 3060 an 896x1152 image takes about 1:24 minutes. first model to give me tentacles wrapped around diver. thanks for release. see image below.

tedbivAug 27, 2024· 3 reactions

CivitAI

the girls in this model seem prettier than other flux models. they don't have the square dimpled 'male' look to them... good job.

tedbivAug 27, 2024· 2 reactions

CivitAI

really good following prompts... better than some of the other flux models i've used.

Rance99Aug 28, 2024· 3 reactions

CivitAI

Can you please include training some Asian female figures and faces, I think it's already excellent for Europe and the US, but the support for Asian characters isn't good enough

tedbivAug 28, 2024

something specific? asian girls turn out nice for me...

Rance99Aug 29, 2024· 1 reaction

@tedbiv I think more often than not it's European girls that are exported, there's still less diversity in Asian girls

infernahermit846Aug 28, 2024· 7 reactions

CivitAI

Loads really fast.
Generation is slow. over 1min for 20step images.
Q4 generates high quality images and loads fast with 8GB vram.
Can do SFW and basic NSFW.
Nice model over all.
We need faster Flux models + Less v-ram hungry ones, otherwise it's not worth it.

tedbivAug 28, 2024· 2 reactions

CivitAI

silly question....

is there a prompt/key phrase to get smaller breasts and nipples? i've tried small, tiny, flat chested. seems to have no affect.

thanks in advance.

MonkeyDFeelsAug 28, 2024· 2 reactions

CivitAI

Has anyone tried using loras with this checkpoint? I'm trying to use AIENGI models when same prompts and they just don't look right :(
https://civitai.com/user/AIENGI/models

ViratXAug 29, 2024· 3 reactions

CivitAI

Hi! Do you plan on creating GGUF model for the HyFU-8-Step-Hybrid version of Flux Unchained by SCG?
Please do if you can! Thank You.

iKandyAug 30, 2024

CivitAI

May I know how long it takes to render an image with your 1080 11GB GPU?

Author

Aug 30, 2024

~12s/step * 20 steps = 240s = 4min

iKandyAug 30, 2024

@nakif0968 wow that's not too bad. any tips on optimising it further in Forge UI?

Author

Aug 30, 2024· 1 reaction

Can't think of any beyond what's already on the Forge UI Github

BroseppiSep 2, 2024

CivitAI

I am not saying you are wrong with this comment, I just want to know what I am doing wrong. You say that gguf is faster than safetensor in the safetensor upload, but with my 3090 I am getting 35s/it instead of 1.3s/it that I get with the safetensor. I must be doing something wrong, but i have no idea what

SendraelSep 7, 2024· 1 reaction

GGUF is not faster than fp8 or fp16, it is slower. Thats because has a compressed data format. Th egguf versions are just smaller to get it in your vram. With a 3090 you should have no problem using fp16...

BroseppiSep 14, 2024

@Sendrael ahh, right. Does the fp16 have the vae, clip and text encoder all built in? I will try it out in any case. Thanks for letting me know, I appreciate it :)

Also, it seems that the gguf issue fixed itself, it

Edit: could you point me toward the model page with the fp16 to download? I'm having trouble finding it

Edit 2: Ok, found the dev. Now looking for schnell.

Edit 3: what should I put for the VRAM GPU weights slider? The same size as the model file, smaller or larger? and if one of the latter 2, by how much?

Edit 4: I am actually getting .2 s/it slower generation speed using fp16 over the ggufs.

aiM0NGUsSep 2, 2024

CivitAI

awesome stuff keep up the great work! btw, how do you convert/change from a safetensors to a gguf format, is there any tools to do this you can recommend, thanks! :)

jonesaidSep 3, 2024

CivitAI

Are these all quants of Flux Unchained version 1.0? The "v2" in the titles is confusing.

Author

Sep 3, 2024· 1 reaction

V2: I created the original (v1) from an fp8 checkpoint. Due to double quantization, it accumulated more errors. So I found that v1 couldn't produce sharp images. For v2 I manually merged the bf16 Dev checkpoint and then made the GGUF. This version can produce more details and much crisper results.

hinablueSep 8, 2024

CivitAI

I've also used the ComfyUI-GGUF tool to convert my own models, but Forge doesn't work properly. Could you please share how you did the gguf conversion?

Author

Sep 8, 2024· 1 reaction

Use this fork https://github.com/mhnakif/ComfyUI-GGUF

hinablueSep 8, 2024

@nakif0968 Thank you!

97BuckeyeSep 12, 2024

CivitAI

Thank you for your work. Very appreciated. One additional request... Would you please make this available in the Q6K version? I've found that to be almost as good Flux Dev, but saving a couple extra GB of VRAM that I need on my 12GB GPU.

Author

Sep 12, 2024· 1 reaction

https://civitai.com/models/705823?modelVersionId=789534

homoludensSep 13, 2024

CivitAI

1st THX 1st - thanks for sharing...

But I think decision to release it as a ZIP is very inconvenient. I am comparing different GGUF releases of FLUX and it is constantly A BIG PAIN in the letter "S" to check which images should be posted on which CivitAI page.

The name of ZIP archive differs from the name of GGUF inside. Why did you decide to compress the GGUF format in the first place? The compression profit is almost zero.

Author

Sep 14, 2024· 2 reactions

It's not for compression. Currently, CivitAI does not allow the upload of a .gguf file format, but it allows you to upload a .zip. Hence this workaround.

homoludensSep 14, 2024

@nakif0968 I suspected that... sadness. So yet another workaround needed - rename file after extraction. But hash code of GGUF won't match a has of zip on CivitAI.
OK. Sorry for blaming you...

mykeehuSep 13, 2024· 2 reactions

CivitAI

Based on my tests, I got the best images at 768 resolution. Thanks, I really like this model!

WeirdScienceXSep 15, 2024· 1 reaction

CivitAI

Excellent models, thanks for your hard work and sharing

CuboneSep 18, 2024· 4 reactions

CivitAI

Very good but very horny model. I can't generate any women wearing bra, they're always topless.

Sailor_LunaSep 19, 2024

CivitAI

I don't understand what is it means "Q4_0 and Q4_1 should fit in 8 GB VRAM". What about clip models that loads in vram too? For me the speed the same as with regular big models, because its can be only "loaded partially" (the clip model is loaded first and uses all memory). Do I miss something?

JohnDoe176Oct 3, 2024

Download the extra models extension and use the force set clip device to set the vae to cpu this will save you the vram

Sailor_LunaOct 4, 2024

@chrislgolden130 which extension exactly? Is this for ComfyUI. I only found nodes created by city96, its for another models and dont support dual load. Also vae is very small and loads after generation so it doesnt need to be loaded to cpu, because low_vram unload models each time anyway

JohnDoe176Oct 11, 2024· 1 reaction

@waitran Sorry I meant clip not vae. its "comfyui extra models" in comfy manager.

vgadictOct 11, 2024· 5 reactions

CivitAI

To get Q4_1 v2 to work in SD Forge on my RTX 4060 w/ 8 Gig VRAM, I found that I needed to lower SD Forge's GPU Weight setting from the default value (7163?) to a slightly lower value or it would immediately run out of memory. After I got that figured out, it has worked very well. This checkpoint provides the capability for realistic nude figures straight out of the box with no other LoRAs needed. It also seems less finicky and much more responsive to various NSFW prompts than many other base/checkpoint flux models I've tried.

tedbivOct 25, 2024· 1 reaction

CivitAI

DUMB QUESTION:

sorry if this is dumb question. do i need different type of loras with gguf models? when including loras in prompt i get heavily pixeled/unusable images... or am i just lucky. :)

Author

Oct 25, 2024

The same loras should work for GGUF. Tested in ComfyUI, not sure about Forge.

tedbivOct 25, 2024

hmm... tried fluxunchained-dev-q8-0.gguf and get same behavior as with my model... i am using forgewebui.

Author

Oct 25, 2024

@tedbiv Maybe a ForgeUI issue. You might wanna report it on their GitHub Repo

tedbivOct 25, 2024· 1 reaction

here's what i found -

GGUF Flux Models Require LoRA

Based on the provided search results, it appears that GGUF (Generalized GPU Unified Format) Flux models require different LoRAs (Locally-Optimized Rerouting Algorithms) compared to traditional Flux models. Here are some key findings:

Compatibility issues: Some users have reported compatibility issues when trying to use LoRAs designed for traditional Flux models with GGUF Flux models. For example, Issue #57 in the x-flux-comfyui repository mentions that a LoRA designed for Civitai works perfectly on an online service running the FP8 dev version of Flux, but not with the GGUF Flux Dev Q4_0 model.

XLabs LoRAs: The search results suggest that XLabs LoRAs are designed to work with GGUF Flux models, providing better performance and generation times. In contrast, Civitai LoRAs may not be optimized for GGUF models and may lead to slower generation times or compatibility issues.

Quantization: GGUF models are quantized, which means they have been optimized for reduced memory usage and faster inference. This requires different LoRA configurations compared to traditional Flux models, which are typically not quantized.

Node updates: Some users have reported issues with LoRAs not working with GGUF Flux models after updating nodes. For example, Issue #4674 in the ComfyUI repository mentions that updating the node to the latest version resolved issues with LoRAs not working with GGUF models.

MokanshanFeb 23, 2025

The same problem is that every time the image is generated, there is a lot of noise or mosaic, and I haven't even used Lora yet. I tested Q8, Q5, and Q4, but none of them worked. However, other versions of the GUFF model can work well. Although I suspect that there is a problem with VAE decoding, isn't FLUX's VAE only available in the AE version?

freakingpogi3431Nov 17, 2024

CivitAI

Any update on this? I keep on getting consistent error message on ForgeUI no matter the combination I use for the other files.

idamanFeb 2, 2025

CivitAI

anyone dial in a sampler and scheduler for best results or fast results?

Author

Feb 2, 2025

I use Euler + Simple

ziriuss1Feb 7, 2025· 1 reaction

CivitAI

hi any workflow for this? you try teacache o wavespeed for flux?

MokanshanFeb 23, 2025· 2 reactions

CivitAI

The same problem is that every time the image is generated, there is a lot of noise or mosaic, and I haven't even used Lora yet. I tested Q8, Q5, and Q4, but none of them worked. However, other versions of the GUFF model can work well. Although I suspect that there is a problem with VAE decoding, isn't FLUX's VAE only available in the AE version?

Author

Feb 23, 2025

I’ve heard that some people have noise in their gens, but I have not been able to reproduce this issue. And there really shouldn't be any ambiguity regarding FLUX VAE, afaik, there's only one VAE for FLUX, the official Blackforest Lab one

tedbivFeb 23, 2025· 1 reaction

set 'diffusion in low bits' to 'automatic (fp16 lora)'

MokanshanFeb 23, 2025

@pretty_pixels Yes, it seems to be just a problem encountered by a small group of people. I have 3070T 8GB Vram, and I can actually run Flux D FP8, but GUFF is faster. Currently, only the "Flux Unchained" version has encountered a lot of noise/damaged images. If it weren't for VAE, I really wouldn't be able to find the reason. Thank you again for your efforts, and I will continue to follow up on your work

MokanshanFeb 23, 2025

@tedbiv
I will try,but How do we get this in comfyui, i have no idea if there is a node to select those?

tedbivFeb 23, 2025

oh, sorry i don't know. i use forge for images. i only use comfy for videos.

MokanshanFeb 23, 2025

@tedbiv thx anyway :）

obdbdjyApr 12, 2025· 1 reaction

CivitAI

Why is everything I create in a comic style?

supandifoolJun 19, 2025· 1 reaction

CivitAI

There is this new quantization technique called svdquant nunchaku, which is smaller is size and very fast, about 3x times faster. Any plans to release your unchained model in this svdquant format.

ferrrett33Apr 8, 2026

CivitAI

So you create a model for people with low VRAM but it starts at 8GB, so if someone has 6GB we don't exist. Bruh, you're awesome could you buy me a video card? I'm poor like the human species in general.

Author

Apr 8, 2026

I can't tell if you're trolling or not, but I'm gonna give you a straight answer. There is Z-image Turbo and Flux Klein 4B, which would run fine with 6gb cards. Those are newer, better models. This model is very old. Don't use it. There are better options. (And, BTW, I didn't "create" this model; that would be Black Forest Labs).

ferrrett33Apr 9, 2026· 1 reaction

@pretty_pixels Thanks, that's very helpful. Yeah I'm a bit upset because this is like an elite community of gurus throwing an impossible amount of acryonyms and technical concepts around and everything out there is outdated or pure disinfo. Not your fault of course, but I've never seen anything like this in my life. It's like get a PhD to understand everything written on a single page anywhere. Search whatever, it's outdated or part disinfo, you have to be a genius.

Author

Apr 9, 2026

@forum2233726 You can always ask an AI chatbot like Gemini,Grok, or chatgpt to get up to speed. I still use them to learn about new models and tools.

ferrrett33Apr 17, 2026· 1 reaction

@pretty_pixels Ok after 10 days of doing this every day I'm finally starting to get it. Yes the Q4 models do work on 6GB. Image generation can take 3-4 minutes or more on regular models and an older GPU, unless you use the Hyper 8 Lora, which is really good.

Btw, this model is still in the top 30 highest rated Fluxdev models on CivitAI. I heard Nunchaku is the latest thing. Looks like Flux 1 is starting to approach SDXL level of NSFW models but nowhere near there yet without extensive LORAs.

Checkpoint

Flux.1 D

by pretty_pixels

Download (Beta) View on CivitAI

Details

Downloads

3,166

Platform

CivitAI

Platform Status

Available

Created

8/20/2024

Updated

7/27/2026

Deleted

-

Files

ggufFluxUnchained_q51V2.zip

Size:

8.40 GB

SHA256:

fc9cbc723b7abc6741d9afcf0d1946e1d8cdcf77e0bf18dc22d5403cf9047b55

Mirrors

HuggingFace (1 mirrors)

ggufFluxUnchained_q51V2.zip

CivitAI (1 mirrors)

ggufFluxUnchained_q51V2.zip