Qwen-Image is a powerful image generation foundation model developed by Alibaba Cloud's Qwen team. It supports high-resolution image generation and is especially strong at rendering legible text (including Chinese) within images, making it suitable for a wide range of creative applications. As part of the Qwen family, it integrates naturally with the other Qwen models for multimodal workflows.
Comments (31)
Hopefully, the mods at CivitAI will give QWEN a tag for it soon, so we can filter it as a base model instead of searching it in the "other" models category.
It's supported now 👍🏻
Hi, do you recommend this for an RTX 4070? How many minutes would it take for, say, 896x1152?
My setup is an RTX 3060 (12GB VRAM), Resolution: 896x1328, CFG: 1.5, Steps: 14, Sampler: res_multistep. It takes about 5 minutes per image (the first run is longer).
aigirlfriend555 That's good, I'll try it out. Thanks for the valuable info.
aigirlfriend555 A quick question, if I may: I have a 3070 8GB plus 16GB of shared memory. Can I run it?
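For anyone who wants to try roughly the same settings outside ComfyUI, here is a minimal diffusers sketch. It assumes the official Qwen/Qwen-Image weights on Hugging Face and a diffusers build that ships the Qwen-Image pipeline; argument names such as true_cfg_scale may differ between versions, so treat this as a starting point rather than a drop-in script.

```python
# Rough diffusers equivalent of the settings above (896x1328, CFG ~1.5, 14 steps).
# Assumes the Qwen/Qwen-Image repo and a diffusers build with the Qwen-Image pipeline;
# argument names (e.g. true_cfg_scale) may differ between versions.
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained("Qwen/Qwen-Image", torch_dtype=torch.bfloat16)
pipe.to("cuda")

image = pipe(
    prompt="a cozy reading nook by a rainy window, warm lamplight",
    negative_prompt=" ",
    width=896,
    height=1328,
    num_inference_steps=14,
    true_cfg_scale=1.5,
    generator=torch.Generator(device="cuda").manual_seed(42),
).images[0]
image.save("qwen_image_test.png")
```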
What settings are you using? It just draws a black screen for me.
You have to launch ComfyUI without the "--use-sage-attention" flag in the .bat file.
And without the "--fast" flag, if that hasn't been fixed yet.
cluster1500 thx dude
Base Model type "Qwen" is now supported by CivitAI. Please change it from "Others" so we can filter properly for all the amazing Qwen art 🥰
If I use a 4090, which of Q6_K, Q8_0, and fp8 is better and faster?
I believe it would be Q6 for speed, FP8 for quality, and Q8 for something in between.
For a 4090, fp8 will always be better than GGUF. The GGUF models are for people with low VRAM, since the model is loaded on the CPU rather than the GPU. But that makes image generation much slower, since the CPU is not as good at that kind of calculation. The .safetensors models (fp8, fp16), on the other hand, are loaded on the GPU.
@creatumundo399 Don't make claims about things you don't know. All GGUF models can be loaded on the GPU too.
@zids The GGUF format was created to run models on PCs with less VRAM. Yes, okay, you can load it onto the GPU. But why would you download a GGUF model when you have a 4090, when you can use fp8 or fp16, or even fp32, which run directly on the graphics card, faster and more efficiently? I may not know everything, but I clearly know better than you how to use these models.
fp8 will run blazing fast.
Not sure about Qwen, but in Flux the Q8_0 model is definitely better than fp8. But it's slower than fp8.
@ViktorIltimirov Yeah, because he is wrong on every metric: fp8 loses a LOT of quality compared with the original/base model, whereas a top-end quant at Q8 is in the vast majority of cases very close to, if not indistinguishable from, the original base model. But yes, slower than fp8, of course.
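A quick back-of-the-envelope comparison helps explain the trade-off being argued here. The sketch below assumes roughly 20B parameters for the Qwen-Image diffusion model (consistent with the ~19GB fp8 file mentioned later in the thread) and typical GGUF bits-per-weight figures; exact values depend on the quant recipe, so these are ballpark numbers only.

```python
# Ballpark weight-memory estimates for a ~20B-parameter diffusion model.
# Bits-per-weight values are approximate (GGUF quants store per-block scales),
# so real file sizes will differ somewhat.
PARAMS = 20e9  # assumed parameter count of the Qwen-Image DiT

formats = {
    "fp16/bf16": 16.0,
    "fp8":        8.0,
    "Q8_0":       8.5,   # ~8 bits per weight plus block scales
    "Q6_K":       6.6,   # ~6.56 bits per weight
}

for name, bits in formats.items():
    gib = PARAMS * bits / 8 / 1024**3
    print(f"{name:>9}: ~{gib:5.1f} GiB of weights")
```

On a 24GB card both the fp8 and Q8_0 files fit, so the practical choice is mostly speed (native fp8 kernels) versus fidelity (Q8_0 generally tracks the bf16 original more closely), which matches what the replies above describe.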
How do you fix the fact that Qwen ALWAYS generates almost the same image for a given prompt, even on different seeds?
Yeah, that's its huge con: the model seems to be trained for precision and it avoids creativity.
But since the model seems to be precise, in theory varying the keywords in the prompt compensates for the lack of variety from changing the seed.
But then again, with some prompts it seems to lack training so it may perhaps be a very limited model.
I think the best way to get around this is to use some loras.
Wildcards.
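To unpack the one-word answer above: wildcards let the seed also pick random prompt keywords, so different seeds produce genuinely different prompts. A minimal, extension-agnostic sketch follows (the {a|b|c} syntax and the example phrases are purely illustrative):

```python
# Minimal wildcard expansion: every "{a|b|c}" group in the prompt is replaced
# by one randomly chosen option, seeded so the choice follows the image seed.
import random
import re

def expand_wildcards(prompt: str, seed: int) -> str:
    rng = random.Random(seed)
    return re.sub(
        r"\{([^{}]+)\}",
        lambda m: rng.choice(m.group(1).split("|")).strip(),
        prompt,
    )

template = ("portrait of a {young|elderly|middle-aged} traveler, "
            "{golden hour|overcast|neon night} lighting, {35mm|85mm} lens")

for seed in (1, 2, 3):
    print(seed, "->", expand_wildcards(template, seed))
```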
What text encoder is this checkpoint looking for? I waited nearly 33 minutes for it to generate and it was only at 25%. I'm using an RTX 5070, so I know it shouldn't take that long; loading the checkpoint and removing the VAE node were the only changes I made to the Qwen workflow.
I wonder if you're using bf16? I can only say that the fp8 variant requires qwen_2.5_vl_7b_fp8_scaled. With a 16GB 4060 Ti it takes up to 7 minutes to generate an image; your GPU should be about 15% faster.
@lyagushka420 The 5070 only has 12GB VRAM, and Qwen-Image fp8 needs about 19GB (plus around 2GB for the CLIP/text encoder), so it will be a much worse experience than a 4060 Ti, which has a weaker GPU but much more VRAM.
I used the GGUF Q4_K_S and it takes around 180 seconds to generate a picture with similar texture but much worse Chinese text comprehension.
@lyagushka420 @uncinsane475 Turned out to be user error. I was trying to load it as a checkpoint, but it should have gone in diffusion_models. I ended up going back to the OG fp8 version and I'm able to run 1328x1328 at 50 steps in about 5-7 minutes.
The fp8 model can run with 12GB VRAM. An image can generate in less than a minute on an RTX 3080 when it's combined with the Lightning 8-step LoRA and CFG 1.
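Continuing the earlier diffusers sketch, a Lightning-style run would look roughly like this. The LoRA folder and filename are placeholders for whichever Qwen-Image Lightning 8-step LoRA you downloaded, and load_lora_weights / true_cfg_scale are assumed to behave as in current diffusers; verify against your installed version.

```python
# Sketch: 8-step generation with a Lightning/distillation LoRA at CFG 1.
# The LoRA directory and filename below are hypothetical placeholders;
# substitute the actual Lightning LoRA you downloaded.
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained("Qwen/Qwen-Image", torch_dtype=torch.bfloat16)
pipe.load_lora_weights(
    "path/to/lora_folder",                              # placeholder directory
    weight_name="qwen-image-lightning-8step.safetensors",  # placeholder filename
)
pipe.to("cuda")

image = pipe(
    prompt="studio photo of a ceramic teapot, soft shadows",
    width=1024,
    height=1024,
    num_inference_steps=8,   # the Lightning LoRA targets few-step sampling
    true_cfg_scale=1.0,      # CFG 1 = effectively no classifier-free guidance
    generator=torch.Generator(device="cuda").manual_seed(0),
).images[0]
image.save("qwen_lightning_test.png")
```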
Is it only for ComfyUI, or can it run on old-school A1111?
"Old"
How can I get output with no pubic hair? Every time I type the prompt it gives me pubic hair... Is there any prompt or LoRA that can remove it?