WAN2.2 | LLM Enhancer | T2I -> I2V | Detailer/Upscale
Features:
Turn any Text-to-Image result into an Image-to-Video clip.
Released versions:
Z-Image (now also includes a QWEN video prompt enhancer)
Flux.Krea
QWEN
SDXL (Initial version)
In terms of speed vs. quality, the latest Z-Image version is my personal favourite. The QWEN video prompter might still need some additional fine-tuning.
Intended use:
Feed a rather simple/short base prompt to an LLM (which generates an extensive prompt) to quickly preview multiple images from any Text-to-Image model (I usually use 4 images).
Select the preferred image and feed it to a dual-pass upscaler, followed by a face detailer.
The final image is passed to the WAN 2.2 processing (dual pass, frame interpolation, and a final upscale).
On an RTX 4070, the total runtime from start to finish is usually around 6 minutes.
(Note that the LLM prompt generation is fully optional and can be disabled using a switch.)
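The stage order described above can be sketched in a few lines of Python. This is only an illustration of the flow, not the actual node graph: the function name `plan_workflow`, its parameters, and all stage labels are hypothetical and do not correspond to real node names in the workflow.

```python
from typing import List


def plan_workflow(base_prompt: str, use_llm: bool = True, preview_count: int = 4) -> List[str]:
    """Return the stage names in the order they would run (hypothetical labels)."""
    if not base_prompt.strip():
        raise ValueError("base_prompt must not be empty")

    stages: List[str] = []

    # The LLM prompt enhancer is optional and sits behind a switch.
    if use_llm:
        stages.append("llm_prompt_enhancer")

    # Generate several preview images, then pick one by hand.
    stages.append(f"text_to_image x{preview_count}")
    stages.append("select_preferred_image")

    # Image refinement before the video step.
    stages.append("dual_pass_upscaler")
    stages.append("face_detailer")

    # WAN 2.2 video processing.
    stages.append("wan22_dual_pass")
    stages.append("frame_interpolation")
    stages.append("final_upscale")
    return stages


if __name__ == "__main__":
    for step in plan_workflow("a lighthouse at dusk", use_llm=False):
        print(step)
```

With `use_llm=False` the plan simply starts at the Text-to-Image stage and the short base prompt is used as-is.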
Description
Z-Image Turbo Text to Image (with LLM prompt enhancer) and upscaling to WAN 2.2 Video