【WAN2.1】IMG to VIDEO - CivArchive (CivitAI Archive)

【WAN2.1】IMG to VIDEO - 🟡 v1.7

NSFW

✨ WAN2.1 — Image to video — Simple Workflow

A clean, all-in-one WAN image-to-video workflow built entirely with the UmeAiRT Toolkit for ComfyUI.
Only 12 nodes. No spaghetti wires. Just load your model, write your prompt, and hit generate.

⚠️ IMPORTANT — Nodes 2.0 Required

This workflow is built for the Nodes 2.0 (Vue) interface of ComfyUI. If you don't enable it, the workflow may have display problems.

How to activate Nodes 2.0:

Open ComfyUI
Go to Settings (⚙️ icon, bottom-left)
Find "Use Nodes V2 (Vue)" and toggle it ON
Refresh the page
Load the workflow

If you prefer the classic interface, check out my Legacy version of this workflow instead (link).

🎯 Features

Text-to-video generation
Automatic download of models in auto version
Built-in SeedVR2 upscaler — high-quality tiled upscaling (toggleable on/off) Slower than a classic upscaler, but significantly better quality
Full metadata embedding — your images are saved with all generation parameters, ready for online publishing and remixing
3 LoRA slots — with individual on/off toggles and strength control and you can connect as many other lora modules to each other for as many LoRA as you want.

📦 Custom Node Required

Only one custom node to install:

👉 ComfyUI-UmeAiRT-Toolkit

Install via ComfyUI Manager (search "UmeAiRT") or use the UmeAiRT Auto-Installer.
The Toolkit packages everything internally — upscaler, face detailer, metadata saver. No other custom nodes needed.

📂 Files you need (in manual version)

For base version
I2V Model : 480p or 720p
In models/diffusion_models

For GGUF version
I2V Quant Model :
- 720p : Q8, Q5, Q3
- 480p : Q8, Q5, Q3
In models/unet

Common files :
CLIP: umt5_xxl_fp8_e4m3fn_scaled.safetensors
in models/clip
CLIP-VISION: clip_vision_h.safetensors
in models/clip_vision
VAE: wan_2.1_vae.safetensors
in models/vae
Speed LoRA: 480p, 720p
in models/loras

ANY upscale model:

Realistic : RealESRGAN_x4plus.pth
Anime : RealESRGAN_x4plus_anime_6B.pth

in models/upscale_models

Description

New, simplified version.
No new features, just an easier-to-use interface for less experienced users.

FAQ

Comments (27)

pushpiblMar 30, 2025

CivitAI

或许你应该使用强力lora加载器，他更方便

UmeAiRT

Author

Apr 1, 2025

PowerLora 要求放置对 WAN2.1 造成问题的剪辑

thisisarandomaccount2025Mar 30, 2025· 1 reaction

CivitAI

Dumbass me was gonna make a comment complaining the LORA's are not working on the version 1.7

Turns out I forgot the toggle button to turn them on...

90SplashMar 31, 2025

same here :0

CasualGamingMar 31, 2025

CivitAI

whats the difference between gguf model and safetensors regular wan model?

synalon973Mar 31, 2025· 1 reaction

I don't fully understand the technical side, but as far as I can understand it GGUF is better for CPU and GPU acceleration with quantized models. I assume that means Sage Attention works well with it, and it works well with LLM.

Safetensor is safer than ckpt and is faster loading but doesn't have the other benefits that GGUF has as far as I can tell.

There is probably much more to it but that is my low level basic understanding.

CasualGamingApr 1, 2025

@synalon973 Thanks for the explanation!

Goldstar1Mar 31, 2025

CivitAI

the umt5_xxl file should be placed in clip? other guides I've seen says to put it in text_encoders?

beautybotMar 31, 2025

CivitAI

How to fix blurry hands? Tried all settings, all good except blurry hands

synalon973Mar 31, 2025

I use detailed hands, detailed fingers, defined hands, defined fingers in the postive prompts. It helps a bit.

tomazxzas143Mar 31, 2025

CivitAI

first workflow I tried for Wan, and honestly, I'm shocked at how good it is for I2V at least, hunyan shits the bed at that department. THE FUTURE IS HERE BOIS

szluga940Mar 31, 2025

CivitAI

Bro this is great. Could you add workflow with more loras, like 7 or more :) I dont know how to connect this pasta to add more.

krisadaApr 1, 2025· 1 reaction

I believe you can increase number of loras with current workflow by adjusting num_loras (third row of lora module.) New rows for choosing lora and its strength should appear.

UmeAiRT

Author

Apr 1, 2025

@krisada Exactly, it's why i use this lora loader

soap204Mar 31, 2025· 1 reaction

CivitAI

Thanks for all your shares of work flows and tips.

I_XXIVApr 1, 2025

CivitAI

Works great, On a 4070 12GB + 32GB, it took 438 secs to generate a decent 720-ish upscaled clip.

* I2V, 480P, Q4_K_S, sage attention and tea cache on.

* 13 FPS, interpolate to 26 FPS.

* 46 frames, 3.5 secs clip.

* 14 steps, uni_pc.

* Base 405X720 (9:16), Upscale 1.8X, bicubic.

UmeAiRT

Author

Apr 1, 2025

Thanks for your feedback

6226069Apr 1, 2025

CivitAI

Really been enjoying both the complex and simple versions of this workflow. Not sure if it's because I run all of these local rather than using a VM, but your setups have not only been the most consistent to use/run, they have also been simple to setup and even get running in the first place.

That said, I did have a though/idea I wanted to throw at you so you could tell me whether it's possible or not...

Since you have the "save last frame" feature in your complex workflow, would it be possible to setup parameters in which you could say, set a "video count" threshold where depending on how many you were intending to make, you could set a custom iteration loop where the last frame of the previously generated video gets immediately fed into a second run of the workflow?

I hope you'll excuse my ignorance if this is already a thing or can't be done. I have EXTEMELY limited knowledge of how all this works but based on playing around with a couple of things, it felt reasonable to assume that something like this could be possible. Curious to hear your thoughts on it.

Thank you!

UmeAiRT

Author

Apr 1, 2025

I've never implemented a loop of this type before, so I'd have to do some research. For the moment, I've already published another workflow that automatically extends a video, maybe that could help you.

boxertwin75593Apr 1, 2025

CivitAI

Mine errors out loading the Florence 2 prompt gen model, despite it being installed in my LLM folder. Any ideas why? Thanks!

UmeAiRT

Author

Apr 1, 2025· 2 reactions

You have the solution in the patchnote:

Known issue :

Florence dont works with some versions of transformers.

To solve that :

Navigate to your ComfyUI Portable directory (if you are using it)

Run .\python_embeded\python.exe -m pip install transformers==4.49.0 --upgrade

boxertwin75593Apr 1, 2025

@UmeAiRT Thanks for this and apologies for not seeing the patchnote. I tried this but still get the same error. Will keep using V1.6 for now thanks!

boxertwin75593Apr 1, 2025

CivitAI

Does anyone know a good way to minimise the flash at the beginning of the clip? I can't seem to prompt mine out but the sample clips don't seem to have that. Thanks.

funscripter627Apr 1, 2025· 1 reaction

Changing sampler and/or scheduler may help as will lowering your lora strength.

zxxz4531654Apr 1, 2025

CivitAI

Hello, is there any way not to eat up all RAM while upscaling the video? It used about 30+GB of RAM when I tried to create a 512*512, 120 frames video.

UmeAiRT

Author

Apr 1, 2025

For long videos I recommend doing the upscale separately with this workflow: Upscaler simple workflow WAN2.1 - v1.0 | Wan Video Workflows | Civitai

zxxz4531654Apr 2, 2025

@UmeAiRT many thanks!

Workflows

Wan Video

by UmeAiRT

Download (Beta) View on CivitAI

upscale

tool