This workflow takes an Image and an audio track as input to generate a video.
Important Notice
Update ComfyUI. A lot of the code has been updated in the last few days.
Include --reserve-vram 1 in your launch option to avoid OOM.
Models to download
Place in models/diffusion_models
https://huggingface.co/Lightricks/LTX-2/resolve/main/ltx-2-19b-distilled-fp8.safetensors
Place in models/text_encoders
Place in models/loras
Description
Changed to FP8 distilled model.
Set resolution at 1920 x 1088.
Changed to Manual Sigmas.
Changed to Native Video Save, to prevent saving 3 different files for final video.
Details
Downloads
857
Platform
CivitAI
Platform Status
Available
Created
1/18/2026
Updated
2/1/2026
Deleted
-
Files
ltx2ImageAudioTo_v30.zip
Mirrors
Huggingface (2 mirrors)
CivitAI (1 mirrors)