Acknowledgements
I have addressed all known bugs within this workflow (excluding those inherent to ComfyUI or specific custom nodes). As such, I am concluding regular updates for this project.
I might still push occasional updates if the mood strikes me, but for now, consider this the final version. Thank you so much for the downloads and support. I hope this workflow helps your creative process - happy generating!
Overview
A simple LTX-2.3 workflow. It uses the Distilled GGUF model for fast generation.
Core Features
Single / Extend Generation Mode
You can choose whether to generate the video once or extend it afterward.
Three prompt input modes
Prompt enhancement using Ollama
Native LTX prompt enhancement
Plain (no enhancement)
If Ollama is not needed, you can disconnect it from the Ollama SubGraph node or simply delete the node.
Optional Features
Preview Switch: Displays a preview during sampling.
Audio-driven: Generates a video that matches an existing audio file.
T2V Switch: Ignores the start image and generates the video using text-to-video.
Full Mode: Generated using Steps 15 and CFG 3.0. This takes a very long time. While the camera work and motion may be slightly improved, the generation time is not practical.
Double-Frame Mode: For use with intense motion. By rendering at twice the frame rate, facial distortion is less likely to occur.
Note
If you encounter the error, please update ComfyUI-KJNodes to the latest version.
If you apply the Sulphur LoRA, a strength of 0.5 is likely the best choice. At 1.0, it tends to introduce noise into the audio, and basic motions such as walking or running are more likely to appear in slow motion.
Description
v1.4.9.4 : Extended video/audio part bug fix. (Audio within the overlap prioritizes the source video, so depending on timing, newly generated audio may be ignored in the overlap.)
v1.4.9 : Change to hybrid generate. (cfg3.0 3step+cfg1.0 5step)
v1.4.6 : Added Double Frame Mode.
v1.4.3 : Added input video mode to the extension source.
FAQ
Comments (2)
[Edited] javawock7618 provided great guidance for the LTX2.3_reasoning_I2V_V3.safetensors file in the comments. Thank you!
The workflow will run even without LTX2.3_reasoning_I2V_V3.safetensors.
This VBVR LoRA is uploaded on Civitai rather than Hugging Face. (Please note that it contains NSFW showcase content.)
Additionally, an official LoRA compatible with ComfyUI is available at the following link:
https://huggingface.co/siraxe/VBVR-LTX2.3-diffsynth_comfyui
This version is recommended.
