CivArchive
    IT'S ALIVE! | Ultimate IMAGE to VIDEO suite | LTX/Cog/PyramidFlow - v7.0
    NSFW
    Preview 38469341
    Preview 37534722

    no need to buzz me, i'm fine. ty 💗 feedbacks are much more appreciated.
    check older versions for more video examples.
    Credits and acknowledgments at bottom below.
    __________________________________________________________________________________________________

    This ComfyUI workflow offer different ways to make I2V by using 6 Video models:

    • LTX

    • CogVideoX-5B-1.5-I2V

    • Pyramid Flow

    • CogVideoX-Fun-v1.1-InP (2B)

    • CogVideoX-Fun-v1.1-InP (5B)



    The workflow incorporates settings based on my hands-on experience with various video models, settings that may differ from the standard configurations typically associated with each model.

    __________________________________________________________________________________________________


    - - - | IT'S ALIVE | - - -

    This is the result of over a month of work, almost daily.
    I still keep improving it, optimizing and adding useful settings I came across or that i find by experimenting.

    The workflow is designed so that you never have to leave the control room (console);
    everything you need is right there at your fingertips.



    Some extra tweaking commands are accessible via keyboard shortcuts (more info below).

    __________________________________________________________________________________________________



    Different methods are provided and can be selected with a slider:

    • Creative: In this mode, a single image is used as input, and the model generates a video with full creative freedom, determining its own ending.
      *This method works with all Video models avaible in the menu.

    • Sticky: In this mode, a single image is used as input, and the workflow automatically generates an ending frame that closely resembles the input image but with a slightly different perspective or altered facial expression if a person is detected. This new image, which resembles and references the input image, can be fine-tuned using convenient sliders available in the "Tweaking" section. This allows for maintaining the overall composition throughout the video.
      *This method works only with COG models (for now), so make sure to activate it in the console.

    • Zoom: In this mode, a single image is used as input, and the final frame of the video is a zoomed-in version of the input image. The zoom level can be adjusted in the "tweaking" menu. *This method works only with COG models (for now)

    • Img1Img2 (Custom Start/End mode): This mode allows for the use of two or three images as inputs, enabling the model to create transitions between them. It is particularly useful for controlled animations and performs well with 5B COG models. *This method works only with COG models (for now)

    • Img1Img1: This mode uses a single input image as start and end, so focuses more on maintaining the composition of the input image *This method works only with COG models (for now)

    __________________________________________________________________________________________________

    Shortcuts:


    You can navigate through the only sections you need by using 3 keyboard keys:


    1 = CONSOLE

    Press the "1" key to access the console, where all the settings you need are available and easy to access.

    2 = TRAJECTORIES (TORA - COG)

    Press the "2" key to access Tora Trajectories

    This method is exclusive to COG models.

    The only compatible model avaible in this workflow that works with TORA is "5BInP"
    and must be selected as showed below:


    Four combinable trajectories are available .
    You can activate only 1, 2, 3, or all four using a dedicated slider avaible in the console:



    Be aware that each time you change the base resolution or input image you must redo all trajectories.




    3 = TWEAKING

    Press the "3" key to access Tweaking section.
    Here you can adjust the ending frame for STICKY MODE to closely resembles the input image but from a slightly different perspective/angle or a different face expression/head position, ensuring the final frame remains similar or nearly identical to the initial image while incorporating subtle variations.
    Auto Outpainting is avaible on the right, so you can copy the result and paste it back as input in case you need (Outpainting need to be activated in the console)


    __________________________________________________________________________________________________



    Cleaning VRAM when change model is reccomended:



    __________________________________________________________________________________________________

    __________________________________________________________________________________________________


    __________________________________________________________________________________________________

    ### Troubleshooting:
    This error message in console is completly normal:


    ### Minimum Hardware Requirements:

    12GB VRAM for low resolution.

    ### Render Times:

    On 3090 takes from 5 seconds to 2 minutes or more for each videos, depending on models/resolution/steps.

    ### Important COG Suggestions And Examples:

    - To achieve a good, consistent result, then using "First/Last" mode, the two images need to be similar (same location, people, and very close positioning of everything) eg: You can choose to use any images, but you'll get the best results if you use two images that are fairly similar. For example, try pick two screenshots from a random internet video or two 3D characters posed in two different poses. or two similar AI-generated images 🙄

    - Stay around 10-15 steps, go higher for better quality. (it's hit or miss below that, although I’ve gotten some nice ones at 5 steps).

    - For quick tests, use a lower base resolution (like 320). At that resolution it takes around 10 seconds on my 3090.

    - If the results are full of artifacts, switch to "custom prompt only" to avoid auto-prompting and get more stable, consistent animations, by simplify the prompt. Write something simple

    (check manual written in the workflow)

    Simple prompts like "a person posine, blink, camera shake" or words wiggle, earthquake, lens flares, blink, camera shake, handheld camera have already been tested here with great success. Please share your findings!

    - If the video seems too fast for your settings, turn on interpolation in group and raise the "extra interpolation multiplier" or change the video length in the COG settings group according to user manual written right inside the workflow.

    ### Other Considerations:

    I've tested Cog A LOT and changed the values from the standard settings to something I think works better/faster, at least based on my tests.

    Feel free to make your own changes (and if you find better settings, please let us know)!

    no need to buzz me, i'm fine. ty💗 feedbacks are much more appreciated.

    __________________________________________________________________________________________________

    *Please note:

    do not confuse the different Cog models, as there are many, and they differ from one another.

    I suggest taking a look around to understand what the other Cog models are capable of. There is a bit of confusion around,

    here's a spredsheet by Kijay to clarify some misunderstanding
    https://docs.google.com/spreadsheets/d/16eA6mSL8XkTcu9fSWkPSHfRIqyAKJbR1O99xnuGdCKY/edit?gid=0#gid=0

    In general if you are looking for a way to produce videos quickly, with the most dynamic options, resolutions, ratios, first/last, trajectories then I recommend sticking to this workflow or at least using the models I use in this workflow.

    SINCE TORA IS NOW COMPATIBLE WITH THIS XFUN VERSIONS OF COG I ADDED IT IN THE FLOW.

    __________________________________________________________________________________________________

    __________________________________________________________________________________________________

    CHANGELOGS:

    V8.0

    changelog:

    • completly redone the entire workflow + cleanup and better logic.

    • added 3 video models

    • added auto outpainting

    • added liveportrait

    • Creative mode now works as it should, with no images selected as ending

    • holy s**t this took me a week to finish

    __________________________________________________________________________________________________

    V7.0

    changelog:

    • Workflow clean, bug fixes.

    __________________________________________________________________________________________________

    V6.0

    changelog:

    • added 4 Tora trajectories

    • More UI controls

    • Better settings, refinements and tips inlcuded

    to use tora be sure you are using 5B model (there's a switch to change from 2B to 5B)

    then:

    1)deactivate "let's Cog" in groups

    2)load an image, run

    3) setup 4 trajectories (control+click to break the splines in multiple points)

    4) activate "let's Cog" and run (the "extend video" will automatically turn on when you activate "let's cog", deactivate it for now untill i figure how to extend this tora mode)

    __________________________________________________________________________________________________

    V5.0

    changelog:

    • Extra Extend ( ability to load a third image to create a video using 3 images)

    • More UI controls

    • Better settings, refinements and tips inlcuded

    __________________________________________________________________________________________________

    V4.0

    changelog:

    • EXTEND now works for all modes (except first last mode)

    • improved UI

    + lora strenght slider

    + seeds management

    + prompt strenght

    + COG frames amount control

    + model selector 2B/5B

    + negative prompts

    other changes:

    • efficiency improvements

    • completly revisited chain system

    • some minor fixes

    __________________________________________________________________________________________________

    V3.0

    |changelog|:

    improved UI:

    • added lora loader and strenght slider

    • model selector 2B/5B

    • negative prompts

    • some workflow efficiency improvements

    other changes:

    • switched to a faster interpolation method

    • some minor fixes

    • updated user's manual

    __________________________________________________________________________________________________

    V2.0

    |changelog|:

    • more refined workflow

    • more options

    • simple ui

    • added User's Manual

    __________________________________________________________________________________________________
    __________________________________________________________________________________________________


    A special thanks to Kijai , Purz and the team at Rgthree , DreamProject and VideoHelperSuite, who dedicated their time for help me resolve some steps i thought were impossible to solve.
    I encourage you all to visit their pages and support their projects

    __________________________________________________________________________________________________
    __________________________________________________________________________________________________

    📽️H A V E F U N📽️

    Description

    thanks for feedback, all conflicts should be fixed now.
    The workflow is also updated with latest cog nodes.

    FAQ

    Comments (59)

    p0rn_TVOct 30, 2024· 4 reactions
    CivitAI

    oct 2024.

    Still my fav I2V workflow

    VeroleOct 31, 2024· 2 reactions
    CivitAI

    Hi, am I the only one who have this error :
    Cannot execute because a node is missing the class_type property.: Node ID '#209'

    what can i do please ? I think i have pb with mixlab

    LatentDream
    Author
    Nov 1, 2024

    uhm no idea sorry. everything updated?

    VeroleNov 2, 2024

    @LatentDream python is 3.10, is it the reason ? ( i am afraid to update because i have pinokio )

    6066407Nov 4, 2024

    I got this as well, this was the last and only node I couldnt find

    LatentDream
    Author
    Nov 4, 2024

    @bigfishman @kuro59

    i added a new version in the same ZIP file now, is called MINI,
    with less stuff, only essentials.
    Hopefully you'll be more lucky with that.

    6066407Nov 5, 2024

    @LatentDream thanks ill try it out !

    DOSANGNov 1, 2024
    CivitAI

    My computer died because it ran out of memory. For reference, my RAM is 32GB. Why does this happen?

    superuser111Nov 1, 2024

    Yes, for some reason both models are loaded at once, both 5B and 2B at the same time (not sure why). I have 32 ram too and everything works. Set the swap file to 47 gb. If video memory is also lacking, enable "enable sequential cpu offload" in CogVideoModel and enable "Enable vae tilling" in all CogVideo Decode. I also replaced the clip loader node T5 encoder with GGUF Q3. You can also replace the basic models with GGUF, but for some reason they work only in the old version of CogVideo node. After updating it gives an error. And also there is no lora (at least in the old version).

    LatentDream
    Author
    Nov 1, 2024· 1 reaction

    @superuser111 hey brother 🙂 yes low ram/vram should should change T5 and deactivate nodes not used probably. I'm on 64ram and 3090. Focused on Vram and saw it never went above 12 so i wrote the description as is.

    Enable VAE tiling was active till V4, i choose to deactivate cause some tile can cause overlap but again, less RAM/VRAM maybe better adjust this settings accordingly expecialy if generating A LOT of frames.
    About the lora: Thats up to you to load a lora or not. there are really few loras around for COG

    superuser111Nov 1, 2024

    @LatentDream Hi bro! I have 12 gb Vram, and I can tell you for sure, when "Enable vae tilling" is not active, it crashes with an error. You don't notice it because the memory is probably used for a short time. Without "enable sequential cpu offload", it also crashes immediately. About the lora, I get an error when enabling it in v4. I don't understand why, I downloaded the same lora as in workflow. I will try in v5. Thank you for such a cool workflow!

    LatentDream
    Author
    Nov 4, 2024

    @superuser111 thanks! i now added that vae tiling in the mini workflow wich i added in the zip

    Cukrova_VilaNov 1, 2024
    CivitAI

    Hi, i have a error: "'list' object has no attribute 'startswith'" with node nr. 445, can u help me please?

    LatentDream
    Author
    Nov 3, 2024

    no idea. sorry

    LatentDream
    Author
    Nov 4, 2024· 1 reaction

    If you want you can try redownload the ZIP file, wich now include a second version called MINI,
    with less stuff loaded inside, only essentials.
    Maybe you'll have more luck with that

    ZyVNov 2, 2024
    CivitAI

    I can't find the following nodes:
    CogVideoXFunResizeToClosestBucket
    AppendStringsToList

    LatentDream
    Author
    Nov 3, 2024· 1 reaction

    be sure you install missing nodes. ComfyUI-CogVideoXWrapper in particular

    ZyVNov 3, 2024

    @LatentDream reinstalling helped

    petestueveNov 3, 2024
    CivitAI

    What am I missing I keep getting Depthflow

    The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()

    LatentDream
    Author
    Nov 3, 2024

    depthanything2 is used to simulate different angle view of the input images, to make video extend mode works in "sticky mode". is just an option. don't use the "stiky" mode and bypass depth anything group or delete it completly if you are experiencing issues

    petestueveNov 4, 2024

    awesome thank you

    LatentDream
    Author
    Nov 4, 2024

    i added a new version in the same ZIP file now, is called MINI,
    with less stuff, only essentials.
    Hopefully you'll be more lucky with that.

    lewisayers252Nov 6, 2024

    @LatentDream I only see ITS_ALIVE_5_FINAL.json in the zip file for the latest version. I downloaded all versions and none of them had a mini file

    LatentDream
    Author
    Nov 6, 2024

    @lewisayers252 You were unlucky to arrive at the wrong moment when I was replacing the versions. Now you can download it

    LatentDream
    Author
    Nov 9, 2024

    The workflow includes multiple nodes, some of them keep getting updates so the workflow keep break one day or the other.
    I’m doing my best. sorry. installed a clean comfy, updated on the latest version of everything and made some adjustements.

    V7 is up. give it a try.

    LatentDream
    Author
    Nov 9, 2024

    The workflow includes multiple nodes, some of them keep getting updates so the workflow keep break one day or the other.
    I’m doing my best. sorry. installed a clean comfy, updated on the latest version of everything and made some adjustements.

    V7 is up. give it a try.

    unmaticNov 3, 2024
    CivitAI

    Anyone know how to get past the string function node in the math group? Have tried messing with the variables but nothing works. Loaded in as "replace", "no", "5", "4".
    I limited the control panel to 01 lets cog, and 08 Florence caption. But I get the red outline on the string function node. Appreciate any help.

    LatentDream
    Author
    Nov 3, 2024· 1 reaction

    why you need to touch that math panel? thats only some math for switching images for the extend mode

    unmaticNov 3, 2024

    @LatentDream Exactly! that's my question :) I'm hoping to leave it alone. The whole process gets stuck at the math panel. No idea why. Any tips? To be more precise its the "string function" and "text to number" nodes that get the red outline. Be great to see someone open the workflow, and see what they have to do to get a working generation. Currently I make sure all the models in the green box are there (the only difference from the example is that I can't find: "tfxxl_fp8_e4m3fn_scaled", so instead I use: "tfxxl_fp8_e4m3fn" ...could that be an issue?) . Then I load an image, add a prompt and hit queue pompt.

    LatentDream
    Author
    Nov 4, 2024

    you can try redownload the ZIP file, wich now include a second version called MINI,
    with less stuff loaded inside, only essentials. Hopefully you'll be more lucky with that.

    LatentDream
    Author
    Nov 9, 2024

    The workflow includes multiple nodes, some of them keep getting updates since they’re relatively new, like the COG ones, so the workflow break.
    I’m doing my best. sorry.

    I just released a new update, v7. give it a try.

    6066407Nov 4, 2024
    CivitAI

    I tried to run it for 4 hours today but I ran into many problems, am I going about this fundamentally wrong somehow?

    I was able to get it down to just missing one node

    ChatGPTShowText or smth

    lemmeknow, I just deleted it from the json and am trying to see if it works without it, node #209

    LatentDream
    Author
    Nov 4, 2024

    maybe you mean "showTextFotGPT" from Comfy-MixLab-Nodes. You need that in order to mix custom prompt and auto prompt. if you are not interested in auto prompting you can just bypass the entire thing as well as Florence

    LatentDream
    Author
    Nov 4, 2024· 1 reaction

    If you want you can try redownload the ZIP file, wich now include a second version called MINI,
    with less stuff loaded inside, only essentials

    6066407Nov 5, 2024

    @LatentDream Yeah it worked when I removed it so I just went with it, but good to know what it was for

    LatentDream
    Author
    Nov 7, 2024

    @bigfishman had to reinstall everything from skratch and i got the same exact problem. i just swapped that node with "show any" from easy-use pack. it was a node to show text that probably isnt updated from long time and dont work anymore it seems

    6066407Nov 7, 2024

    @LatentDream Ok sounds good!

    _crz_Nov 5, 2024
    CivitAI

    Theres only 1 workflow in the zip and it doesn't contain the tora workflow?

    LatentDream
    Author
    Nov 6, 2024· 1 reaction

    You were unlucky to arrive at the wrong moment when I was replacing the versions. Now you can download it

    _crz_Nov 7, 2024

    @LatentDream
    Damn bad timing. I guess I got bad timing again aswell.
    Lots of errors in mini and full workflows nothing works

    _crz_Nov 7, 2024

    * CogVideoXFunSampler 428:

    - Value 1003046072779429 bigger than max of 2048: height
    For some unknown reason it keeps setting the height to this ridiculous number and erroring

    - Value not in list: scheduler: '0.5' not in ['DPM++', 'Euler', 'Euler A', 'PNDM', 'DDIM', 'CogVideoXDDIM', 'CogVideoXDPMScheduler', 'SASolverScheduler', 'UniPCMultistepScheduler', 'HeunDiscreteScheduler', 'DEISMultistepScheduler', 'LCMScheduler']
    And it keeps trying to set a float as the scheduler value. I cant even open up your older workflows to see what its meant to be because older workflows are glitching aswell now with the same problem

    _crz_Nov 7, 2024

    The positive and negative prompt sockets on CogVideoXWrapper aren't connected to anything, so its throwing an error also

    _crz_Nov 7, 2024

    Here I recorded the problem https://youtu.be/pKpAX_1TIx0.
    Cant complain that I didn't post an image or something this time :P

    And yes everything is updated.

    _crz_Nov 8, 2024

    Any ideas on why the workflows are broken?

    _crz_Nov 8, 2024

    Thought I'd have another go at trying to get your workflows working but it seems like all the tora stuff is connect wrong aswell
    Failed to validate prompt for output 304:

    * CogVideoXFunSampler 428:

    - Return type mismatch between linked nodes: vid2vid_images, TORAFEATURES != IMAGE

    _crz_Nov 8, 2024

    CogVideoXFunSampler

    Error while processing rearrange-reduction pattern "B (T H W) C -> (B T) C H W". Input tensor shape: torch.Size([2, 10752, 1920]). Additional info: {'H': 19, 'W': 28}. Shape mismatch, can't divide axis of length 10752 in chunks of 532

    superuser111Nov 8, 2024· 1 reaction

    @cjanderson88506 Yes, ComfyUI is a complex thing that can just break any workflow after any update. For some reason I have a completely different sampler, no previews in it, and the scheduler in its place. Try to install this workflow on a clean Comfy, nodes conflict very often. I can tell you for sure, everything works here, I only have an error when I turn on Lora, I haven't figured out why, everything else works fine.

    LatentDream
    Author
    Nov 9, 2024· 1 reaction

    The workflow includes multiple nodes, some of them keep getting updates so the workflow keep break one day or the other.
    I’m doing my best. sorry. installed a clean comfy, updated on the latest version of everything and made some adjustements.

    V7 is up. give it a try.

    superuser111Nov 10, 2024

    @cjanderson88506 Also, the problem may be in Python 3.12, people say many nodes do not work correctly on it. Try version 3.11.9, I use it.

    VeroleNov 7, 2024
    CivitAI

    hi thanks for the mini add, i have still an issue with comyui art venture, i didnt see a response that work for me on the net. anyone here has found a solution ? or is it possible to change that node with another ?

    LatentDream
    Author
    Nov 9, 2024

    The workflow includes multiple nodes, some of them keep getting updates since they’re relatively new, like the COG ones, so the workflow break.
    I’m doing my best. sorry.

    I just released a new update, v7. give it a try.

    3343269Nov 8, 2024
    CivitAI

    So weird, getting Allocation on Device error. I have 32G ram. I tried the suggestions in the other comment to no avail.I also sometimes get "The size of tensor a (number) must match the size of tensor b (15451) at non-singleton dimension 1. The whole things halts.

    Additionally, I'm getting a "Failed to validate prompt for output 812:

    * ImageCompositeMasked 810:

    - Required input is missing: source"

    error in console, not sure if that's fatal/related or not.

    LatentDream
    Author
    Nov 9, 2024· 1 reaction

    The workflow includes multiple nodes, some of them keep getting updates since they’re relatively new, like the COG ones, so the workflow break.
    I’m doing my best. sorry.

    I just released a new update, v7. give it a try.

    3343269Nov 9, 2024

    @LatentDream Omg, no need to apologize! Thanks for the response and your work on this. Will test it out next time I'm able!

    lewisayers252Nov 9, 2024

    I always ran out of ram early in the workflow (also 32GB). I changed the 5b node so that it loaded the same 2b model as the 2b node and I was barely able to squeak by with 82% ram usage. I think both the 2b and 5b models are being loaded at the same time. I tried disabling the 5b node but the switch requires it. I am going to try removing the switch and connect the 2b node directly and see if that works. If it does, I think changing from an after load switch to a bypass will resolve these kinds of issues.
    -----update-----
    that worked to control the ram issue. I ran into another problem after that and was (kind of) able to resolve it. Like you mentioned before, the nodes are being updated frequently. CogVideoXFun Sampler threw and error stating that height was an unknown key. I used fix node v2 from the context menu. that changed the fields shown on the node. several of the input parameters, including height, have been moved into the node itself. This disconnected some of the input values, namely, CogVideoXFun Resize T. The workflow completed after this. I wonder if there is a way to lock down the version of a given node so you are not chasing random node updates on your workflow. I think it's great in concept, just a bit tricky to keep up with.

    lewisayers252Nov 9, 2024

    I suppose I should note my system specs for reference. I have an Nvidia 3090 with 24GB of vram, 32GB of system ram and I am running comfy in docker. this keeps things like this from crashing my computer :) The docker container hits 100% about when I am at 85% host ram. So I am not getting the full 32GB, somewhere around 16GB give or take is what the container is getting. now that I have dug into this a bit, I think I need to provision more ram to docker in general.

    LatentDream
    Author
    Nov 9, 2024

    @lewisayers252  yeah you are right, i forgot to enable the tiles, you can do it by yourself meanwhile i release another update. is an option right under the cog module that avoid filling ram/vram too heavily.

    LatentDream
    Author
    Nov 9, 2024· 1 reaction

    @lewisayers252 turn on "enable_vae_tiling" on each Cog video Decode node. i'll set that enabled as default on next update, wich will include live portrait too 😎

    lewisayers252Nov 9, 2024· 1 reaction

    I already had vae tiling enabled so that part was not a hindrance. My biggest problem was the CogVideo Models getting dumped into ram on load. They were both loading even though you only select one. I was able to get a workflow to complete by bypassing the switch and disabling one of the cog model loaders. 2b worked fine but 5b by itself flooded my ram. I am incrementally increasing the ram allocation to docker to try to give it enough room to breathe. I get why the developer of these cog model loaders load into ram before vram. the transfer between ram and vram is almost instant which really speeds up processing. I just wish there was some better config options to control how the models load.

    Workflows
    Other

    Details

    Downloads
    624
    Platform
    CivitAI
    Platform Status
    Available
    Created
    10/30/2024
    Updated
    6/11/2026
    Deleted
    -

    Files

    itsALIVEFastIMAGEToVIDEO_v60Tora.zip

    Mirrors

    itsALIVEUltimateIMAGETo_v70.zip

    Mirrors