This is what i used before LTX to sync a video
Wan 2.2 animate Wrapper version
This uses wan animate and wan video wrapper to make long form lip synced videos. It does well for what it does. But it has limits and some issues.
Issues
1. Wan wrapper does not let me load GGUF, so scaled fp8 are used.
2. This is not set up to inpaint. I could have sworn inpainting worked but i cant connect a mask set to the mask input without error.
- trying to make a native comfy flow that does inpaint later. Either im going crazy or the wrapper cant inpaint?
3. This cant do complex stuff. This is an talking avitar copier. Dont try to do anything difficult.
4. The smaller you make the video, the worse the ref image copy will be. 480p seems smallest you can go without loosing the subject. Unless its a close up face shot.
Enable the options you want, you dont need pose, face, or a ref image. Just 1 works or any combo of them depending how much of the video you want to copy.
If you use pose it will make a video following the movement
If you use use face video it will make a video copying the face movement
If you use a ref image it will copy the image subject and background and make a video from it.
I actually find it works best with just a face video and ref image used as a start frame with no pose info. Prompting what you want the movement to be.
