Experimented a bit before releasing but this model is the best out there right now easily for motion control. I will require a WAN character lora for the best possible likeness retention but can work without. Inputs are base reference image and video reference for movement
The workflow attached as the free version is the official comfyui template , didn't test it as I kinda hate subgraphs
Models should be the same, example was made with my workflow at medium resolution!
My version with infinite length is already up for subs in the video workflows section
Vae & Text encoders
Loras