v6.5 (hotfixes)
(now defunct)
I noticed a couple of bugs when using v6 so I fixed them here, hopefully you didn't experience them in the first place:
- Fixed "resizing video" repeating in the log for every frame
- Fixed padding size not updating to new settings when you process the same video as last time
- I think I fixed preview_video not working for high fps videos? This whole feature needs a rework anyway.
I also improved the masking feature to scale based on the actual size of the mouth, instead of it being increased by a set number of pixels.
This should make your settings a lot more consistent between different clips, different resolutions and if the subject moves closer to or futher or way from the camera.
I'm now working on separating the wav2lip stage and the gfpgan stage in order to satisfy the suggestion of using gfpgan only on the area of the mask #8 - in addition to improving inference speed, I'm hoping this will also greatly reduce the straight lines at the edges of the face caused by wav2lip but we'll see!