I'm not sure if mods will decide to remove this, as it has nothing to do with image generation. I'm just uploading here to share what I've been working on lately.
I've discovered that you can train musical instruments with so-vits-svc to make them talk and possibly even transform other instruments.
https://github.com/34j/so-vits-svc-fork