SPLTECH AI

Tag: generative ai

Lip Synching Head Talking Videos At 4K using Wav2lip and Vqgan

Hello everyone, in this article I wanted to talk about a paper that is a follow-up of Wav2Lip:: Towards Generating Ultra-High Resolution Talking-Face Videos with Lipsynchronization. This paper makes an improvement of the Wav2Lip model so that it is able to generate lip-synching videos at 4K resolutions. And how does it manage to achieve that? […]

March 30, 2023
PHORHUM: From a 2D photo to 3D animated model by Google

Google has released a paper on a new state-of-the-art machine learning model, called PHORHUM, that is able to create a 3D model from a single 2D photo, with texture disentangled from the lighting source in the photo. There is only one bad news. Google hasn’t yet released source code or a demo for anyone to […]

June 22, 2022
Imagen: Text-To-Image AI From Google

Google has taken AI-generated images to a new level with Imagen. Imagen is a new state-of-the-art text-to-image diffusion model capable of generating highly realistic images given text input. It uses a very powerful language model, T5–XXL a language model with 4.6 billion of parameters trained on a huge text-only dataset. This new model is not […]

June 5, 2022
Wav2Lip: Create the perfect DeepFake Lip Sync with Wav2Lip and Google Wavenet

What if there was a way to automatically translate your videos and communicate with the whole world in over 30 languages? And I am not talking about just subtitles. There are two recent advances in AI that are bringing us very close to that reality: Google Wavenet and Wav2Lip Wav2Lip Is a state of the […]

December 12, 2020