Tag: generative ai

  • Lip Synching Head Talking Videos At 4K using Wav2lip and Vqgan

    Lip Synching Head Talking Videos At 4K using Wav2lip and Vqgan

    Hello everyone, in this article I wanted to talk about a paper that is a follow-up of Wav2Lip:: Towards Generating Ultra-High Resolution Talking-Face Videos with Lipsynchronization. This paper makes an improvement of the Wav2Lip model so that it is able to generate lip-synching videos at 4K resolutions. And how does it manage to achieve that? […]

  • PHORHUM: From a 2D photo to 3D animated model by Google

    PHORHUM: From a 2D photo to 3D animated model by Google

    Google has released a paper on a new state-of-the-art machine learning model, called PHORHUM, that is able to create a 3D model from a single 2D photo, with texture disentangled from the lighting source in the photo. There is only one bad news. Google hasn’t yet released source code or a demo for anyone to […]

  • Imagen: Text-To-Image AI From Google

    Imagen: Text-To-Image AI From Google

    Google has taken AI-generated images to a new level with Imagen.  Imagen is a new state-of-the-art text-to-image diffusion model capable of generating highly realistic images given text input. It uses a very powerful language model, T5–XXL a language model with 4.6 billion of parameters trained on a huge text-only dataset.  This new model is not […]

  • Wav2Lip: Create the perfect DeepFake Lip Sync with Wav2Lip and Google Wavenet

    Wav2Lip: Create the perfect DeepFake Lip Sync with Wav2Lip and Google Wavenet

    What if there was a way to automatically translate your videos and communicate with the whole world in over 30 languages? And I am not talking about just subtitles. There are two recent advances in AI that are bringing us very close to that reality: Google Wavenet and Wav2Lip Wav2Lip Is a state of the […]