Audio-driven-TalkingFace-HeadPose

Talking Face Generator

Generates talking face videos based on audio and personalized head pose estimation

Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (arXiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)

GitHub

735 stars
24 watching
146 forks
Language: Python
Last commit: 11 months ago

Related projects:

Repository | Description | Stars
mrzzm/hdtf | A project providing a dataset and code for generating talking faces with high-resolution audio-visual data | 347
eeskimez/emotalkingface | A system that generates talking faces from images and speech with different emotions | 166
pkhungurn/talking-head-anime-demo | Creates anime characters with realistic head movements from single images or webcam feeds using deep learning and computer vision techniques | 1,999
akanimax/t2f | Generates images of human faces based on textual descriptions using deep learning techniques | 548
zhanglonghao1992/one-shot_free-view_neural_talking_head_synthesis | An implementation of neural talking head synthesis for video conferencing, allowing for one-shot creation of realistic face movements | 797
dmitryulyanov/age | This repository provides code for training Generative Adversarial Networks (GANs) for various image datasets, including face generation | 285
wuhaozhe/style_avatar | Generates stylized talking faces and videos using deep learning models | 278
pavitrakumar78/anime-face-gan-keras | A GAN-based system to generate anime faces using a custom dataset | 198
chrisdonahue/wavegan | An open-source machine learning algorithm for generating raw audio waveforms from raw data | 1,330
a312863063/seeprettyface-ganerator-dongman | A Python implementation of a StyleGAN-based anime face generator | 151
lelechen63/atvgnet | This repository provides implementations of neural networks used in cross-modal talking face generation | 258
yi-ming-qian/roofgan | A tool for generating realistic roof models using deep learning techniques | 41
yitong91/storygan | A framework for generating images that describe stories using deep learning techniques | 233
iigroup/mm-celeba-hq-dataset | A large-scale dataset for training and evaluating algorithms for text-driven face generation and understanding tasks | 220
soroushmehr/samplernn_iclr2017 | An unconditional end-to-end neural audio generation model utilizing a recurrent neural network architecture | 537