Audio-driven-TalkingFace-HeadPose

Talking Face Generator

Generates talking face videos based on audio signals and personalized head poses.

Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)

GitHub

740 stars

25 watching

146 forks

Language: Python

last commit: over 1 year ago

Screenshot of yiranran/Audio-driven-TalkingFace-HeadPose website

ieeexplore.ieee.org/document/9894719

Related projects:

Repository	Description	Stars
mrzzm/hdtf	A project providing a dataset and code for generating talking faces with high-resolution audio-visual data	349
eeskimez/emotalkingface	A system that generates talking faces from images and speech with different emotions.	167
pkhungurn/talking-head-anime-demo	Creates anime characters with realistic head movements from single images or webcam feeds using deep learning and computer vision techniques.	2,001
akanimax/t2f	Generates images of human faces based on textual descriptions using deep learning techniques.	548
zhanglonghao1992/one-shot_free-view_neural_talking_head_synthesis	An implementation of neural talking head synthesis for video conferencing, allowing for one-shot creation of realistic face movements.	807
dmitryulyanov/age	This repository provides code for training Generative Adversarial Networks (GANs) for various image datasets, including face generation.	285
wuhaozhe/style_avatar	Generates stylized talking faces and videos using deep learning models	278
pavitrakumar78/anime-face-gan-keras	A GAN-based system to generate anime faces using a custom dataset	198
chrisdonahue/wavegan	An open-source machine learning algorithm for generating raw audio waveforms from raw data	1,334
a312863063/seeprettyface-ganerator-dongman	A Python implementation of a StyleGAN-based anime face generator	151
lelechen63/atvgnet	This repository provides implementations of neural networks used in cross-modal talking face generation	258
yi-ming-qian/roofgan	A tool for generating realistic roof models using deep learning techniques	42
yitong91/storygan	A framework for generating images that describe stories using deep learning techniques	233
iigroup/mm-celeba-hq-dataset	A large-scale dataset for training and evaluating algorithms for text-driven face generation and understanding tasks.	223
soroushmehr/samplernn_iclr2017	An unconditional end-to-end neural audio generation model utilizing a recurrent neural network architecture.	537