HDTF

Talking face generator

A project providing a dataset and code for generating talking faces with high-resolution audio-visual data

the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"

GitHub

349 stars

14 watching

66 forks

Language: Python

last commit: about 2 years ago

Related projects:

Repository	Description	Stars
yiranran/audio-driven-talkingface-headpose	Generates talking face videos based on audio signals and personalized head poses.	740
eeskimez/emotalkingface	A system that generates talking faces from images and speech with different emotions.	167
akanimax/t2f	Generates images of human faces based on textual descriptions using deep learning techniques.	548
lelechen63/atvgnet	This repository provides implementations of neural networks used in cross-modal talking face generation	258
iigroup/mm-celeba-hq-dataset	A large-scale dataset for training and evaluating algorithms for text-driven face generation and understanding tasks.	223
dmitryulyanov/age	This repository provides code for training Generative Adversarial Networks (GANs) for various image datasets, including face generation.	285
a312863063/seeprettyface-ganerator-dongman	A Python implementation of a StyleGAN-based anime face generator	151
pkhungurn/talking-head-anime-demo	Creates anime characters with realistic head movements from single images or webcam feeds using deep learning and computer vision techniques.	2,001
zhanglonghao1992/one-shot_free-view_neural_talking_head_synthesis	An implementation of neural talking head synthesis for video conferencing, allowing for one-shot creation of realistic face movements.	807
wuhaozhe/style_avatar	Generates stylized talking faces and videos using deep learning models	278
tzt101/michigan	A method for generating realistic portraits with editable hair using deep learning techniques and multi-input conditioning.	294
zhirongw/deep-mrf	A deep learning model for probabilistic image representation and generation based on Markov Random Fields	76
mbzuai-oryx/groundinglmm	An end-to-end trained model capable of generating natural language responses integrated with object segmentation masks for interactive visual conversations	797
mbzuai-oryx/video-chatgpt	A video conversation model that generates meaningful conversations about videos using large vision and language models	1,246
archinetai/audio-diffusion-pytorch	An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input	1,975