HDTF

Talking face generator

A project providing a dataset and code for generating talking faces with high-resolution audio-visual data

the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"

GitHub

347 stars
14 watching
66 forks
Language: Python
last commit: 6 months ago

Related projects:

Repository Description Stars
yiranran/audio-driven-talkingface-headpose Generates talking face videos based on audio and personalized head pose estimation 735
eeskimez/emotalkingface A system that generates talking faces from images and speech with different emotions. 166
akanimax/t2f Generates images of human faces based on textual descriptions using deep learning techniques. 548
lelechen63/atvgnet This repository provides implementations of neural networks used in cross-modal talking face generation 258
iigroup/mm-celeba-hq-dataset A large-scale dataset for training and evaluating algorithms for text-driven face generation and understanding tasks. 220
dmitryulyanov/age This repository provides code for training Generative Adversarial Networks (GANs) for various image datasets, including face generation. 285
a312863063/seeprettyface-ganerator-dongman A Python implementation of a StyleGAN-based anime face generator 151
pkhungurn/talking-head-anime-demo Creates anime characters with realistic head movements from single images or webcam feeds using deep learning and computer vision techniques. 1,999
zhanglonghao1992/one-shot_free-view_neural_talking_head_synthesis An implementation of neural talking head synthesis for video conferencing, allowing for one-shot creation of realistic face movements. 797
wuhaozhe/style_avatar Generates stylized talking faces and videos using deep learning models 278
tzt101/michigan A method for generating realistic portraits with editable hair using deep learning techniques and multi-input conditioning. 294
zhirongw/deep-mrf A deep learning model for probabilistic image representation and generation based on Markov Random Fields 76
mbzuai-oryx/groundinglmm An end-to-end trained model capable of generating natural language responses integrated with object segmentation masks. 781
mbzuai-oryx/video-chatgpt A video conversation model that generates meaningful conversations about videos using large vision and language models 1,213
archinetai/audio-diffusion-pytorch An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input 1,961