HDTF
Talking face generator
A project providing a dataset and code for generating talking faces with high-resolution audio-visual data
the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
347 stars
14 watching
66 forks
Language: Python
last commit: 6 months ago Related projects:
Repository | Description | Stars |
---|---|---|
yiranran/audio-driven-talkingface-headpose | Generates talking face videos based on audio and personalized head pose estimation | 735 |
eeskimez/emotalkingface | A system that generates talking faces from images and speech with different emotions. | 166 |
akanimax/t2f | Generates images of human faces based on textual descriptions using deep learning techniques. | 548 |
lelechen63/atvgnet | This repository provides implementations of neural networks used in cross-modal talking face generation | 258 |
iigroup/mm-celeba-hq-dataset | A large-scale dataset for training and evaluating algorithms for text-driven face generation and understanding tasks. | 220 |
dmitryulyanov/age | This repository provides code for training Generative Adversarial Networks (GANs) for various image datasets, including face generation. | 285 |
a312863063/seeprettyface-ganerator-dongman | A Python implementation of a StyleGAN-based anime face generator | 151 |
pkhungurn/talking-head-anime-demo | Creates anime characters with realistic head movements from single images or webcam feeds using deep learning and computer vision techniques. | 1,999 |
zhanglonghao1992/one-shot_free-view_neural_talking_head_synthesis | An implementation of neural talking head synthesis for video conferencing, allowing for one-shot creation of realistic face movements. | 797 |
wuhaozhe/style_avatar | Generates stylized talking faces and videos using deep learning models | 278 |
tzt101/michigan | A method for generating realistic portraits with editable hair using deep learning techniques and multi-input conditioning. | 294 |
zhirongw/deep-mrf | A deep learning model for probabilistic image representation and generation based on Markov Random Fields | 76 |
mbzuai-oryx/groundinglmm | An end-to-end trained model capable of generating natural language responses integrated with object segmentation masks. | 781 |
mbzuai-oryx/video-chatgpt | A video conversation model that generates meaningful conversations about videos using large vision and language models | 1,213 |
archinetai/audio-diffusion-pytorch | An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input | 1,961 |