HDTF
Talking face generator
A project providing a dataset and code for generating talking faces from high-resolution audio-visual data.
The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset".
349 stars
14 watching
66 forks
Language: Python
Last commit: 10 months ago

Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Generates talking face videos based on audio signals and personalized head poses. | 740 |
| | A system that generates talking faces from images and speech with different emotions. | 167 |
| | Generates images of human faces based on textual descriptions using deep learning techniques. | 548 |
| | This repository provides implementations of neural networks used in cross-modal talking face generation. | 258 |
| | A large-scale dataset for training and evaluating algorithms for text-driven face generation and understanding tasks. | 223 |
| | This repository provides code for training Generative Adversarial Networks (GANs) for various image datasets, including face generation. | 285 |
| | A Python implementation of a StyleGAN-based anime face generator. | 151 |
| | Creates anime characters with realistic head movements from single images or webcam feeds using deep learning and computer vision techniques. | 2,001 |
| | An implementation of neural talking head synthesis for video conferencing, allowing for one-shot creation of realistic face movements. | 807 |
| | Generates stylized talking faces and videos using deep learning models. | 278 |
| | A method for generating realistic portraits with editable hair using deep learning techniques and multi-input conditioning. | 294 |
| | A deep learning model for probabilistic image representation and generation based on Markov Random Fields. | 76 |
| | An end-to-end trained model capable of generating natural language responses integrated with object segmentation masks for interactive visual conversations. | 797 |
| | A video conversation model that generates meaningful conversations about videos using large vision and language models. | 1,246 |
| | An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input. | 1,975 |