audio2photoreal
Avatar generation
Generating photorealistic avatars from audio
Code and dataset for photorealistic Codec Avatars driven from audio
3k stars
31 watching
254 forks
Language: Python
last commit: 2 months ago Related projects:
Repository | Description | Stars |
---|---|---|
facebookresearch/imagebind | An AI framework that combines data from multiple sources into a single embedding space, enabling various applications such as cross-modal retrieval and generation. | 8,362 |
sebastianstarke/ai4animation | A deep learning framework for data-driven character animation in Unity3D | 7,887 |
zejun-yang/aniportrait | An open-source framework for generating photorealistic animations driven by audio and reference images. | 4,670 |
facebookresearch/dinov2 | A PyTorch implementation of a self-supervised learning method for learning robust visual features without supervision. | 9,274 |
facebookresearch/pytorch3d | A deep learning library for 3D data processing and computer vision research using PyTorch | 8,824 |
pyannote/pyannote-audio | A toolkit for speaker diarization using PyTorch and speech activity detection. | 6,333 |
facebookresearch/sam2 | An open-source software project providing code and tools for running inference with a deep learning model designed for visual segmentation in images and videos. | 12,524 |
facebookresearch/ca_body | A Python implementation of a neural network architecture for image avatar body generation | 47 |
facebookresearch/audiocraft | A deep learning library for generating high-quality audio | 21,018 |
huggingface/lerobot | A platform providing pre-trained models, datasets, and tools for robotics with focus on imitation learning and reinforcement learning. | 7,518 |
facebookresearch/eft | Provides pseudo-GT 3D human pose data and pre-trained models for training 3D pose estimation algorithms | 376 |
nvidia/vid2vid | A PyTorch implementation of a video-to-video translation method for generating photorealistic videos from semantic label maps or other input data. | 8,607 |
tyiannak/pyaudioanalysis | A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data. | 5,885 |
pytorch/audio | A PyTorch module providing tools and functions for audio signal processing | 2,545 |
nvidia/waveglow | Generates high-quality speech from mel-spectrograms using a flow-based network architecture | 2,285 |