audio2photoreal

Avatar generation

Generating photorealistic avatars from audio

Code and dataset for photorealistic Codec Avatars driven from audio

GitHub

3k stars
31 watching
254 forks
Language: Python
last commit: 2 months ago

Related projects:

Repository Description Stars
facebookresearch/imagebind An AI framework that combines data from multiple sources into a single embedding space, enabling various applications such as cross-modal retrieval and generation. 8,362
sebastianstarke/ai4animation A deep learning framework for data-driven character animation in Unity3D 7,887
zejun-yang/aniportrait An open-source framework for generating photorealistic animations driven by audio and reference images. 4,670
facebookresearch/dinov2 A PyTorch implementation of a self-supervised learning method for learning robust visual features without supervision. 9,274
facebookresearch/pytorch3d A deep learning library for 3D data processing and computer vision research using PyTorch 8,824
pyannote/pyannote-audio A toolkit for speaker diarization using PyTorch and speech activity detection. 6,333
facebookresearch/sam2 An open-source software project providing code and tools for running inference with a deep learning model designed for visual segmentation in images and videos. 12,524
facebookresearch/ca_body A Python implementation of a neural network architecture for image avatar body generation 47
facebookresearch/audiocraft A deep learning library for generating high-quality audio 21,018
huggingface/lerobot A platform providing pre-trained models, datasets, and tools for robotics with focus on imitation learning and reinforcement learning. 7,518
facebookresearch/eft Provides pseudo-GT 3D human pose data and pre-trained models for training 3D pose estimation algorithms 376
nvidia/vid2vid A PyTorch implementation of a video-to-video translation method for generating photorealistic videos from semantic label maps or other input data. 8,607
tyiannak/pyaudioanalysis A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data. 5,885
pytorch/audio A PyTorch module providing tools and functions for audio signal processing 2,545
nvidia/waveglow Generates high-quality speech from mel-spectrograms using a flow-based network architecture 2,285