AudioGPT

Audio toolkit

An audio processing toolkit that provides pre-trained models and tools for tasks like speech synthesis, music generation, sound detection, and talking head creation.

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

GitHub

10k stars
135 watching
868 forks
Language: Python
last commit: 7 months ago
Linked from 2 awesome lists

audiogptmusicsoundspeechtalking-head

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
hahahumble/speechgpt An application that enables users to converse with ChatGPT via speech and text interfaces. 2,752
haoheliu/audioldm A Python-based audio generation tool that can produce speech, sound effects, music, and more, using text as input or guided by user description. 2,483
speechbrain/speechbrain A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities. 9,066
nvidia/waveglow Generates high-quality speech from mel-spectrograms using a flow-based network architecture 2,294
facebookresearch/audiocraft A deep learning library for generating high-quality audio 21,134
pyannote/pyannote-audio A toolkit for speaker diarization using PyTorch and speech activity detection. 6,508
archinetai/audio-diffusion-pytorch An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input 1,975
pytorch/audio A PyTorch module providing tools and functions for audio signal processing 2,561
futantan/opengpt A platform for creating and running custom ChatGPT applications with user-created apps and support for API tokens and internationalization. 3,935
waylaidwanderer/node-chatgpt-api Provides client-side access to ChatGPT and Bing AI APIs using Node.js 4,210
cogentapps/chat-with-gpt An open-source ChatGPT app with added features and customization options 2,327
open-mmlab/mmagic A toolkit for building and experimenting with generative AI models for image and video generation, restoration, enhancement, and other tasks. 6,986
williamfzc/chat-gpt-ppt Automates the creation of PowerPoint presentations using ChatGPT as a backend. 909
suno-ai/bark A text-to-audio model that generates realistic speech and other audio 36,433