MoeGoe

VITS toolkit

A software framework for text-to-speech and voice conversion using VITS inference.

Executable file for VITS inference

GitHub

2k stars
16 watching
249 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
enhuiz/vall-e An implementation of VALL-E in PyTorch for text-to-speech synthesis 2,970
mozilla/tts An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. 9,466
rvc-boss/gpt-sovits An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. 36,977
metavoiceio/metavoice-src A deep learning model for generating human-like speech 3,936
louisshark/chatgpt_system_prompt A collection of GPT system prompts and various prompt injection/leaking knowledge to educate developers about writing effective system prompts and creating custom GPTs. 8,375
aigc-audio/audiogpt An audio processing toolkit that provides pre-trained models and tools for tasks like speech synthesis, music generation, sound detection, and talking head creation. 10,061
neonbjb/tortoise-tts An open-source text-to-speech system trained with high-quality audio capabilities 13,373
r9y9/deepvoice3_pytorch An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch 1,970
matlab-deep-learning/wav2vec-2.0 Enables speech-to-text transcription using a pre-trained neural network model in MATLAB. 7
huggingface/text-generation-inference A toolkit for deploying and serving Large Language Models (LLMs) for high-performance text generation 9,456
auspicious3000/contentvec An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings 471
jtsang4/claude-to-chatgpt A conversion tool from Anthropic's Claude model API to OpenAI Chat API format 1,269
facebookresearch/audio2photoreal Generating photorealistic avatars from audio 2,715
jwieting/para-nmt-50m A collection of pre-trained models and code for training paraphrastic sentence embeddings from large machine translation datasets. 102
vchitect/vbench A benchmark suite for evaluating the performance of video generative models 643