MoeGoe
VITS toolkit
A software framework for text-to-speech and voice conversion using VITS inference.
Executable file for VITS inference
2k stars
16 watching
249 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
enhuiz/vall-e | An implementation of VALL-E in PyTorch for text-to-speech synthesis | 2,970 |
mozilla/tts | An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,466 |
rvc-boss/gpt-sovits | An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 36,977 |
metavoiceio/metavoice-src | A deep learning model for generating human-like speech | 3,936 |
louisshark/chatgpt_system_prompt | A collection of GPT system prompts and various prompt injection/leaking knowledge to educate developers about writing effective system prompts and creating custom GPTs. | 8,375 |
aigc-audio/audiogpt | An audio processing toolkit that provides pre-trained models and tools for tasks like speech synthesis, music generation, sound detection, and talking head creation. | 10,061 |
neonbjb/tortoise-tts | An open-source text-to-speech system trained with high-quality audio capabilities | 13,373 |
r9y9/deepvoice3_pytorch | An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch | 1,970 |
matlab-deep-learning/wav2vec-2.0 | Enables speech-to-text transcription using a pre-trained neural network model in MATLAB. | 7 |
huggingface/text-generation-inference | A toolkit for deploying and serving Large Language Models (LLMs) for high-performance text generation | 9,456 |
auspicious3000/contentvec | An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings | 471 |
jtsang4/claude-to-chatgpt | A conversion tool from Anthropic's Claude model API to OpenAI Chat API format | 1,269 |
facebookresearch/audio2photoreal | Generating photorealistic avatars from audio | 2,715 |
jwieting/para-nmt-50m | A collection of pre-trained models and code for training paraphrastic sentence embeddings from large machine translation datasets. | 102 |
vchitect/vbench | A benchmark suite for evaluating the performance of video generative models | 643 |