seamless_communication
Multilingual model library
A suite of AI models enabling more natural communication across languages through speech and text translation
Foundational Models for State-of-the-Art Speech and Text Translation
11k stars
142 watching
1k forks
Language: Jupyter Notebook
last commit: 10 days ago
Linked from 2 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315 |
facebookresearch/laser | A library for calculating and using multilingual sentence embeddings. | 3,599 |
facebookresearch/fairseq | A toolkit for training custom sequence-to-sequence models for various NLP tasks | 30,575 |
mumble-voip/mumble | A high-quality, low-latency voice chat software with support for multiple platforms and plugins. | 6,428 |
facebookresearch/mmf | A modular framework for building vision and language multimodal research projects using PyTorch. | 5,500 |
tensorspeech/tensorflowtts | Real-time speech synthesis using state-of-the-art architectures | 3,839 |
coqui-ai/tts | A deep learning toolkit for generating human-like speech from text | 35,453 |
ai-shifu/chatall | A platform that enables concurrent interaction with multiple AI chatbots to find the best answers. | 15,241 |
facebookresearch/spiritlm | This repository provides an end-to-end language model capable of generating coherent text based on both spoken and written inputs. | 777 |
hahahumble/speechgpt | An application that enables users to converse with ChatGPT via speech and text interfaces. | 2,746 |
camb-ai/mars5-tts | A deep learning-based speech synthesis model that generates natural-sounding audio with controlled prosody. | 2,534 |
openai-translator/openai-translator | A multi-platform translator and text processing tool leveraging ChatGPT API | 23,908 |
huanshere/videolingo | An AI-powered tool for automated video subtitle generation and localization | 6,608 |
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,517 |
qwenlm/qwen | This repository provides large language models and chat capabilities based on pre-trained Chinese models. | 14,164 |