seamless_communication

Multilingual model library

A suite of AI models enabling more natural communication across languages through speech and text translation

Foundational Models for State-of-the-Art Speech and Text Translation

GitHub

11k stars
143 watching
1k forks
Language: Jupyter Notebook
last commit: 2 months ago
Linked from 2 awesome lists


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,352 |
| facebookresearch/laser | A library for calculating and using multilingual sentence embeddings | 3,604 |
| facebookresearch/fairseq | A toolkit for training custom sequence-to-sequence models for various NLP tasks | 30,675 |
| mumble-voip/mumble | A high-quality, low-latency voice chat software with support for multiple platforms and plugins | 6,484 |
| facebookresearch/mmf | A modular framework for building vision and language multimodal research projects using PyTorch | 5,510 |
| tensorspeech/tensorflowtts | Real-time speech synthesis using state-of-the-art architectures | 3,855 |
| coqui-ai/tts | A deep learning toolkit for generating human-like speech from text | 36,118 |
| ai-shifu/chatall | A platform that enables concurrent interaction with multiple AI chatbots to find the best answers | 15,332 |
| facebookresearch/spiritlm | This repository provides an end-to-end language model capable of generating coherent text based on both spoken and written inputs | 845 |
| hahahumble/speechgpt | An application that enables users to converse with ChatGPT via speech and text interfaces | 2,752 |
| camb-ai/mars5-tts | A deep learning-based speech synthesis model that generates natural-sounding audio with controlled prosody | 2,551 |
| openai-translator/openai-translator | A multi-platform translator and text processing tool leveraging ChatGPT API | 24,004 |
| huanshere/videolingo | An all-in-one video translation and localization tool using AI and machine learning to generate subtitles and dubbing for global knowledge sharing | 8,451 |
| facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms | 6,519 |
| qwenlm/qwen | This repository provides large language models and chat capabilities based on pre-trained Chinese models | 14,797 |