seamless_communication

Multilingual model library

A suite of AI models enabling more natural communication across languages through speech and text translation

Foundational Models for State-of-the-Art Speech and Text Translation

GitHub

11k stars
142 watching
1k forks
Language: Jupyter Notebook
last commit: 10 days ago
Linked from 2 awesome lists


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
dair-ai/ml-papers-explained An explanation of key concepts and advancements in the field of Machine Learning 7,315
facebookresearch/laser A library for calculating and using multilingual sentence embeddings. 3,599
facebookresearch/fairseq A toolkit for training custom sequence-to-sequence models for various NLP tasks 30,575
mumble-voip/mumble A high-quality, low-latency voice chat software with support for multiple platforms and plugins. 6,428
facebookresearch/mmf A modular framework for building vision and language multimodal research projects using PyTorch. 5,500
tensorspeech/tensorflowtts Real-time speech synthesis using state-of-the-art architectures 3,839
coqui-ai/tts A deep learning toolkit for generating human-like speech from text 35,453
ai-shifu/chatall A platform that enables concurrent interaction with multiple AI chatbots to find the best answers. 15,241
facebookresearch/spiritlm This repository provides an end-to-end language model capable of generating coherent text based on both spoken and written inputs. 777
hahahumble/speechgpt An application that enables users to converse with ChatGPT via speech and text interfaces. 2,746
camb-ai/mars5-tts A deep learning-based speech synthesis model that generates natural-sounding audio with controlled prosody. 2,534
openai-translator/openai-translator A multi-platform translator and text processing tool leveraging ChatGPT API 23,908
huanshere/videolingo An AI-powered tool for automated video subtitle generation and localization 6,608
facebookresearch/metaseq A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. 6,517
qwenlm/qwen This repository provides large language models and chat capabilities based on pre-trained Chinese models. 14,164