seamless_communication
Multilingual model library
A suite of AI models enabling more natural communication across languages through speech and text translation
Foundational Models for State-of-the-Art Speech and Text Translation
11k stars
143 watching
1k forks
Language: Jupyter Notebook
Last commit: 2 months ago
Linked from 2 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,352 |
facebookresearch/laser | A library for calculating and using multilingual sentence embeddings. | 3,604 |
facebookresearch/fairseq | A toolkit for training custom sequence-to-sequence models for various NLP tasks | 30,675 |
mumble-voip/mumble | High-quality, low-latency voice chat software with support for multiple platforms and plugins. | 6,484 |
facebookresearch/mmf | A modular framework for building vision and language multimodal research projects using PyTorch. | 5,510 |
tensorspeech/tensorflowtts | Real-time speech synthesis using state-of-the-art architectures | 3,855 |
coqui-ai/tts | A deep learning toolkit for generating human-like speech from text | 36,118 |
ai-shifu/chatall | A platform that enables concurrent interaction with multiple AI chatbots to find the best answers. | 15,332 |
facebookresearch/spiritlm | This repository provides an end-to-end language model capable of generating coherent text based on both spoken and written inputs. | 845 |
hahahumble/speechgpt | An application that enables users to converse with ChatGPT via speech and text interfaces. | 2,752 |
camb-ai/mars5-tts | A deep learning-based speech synthesis model that generates natural-sounding audio with controlled prosody. | 2,551 |
openai-translator/openai-translator | A multi-platform translator and text processing tool leveraging ChatGPT API | 24,004 |
huanshere/videolingo | An all-in-one video translation and localization tool using AI and machine learning to generate subtitles and dubbing for global knowledge sharing. | 8,451 |
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,519 |
qwenlm/qwen | This repository provides large language models and chat capabilities based on pre-trained Chinese models. | 14,797 |