NeMo
Generative AI framework
A scalable generative AI framework for creating and deploying large language models and multimodal models
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
12k stars
209 watching
3k forks
Language: Python
last commit: about 1 month ago
Linked from 4 awesome lists
asrdeeplearninggenerative-ailarge-language-modelsmachine-translationmultimodalneural-networksspeaker-diariazationspeaker-recognitionspeech-synthesisspeech-translationtts
Related projects:
Repository | Description | Stars |
---|---|---|
haotian-liu/llava | A system that uses large language and vision models to generate and process visual instructions | 20,683 |
meta-llama/llama-stack | Provides pre-packaged building blocks for generative AI applications with standardized APIs and service-oriented design. | 5,164 |
neuml/txtai | An all-in-one embeddings database for semantic search, LLM orchestration and language model workflows | 9,709 |
tensorflow/lingvo | A software framework for building sequence models using neural networks in TensorFlow | 2,820 |
salesforce/lavis | A library that provides pre-trained models and frameworks for multimodal vision-language intelligence tasks such as image captioning and visual question answering. | 10,058 |
nvidiagameworks/kaolin | A PyTorch library for accelerating 3D deep learning research with various GPU-optimized operations and tools. | 4,550 |
google-ai-edge/mediapipe | A platform providing pre-built machine learning models and APIs for cross-platform deployment on various devices | 27,962 |
eleutherai/gpt-neox | Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. | 6,997 |
sgl-project/sglang | A fast serving framework for large language models and vision language models. | 6,551 |
nomic-ai/gpt4all | An open-source Python client for running Large Language Models (LLMs) locally on any device. | 71,176 |
ashawkey/stable-dreamfusion | Generates 3D content from text using a combination of neural networks and image synthesis. | 8,351 |
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,352 |
dvlab-research/mgm | An open-source framework for training large language models with vision capabilities. | 3,229 |
nvlabs/instant-ngp | A software toolkit for training and rendering neural graphics primitives | 16,115 |
geekan/metagpt | A framework that enables the creation of software companies through artificial intelligence and collaborative systems | 45,943 |