ColossalAI

AI model optimizer

Making large AI models cheaper, faster, and more accessible by providing tools and strategies for efficient distributed training and inference.

Making large AI models cheaper, faster and more accessible

GitHub

39k stars
385 watching
4k forks
Language: Python
last commit: 6 days ago
Linked from 8 awesome lists

aibig-modeldata-parallelismdeep-learningdistributed-computingfoundation-modelsheterogeneous-traininghpcinferencelarge-scalemodel-parallelismpipeline-parallelism

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
sjtu-ipads/powerinfer An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs 7,964
microsoft/deepspeed A deep learning optimization library that makes distributed training and inference easy, efficient, and effective. 35,463
flagai-open/flagai An open-source toolkit for training and deploying large-scale AI models on various downstream tasks with multi-modality 3,830
eleutherai/gpt-neox Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. 6,941
mediar-ai/screenpipe An AI-powered screen recording and analysis tool with vision, voice, and machine learning capabilities 8,923
significant-gravitas/autogpt A platform for building and deploying autonomous AI agents to automate complex workflows 168,407
postgresml/postgresml An open-source Postgres extension for machine learning and AI operations directly within the database. 6,033
huawei-noah/efficient-ai-backbones A collection of efficient AI backbone architectures developed by Huawei Noah's Ark Lab. 4,054
plasma-umass/scalene A high-performance Python profiler that analyzes CPU, GPU, and memory usage, providing detailed information and AI-powered optimization suggestions. 12,186
exo-explore/exo Allows developers to run AI models on personal devices with diverse hardware configurations. 14,829
portkey-ai/gateway A fast and reliable AI routing service with built-in guardrails for generating requests to multiple large language models. 6,290
google-research/big_vision Supports large-scale vision model training on GPU machines or Google Cloud TPUs using scalable input pipelines. 2,334
coqui-ai/tts A deep learning toolkit for generating human-like speech from text 35,453
tencent/hunyuandit A PyTorch-based diffusion transformer model for generating images with fine-grained Chinese understanding and text-to-image synthesis 3,456
skypilot-org/skypilot A framework for running AI and batch workloads on any infrastructure, offering unified execution, cost savings, and high GPU availability. 6,801