ColossalAI
AI parallelism toolkit
A toolkit for training and deploying large AI models in parallel on distributed computing infrastructure
Making large AI models cheaper, faster and more accessible
39k stars
384 watching
4k forks
Language: Python
last commit: 2 days ago
Linked from 8 awesome lists
aibig-modeldata-parallelismdeep-learningdistributed-computingfoundation-modelsheterogeneous-traininghpcinferencelarge-scalemodel-parallelismpipeline-parallelism
Related projects:
Repository | Description | Stars |
---|---|---|
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 7,964 |
microsoft/deepspeed | A deep learning optimization library that makes distributed training and inference easy, efficient, and effective. | 35,545 |
flagai-open/flagai | An open-source toolkit for training and deploying large-scale AI models on various downstream tasks with multi-modality | 3,830 |
eleutherai/gpt-neox | Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. | 6,941 |
mediar-ai/screenpipe | An AI-powered screen recording and analysis tool with vision, voice, and machine learning capabilities | 8,923 |
significant-gravitas/autogpt | A platform for building and deploying autonomous AI agents to automate complex workflows | 168,407 |
postgresml/postgresml | An open-source Postgres extension for machine learning and AI operations directly within the database. | 6,033 |
huawei-noah/efficient-ai-backbones | A collection of efficient AI backbone architectures developed by Huawei Noah's Ark Lab. | 4,054 |
plasma-umass/scalene | A high-performance Python profiler that analyzes CPU, GPU, and memory usage, providing detailed information and AI-powered optimization suggestions. | 12,186 |
exo-explore/exo | Allows developers to run AI models on personal devices with diverse hardware configurations. | 14,829 |
portkey-ai/gateway | A fast and reliable AI routing service with built-in guardrails for generating requests to multiple large language models. | 6,290 |
google-research/big_vision | Supports large-scale vision model training on GPU machines or Google Cloud TPUs using scalable input pipelines. | 2,347 |
coqui-ai/tts | A deep learning toolkit for generating human-like speech from text | 35,453 |
tencent/hunyuandit | A PyTorch-based diffusion transformer model for generating images with fine-grained Chinese understanding and text-to-image synthesis | 3,456 |
skypilot-org/skypilot | A framework for running AI and batch workloads on any infrastructure, offering unified execution, cost savings, and high GPU availability. | 6,801 |