ColossalAI
AI model optimizer
Making large AI models cheaper, faster, and more accessible by providing tools and strategies for efficient distributed training and inference.
Making large AI models cheaper, faster and more accessible
39k stars
385 watching
4k forks
Language: Python
last commit: 6 days ago
Linked from 8 awesome lists
aibig-modeldata-parallelismdeep-learningdistributed-computingfoundation-modelsheterogeneous-traininghpcinferencelarge-scalemodel-parallelismpipeline-parallelism
Related projects:
Repository | Description | Stars |
---|---|---|
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 7,964 |
microsoft/deepspeed | A deep learning optimization library that makes distributed training and inference easy, efficient, and effective. | 35,463 |
flagai-open/flagai | An open-source toolkit for training and deploying large-scale AI models on various downstream tasks with multi-modality | 3,830 |
eleutherai/gpt-neox | Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. | 6,941 |
mediar-ai/screenpipe | An AI-powered screen recording and analysis tool with vision, voice, and machine learning capabilities | 8,923 |
significant-gravitas/autogpt | A platform for building and deploying autonomous AI agents to automate complex workflows | 168,407 |
postgresml/postgresml | An open-source Postgres extension for machine learning and AI operations directly within the database. | 6,033 |
huawei-noah/efficient-ai-backbones | A collection of efficient AI backbone architectures developed by Huawei Noah's Ark Lab. | 4,054 |
plasma-umass/scalene | A high-performance Python profiler that analyzes CPU, GPU, and memory usage, providing detailed information and AI-powered optimization suggestions. | 12,186 |
exo-explore/exo | Allows developers to run AI models on personal devices with diverse hardware configurations. | 14,829 |
portkey-ai/gateway | A fast and reliable AI routing service with built-in guardrails for generating requests to multiple large language models. | 6,290 |
google-research/big_vision | Supports large-scale vision model training on GPU machines or Google Cloud TPUs using scalable input pipelines. | 2,334 |
coqui-ai/tts | A deep learning toolkit for generating human-like speech from text | 35,453 |
tencent/hunyuandit | A PyTorch-based diffusion transformer model for generating images with fine-grained Chinese understanding and text-to-image synthesis | 3,456 |
skypilot-org/skypilot | A framework for running AI and batch workloads on any infrastructure, offering unified execution, cost savings, and high GPU availability. | 6,801 |