PocketFlow
Model compressor
A framework that automatically compresses and accelerates deep learning models to make them suitable for mobile devices with limited computational resources.
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
3k stars
147 watching
491 forks
Language: Python
Last commit: over 1 year ago
Linked from 1 awesome list
automl, computer-vision, deep-learning, mobile-app, model-compression
Related projects:
Repository | Description | Stars |
---|---|---|
optimalscale/lmflow | A toolkit for finetuning large language models and providing efficient inference capabilities | 8,273 |
tencent/tnn | A high-performance neural network inference framework supporting various deep learning frameworks and hardware platforms. | 4,415 |
xiaomi/mace | A framework for deep learning inference on mobile devices | 4,934 |
intel/neural-compressor | Tools and techniques for optimizing large language models on various frameworks and hardware platforms. | 2,226 |
huggingface/peft | An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,437 |
mlflow/mlflow | A platform to manage the entire machine learning lifecycle, from experiment tracking to model deployment. | 18,781 |
autumnai/leaf | An open machine learning framework for building classical, deep, or hybrid models on various hardware platforms. | 5,558 |
microsoft/deepspeed-mii | A Python library designed to accelerate model inference with high-throughput and low latency capabilities | 1,898 |
tencent/ncnn | An optimized framework for deploying deep learning models on mobile devices. | 20,479 |
swift-ai/swift-ai | A high-performance deep learning library written in Swift for Apple platforms. | 6,029 |
ludwig-ai/ludwig | A low-code framework for building custom deep learning models and neural networks | 11,189 |
microsoft/mmdnn | A toolset to convert and manage deep learning models across multiple frameworks. | 5,797 |
netflix/metaflow | A platform that enables scientists and engineers to build, deploy, and manage complex data science projects efficiently | 8,246 |
taki0112/senet-tensorflow | A TensorFlow implementation of Squeeze and Excitation Networks for image classification tasks | 756 |
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 7,964 |