PocketFlow

Model compressor

A framework that automatically compresses and accelerates deep learning models to make them suitable for mobile devices with limited computational resources.

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

GitHub

3k stars
147 watching
490 forks
Language: Python
last commit: almost 2 years ago
Linked from 1 awesome list

automlcomputer-visiondeep-learningmobile-appmodel-compression

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
optimalscale/lmflow A toolkit for fine-tuning and inferring large machine learning models 8,312
tencent/tnn A high-performance neural network inference framework supporting various deep learning frameworks and hardware platforms. 4,435
xiaomi/mace A framework for deep learning inference on mobile devices 4,949
intel/neural-compressor Tools and techniques for optimizing large language models on various frameworks and hardware platforms. 2,257
huggingface/peft An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters 16,699
mlflow/mlflow A platform for managing machine learning projects from inception to deployment 19,021
autumnai/leaf An open machine learning framework for building classical, deep, or hybrid models on various hardware platforms. 5,555
microsoft/deepspeed-mii A Python library designed to accelerate model inference with high-throughput and low latency capabilities 1,924
tencent/ncnn An optimized framework for deploying deep learning models on mobile devices. 20,655
swift-ai/swift-ai A high-performance deep learning library written in Swift for Apple platforms. 6,032
ludwig-ai/ludwig A low-code framework for building custom deep learning models and neural networks 11,236
microsoft/mmdnn A toolset to convert and manage deep learning models across multiple frameworks. 5,802
netflix/metaflow A platform that enables the development, scaling, and deployment of machine learning systems, providing tools for data science projects. 8,341
taki0112/senet-tensorflow A TensorFlow implementation of Squeeze and Excitation Networks for image classification tasks 757
sjtu-ipads/powerinfer An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs 8,011