PocketFlow

Model compressor

A framework that automatically compresses and accelerates deep learning models to make them suitable for mobile devices with limited computational resources.

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

GitHub

3k stars
147 watching
491 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list

automlcomputer-visiondeep-learningmobile-appmodel-compression

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
optimalscale/lmflow A toolkit for finetuning large language models and providing efficient inference capabilities 8,273
tencent/tnn A high-performance neural network inference framework supporting various deep learning frameworks and hardware platforms. 4,415
xiaomi/mace A framework for deep learning inference on mobile devices 4,934
intel/neural-compressor Tools and techniques for optimizing large language models on various frameworks and hardware platforms. 2,226
huggingface/peft An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters 16,437
mlflow/mlflow A platform to manage the entire machine learning lifecycle, from experiment tracking to model deployment. 18,781
autumnai/leaf An open machine learning framework for building classical, deep, or hybrid models on various hardware platforms. 5,558
microsoft/deepspeed-mii A Python library designed to accelerate model inference with high-throughput and low latency capabilities 1,898
tencent/ncnn An optimized framework for deploying deep learning models on mobile devices. 20,479
swift-ai/swift-ai A high-performance deep learning library written in Swift for Apple platforms. 6,029
ludwig-ai/ludwig A low-code framework for building custom deep learning models and neural networks 11,189
microsoft/mmdnn A toolset to convert and manage deep learning models across multiple frameworks. 5,797
netflix/metaflow A platform that enables scientists and engineers to build, deploy, and manage complex data science projects efficiently 8,246
taki0112/senet-tensorflow A TensorFlow implementation of Squeeze and Excitation Networks for image classification tasks 756
sjtu-ipads/powerinfer An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs 7,964