AutoAWQ
Model optimizer
An optimization package for 4-bit quantized models
AutoAWQ implements the AWQ (Activation-aware Weight Quantization) algorithm for 4-bit quantization, with a 2x speedup during inference. Documentation:
2k stars
15 watching
220 forks
Language: Python
Last commit: 11 months ago
Linked from 1 awesome list
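As a rough illustration of what 4-bit weight quantization involves, the sketch below applies plain group-wise round-to-nearest quantization to a list of floats. It is not AutoAWQ's code or API, and it omits the activation-aware per-channel scaling that distinguishes AWQ. The function names `quantize_group` and `dequantize_group` are hypothetical and chosen for this example only.

```python
# Minimal sketch of group-wise 4-bit quantization (round-to-nearest).
# Assumption: this is a generic illustration, NOT the AWQ algorithm itself
# and NOT AutoAWQ's implementation; all names here are hypothetical.

def quantize_group(weights, n_bits=4):
    """Map one group of floats to integer codes in [0, 2**n_bits - 1]."""
    q_max = (1 << n_bits) - 1            # 15 for 4 bits
    w_min, w_max = min(weights), max(weights)
    # Step size between adjacent codes; fall back to 1.0 for constant groups.
    scale = (w_max - w_min) / q_max or 1.0
    codes = [min(q_max, max(0, round((w - w_min) / scale))) for w in weights]
    return codes, scale, w_min


def dequantize_group(codes, scale, w_min):
    """Reconstruct approximate floats from the integer codes."""
    return [c * scale + w_min for c in codes]


if __name__ == "__main__":
    group = [-0.8, -0.1, 0.0, 0.3, 1.2, 0.7, -0.5, 0.05]
    codes, scale, w_min = quantize_group(group)
    restored = dequantize_group(codes, scale, w_min)
    worst = max(abs(a - b) for a, b in zip(group, restored))
    print("codes:", codes)
    print("max reconstruction error:", worst)
```

Each weight is stored as a 4-bit code plus a shared per-group scale and minimum, so the reconstruction error per weight is bounded by about half the step size.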
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A tool for automating proof search and verification in dependent type theory using machine learning and external provers. | 220 |
| | An implementation of a method to compress large language models using additive quantization and fine-tuning. | 1,184 |
| | A Haskell package that enables compile-time conversion of JSON strings to data structures using a custom quasiquoter. | 80 |
| | A Python-based backtesting and live trading package for quantitative traders. | 541 |
| | A software framework for accurately quantizing large language models using a novel technique. | 739 |
| | A Python library providing visualization tools and workflows for quantum computing. | 13 |
| | A lightweight Java wrapper around JQ and Oniguruma libraries for efficient JSON processing. | 82 |
| | Integrates automatic differentiation tools with quantum software packages. | 43 |
| | A comprehensive quantum computing library for programming and simulating quantum systems. | 374 |
| | Software package for camera-assisted motion control and experiment configuration of photonic integrated circuit interrogation platforms. | 6 |
| | A software framework for efficiently quantifying RNA-seq data from sequencing reads. | 663 |
| | A collection of 52 machine learning datasets for simulating quantum systems with noise and controls. | 99 |
| | A Python module simulating the behavior of quantum computers using linear algebra. | 720 |
| | Automated workflow for generating and storing DFT calculations for organic molecules using Python and machine learning. | 88 |
| | A Datasette plugin that provides an SQL function to execute jq expressions against JSON values. | 16 |