AutoAWQ
Model optimizer
An optimization package for 4-bit quantized models
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
2k stars
15 watching
220 forks
Language: Python
last commit: 2 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A tool for automating proof search and verification in dependent type theory using machine learning and external provers. | 220 |
| An implementation of a method to compress large language models using additive quantization and fine-tuning. | 1,184 |
| A Haskell package that enables compile-time conversion of JSON strings to data structures using a custom quasiquoter. | 80 |
| A Python-based backtesting and live trading package for quantitative traders. | 541 |
| A software framework for accurately quantizing large language models using a novel technique | 739 |
| A Python library providing visualization tools and workflows for quantum computing | 13 |
| A lightweight Java wrapper around JQ and Oniguruma libraries for efficient JSON processing | 82 |
| Integrates automatic differentiation tools with quantum software packages. | 43 |
| A comprehensive quantum computing library for programming and simulating quantum systems. | 374 |
| Software package for camera-assisted motion control and experiment configuration of photonic integrated circuit interrogation platforms. | 6 |
| A software framework for efficiently quantifying RNA-seq data from sequencing reads. | 663 |
| A collection of 52 machine learning datasets for simulating quantum systems with noise and controls. | 99 |
| A Python module simulating the behavior of quantum computers using linear algebra | 720 |
| Automated workflow for generating and storing DFT calculations for organic molecules using Python and machine learning. | 88 |
| Provides an SQL function to execute jq expressions against JSON values in a datasette plugin | 16 |