AQLM

Model compressor

An implementation of a method to compress large language models using additive quantization and fine-tuning.

Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression https://arxiv.org/abs/2405.14852

GitHub

1k stars
19 watching
177 forks
Language: Python
last commit: 13 days ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
huggingface/optimum-quanto A PyTorch quantization backend for models. 822
intel/neural-compressor Tools and techniques for optimizing large language models on various frameworks and hardware platforms. 2,226
quant-aq/py-smps A Python library for analyzing and visualizing data from various aerosol sizing instruments 32
opengvlab/omniquant A software framework for accurately quantizing large language models using a novel technique 730
ist-daslab/gptq An implementation of post-training quantization algorithm for transformer models to reduce memory usage and improve inference speed 1,937
locuslab/qpth A solver for optimizing quadratic programs in deep learning and other fields. 685
jvalegre/aqme Automated workflows for quantum mechanical environments using Python. 99
pasqal-io/pyqtorch A PyTorch-based simulator for quantum machine learning 45
kefirski/pytorch_rvae A deep learning implementation of a recurrent variational autoencoder for generating sequential data. 357
oscarhiggott/pymatching A software library for decoding quantum error-correcting codes using a minimum-weight perfect matching algorithm 187
zlatko-minev/pyepr Automated design and analysis of quantum microwave circuits 165
jmtomczak/vae_vpflows A PyTorch implementation of volume-preserving flows for variational autoencoders. 90
karpathy/pytorch-made An implementation of a masked autoencoder density estimation model in PyTorch 539
vprusso/toqito A Python library providing numerical tools for studying quantum information objects 155
adgt/qonduit A Python library providing visualization tools and workflows for quantum computing 13