bitsandbytes

Language model helper

A Python library providing lightweight, hardware-accelerated operations for large language models.

Accessible large language models via k-bit quantization for PyTorch.

GitHub

6k stars
51 watching
630 forks
Language: Python
last commit: 7 days ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
pmorissette/bt A flexible Python framework for building and testing algorithmic trading strategies 2,287
aristocratos/bpytop A resource monitor tool that displays system usage statistics and allows for filtering, sorting, and customization of system processes. 10,176
aristocratos/btop A tool to monitor system resources and display detailed information in a customizable format 21,039
turboderp/exllama A re-implementation of Llama for efficient use with quantized weights on modern GPUs. 2,760
pytorch/botorch A PyTorch-based library for Bayesian optimization, providing a modular interface for composing and optimizing probabilistic models. 3,102
intel/neural-compressor Tools and techniques for optimizing large language models on various frameworks and hardware platforms. 2,226
microsoft/bitblas A library for efficient mixed-precision matrix multiplications on GPUs for deep learning models 420
bytedance/byteps A high-performance distributed deep learning framework supporting multiple frameworks and networks 3,630
lballabio/quantlib A comprehensive C++ library for modeling, trading, and risk management in quantitative finance. 5,392
drakkar-software/octobot-script An open-source Python framework for backtesting trading strategies in cryptocurrencies using machine learning and technical analysis techniques. 20
mit-han-lab/llm-awq A tool for efficient and accurate weight quantization in large language models 2,517
pytorch/pytorch A Python library providing tensors and dynamic neural networks with strong GPU acceleration 83,959
pennylaneai/pennylane A Python library for training quantum computers using programming techniques similar to neural networks 2,355
plasma-umass/scalene A high-performance Python profiler that analyzes CPU, GPU, and memory usage, providing detailed information and AI-powered optimization suggestions. 12,186
rasbt/llms-from-scratch Developing and pretraining a GPT-like Large Language Model from scratch 32,908