bitsandbytes
Language model helper
A Python library providing lightweight, hardware-accelerated operations for large language models.
Accessible large language models via k-bit quantization for PyTorch.
6k stars
52 watching
639 forks
Language: Python
last commit: about 1 month ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
pmorissette/bt | A flexible Python framework for building and testing algorithmic trading strategies | 2,316 |
aristocratos/bpytop | A resource monitor tool that displays system usage statistics and allows for filtering, sorting, and customization of system processes. | 10,219 |
aristocratos/btop | A tool to monitor system resources and display detailed information in a customizable format | 21,529 |
turboderp/exllama | A re-implementation of Llama for efficient use with quantized weights on modern GPUs. | 2,783 |
pytorch/botorch | A PyTorch-based library for Bayesian optimization, providing a modular interface for composing and optimizing probabilistic models. | 3,126 |
intel/neural-compressor | Tools and techniques for optimizing large language models on various frameworks and hardware platforms. | 2,257 |
microsoft/bitblas | A library to support efficient mixed-precision matrix multiplications on GPUs for deep learning model deployment | 445 |
bytedance/byteps | A high-performance distributed deep learning framework supporting multiple frameworks and networks | 3,635 |
lballabio/quantlib | A comprehensive C++ library for modeling, trading, and risk management in quantitative finance. | 5,480 |
drakkar-software/octobot-script | An open-source Python framework for backtesting trading strategies in cryptocurrencies using machine learning and technical analysis techniques. | 20 |
mit-han-lab/llm-awq | An open-source software project that enables efficient and accurate low-bit weight quantization for large language models. | 2,593 |
pytorch/pytorch | A Python library providing tensors and dynamic neural networks with strong GPU acceleration | 84,978 |
pennylaneai/pennylane | A Python library for training quantum computers using programming techniques similar to neural networks | 2,409 |
plasma-umass/scalene | A high-performance Python profiler that analyzes CPU, GPU, and memory usage, providing detailed information and AI-powered optimization suggestions. | 12,274 |
rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch | 35,405 |