bitsandbytes
Language model helper
A Python library providing lightweight, hardware-accelerated operations for large language models.
Accessible large language models via k-bit quantization for PyTorch.
6k stars
51 watching
630 forks
Language: Python
last commit: 7 days ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
pmorissette/bt | A flexible Python framework for building and testing algorithmic trading strategies | 2,287 |
aristocratos/bpytop | A resource monitor tool that displays system usage statistics and allows for filtering, sorting, and customization of system processes. | 10,176 |
aristocratos/btop | A tool to monitor system resources and display detailed information in a customizable format | 21,039 |
turboderp/exllama | A re-implementation of Llama for efficient use with quantized weights on modern GPUs. | 2,760 |
pytorch/botorch | A PyTorch-based library for Bayesian optimization, providing a modular interface for composing and optimizing probabilistic models. | 3,102 |
intel/neural-compressor | Tools and techniques for optimizing large language models on various frameworks and hardware platforms. | 2,226 |
microsoft/bitblas | A library for efficient mixed-precision matrix multiplications on GPUs for deep learning models | 420 |
bytedance/byteps | A high-performance distributed deep learning framework supporting multiple frameworks and networks | 3,630 |
lballabio/quantlib | A comprehensive C++ library for modeling, trading, and risk management in quantitative finance. | 5,392 |
drakkar-software/octobot-script | An open-source Python framework for backtesting trading strategies in cryptocurrencies using machine learning and technical analysis techniques. | 20 |
mit-han-lab/llm-awq | A tool for efficient and accurate weight quantization in large language models | 2,517 |
pytorch/pytorch | A Python library providing tensors and dynamic neural networks with strong GPU acceleration | 83,959 |
pennylaneai/pennylane | A Python library for training quantum computers using programming techniques similar to neural networks | 2,355 |
plasma-umass/scalene | A high-performance Python profiler that analyzes CPU, GPU, and memory usage, providing detailed information and AI-powered optimization suggestions. | 12,186 |
rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch | 32,908 |