cuml

GPU ML library

A suite of libraries implementing machine learning algorithms and mathematical primitives on NVIDIA GPUs

cuML - RAPIDS Machine Learning Library

4k stars
78 watching
536 forks
Language: C++
Last commit: about 2 months ago
Linked from 3 awesome lists

cuda, gpu, machine-learning, machine-learning-algorithms, nvidia

Related projects:

Repository | Description | Stars
rapidsai/cudf | A GPU-accelerated data manipulation library built on top of C++/CUDA and Apache Arrow. | 8,534
xtra-computing/thundergbm | Accelerates machine learning algorithms on GPUs to improve performance and efficiency | 695
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 8,011
nvlabs/tiny-cuda-nn | A C++/CUDA framework for training and querying neural networks using GPUs | 3,791
plasma-umass/scalene | A high-performance Python profiler that analyzes CPU, GPU, and memory usage, providing detailed information and AI-powered optimization suggestions. | 12,274
postgresml/postgresml | An open-source Postgres extension for machine learning and AI operations directly within the database. | 6,070
paddlepaddle/paddle | A high-performance deep learning framework designed for industrial-scale training and deployment of neural networks. | 22,340
sony/nnabla | A deep learning framework that provides a flexible and expressive Python API for building and training neural networks on various platforms. | 2,729
microsoft/flaml | Automates machine learning workflows and optimizes model performance using large language models and efficient algorithms | 3,968
iterative/cml | Automates machine learning workflows and generates reports on every pull request. | 4,046
nvlabs/instant-ngp | A software toolkit for training and rendering neural graphics primitives | 16,115
fminference/flexllmgen | Generates large language model outputs in high-throughput mode on single GPUs | 9,236
ddbourgin/numpy-ml | A collection of machine learning algorithms implemented in NumPy for rapid experimentation and prototyping. | 15,789
baidu-research/warp-ctc | An implementation of a loss function used in sequence data analysis and machine learning | 4,070
cupy/cupy | A Python library for running NumPy/SciPy code on NVIDIA CUDA or AMD ROCm platforms using GPU acceleration. | 9,586