cuml

GPU ML library

A suite of libraries implementing machine learning algorithms and mathematical primitives on NVIDIA GPUs

cuML - RAPIDS Machine Learning Library

GitHub

4k stars

78 watching

536 forks

Language: C++

Last commit: 8 months ago

Linked from 3 awesome lists

cuda, gpu, machine-learning, machine-learning-algorithms, nvidia

docs.rapids.ai/api/cuml/stable/

Related projects:

Repository	Description	Stars
rapidsai/cudf	A GPU-accelerated data manipulation library built on top of C++/CUDA and Apache Arrow.	8,534
xtra-computing/thundergbm	Accelerates machine learning algorithms on GPUs to improve performance and efficiency	695
sjtu-ipads/powerinfer	An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs	8,011
nvlabs/tiny-cuda-nn	A C++/CUDA framework for training and querying neural networks using GPUs	3,791
plasma-umass/scalene	A high-performance Python profiler that analyzes CPU, GPU, and memory usage, providing detailed information and AI-powered optimization suggestions.	12,274
postgresml/postgresml	An open-source Postgres extension for machine learning and AI operations directly within the database.	6,070
paddlepaddle/paddle	A high-performance deep learning framework designed for industrial-scale training and deployment of neural networks.	22,340
sony/nnabla	A deep learning framework that provides a flexible and expressive Python API for building and training neural networks on various platforms.	2,729
microsoft/flaml	Automates machine learning workflows and optimizes model performance using large language models and efficient algorithms	3,968
iterative/cml	Automates machine learning workflows and generates reports on every pull request.	4,046
nvlabs/instant-ngp	A software toolkit for training and rendering neural graphics primitives	16,115
fminference/flexllmgen	Generates large language model outputs in high-throughput mode on single GPUs	9,236
ddbourgin/numpy-ml	A collection of machine learning algorithms implemented in NumPy for rapid experimentation and prototyping.	15,789
baidu-research/warp-ctc	An implementation of a loss function used in sequence data analysis and machine learning	4,070
cupy/cupy	A Python library for running NumPy/SciPy code on NVIDIA CUDA or AMD ROCm platforms using GPU acceleration.	9,586