nccl
GPU communication library
A library of optimized primitives for efficient inter-GPU communication and data transfer.
Optimized primitives for collective multi-GPU communication
3k stars
154 watching
836 forks
Language: C++
last commit: 6 months ago
Linked from 2 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
| Enables C# code to run on the GPU through DirectX and dynamically generated shaders | 2,799 |
| Removes Nvidia's restriction on simultaneous NVENC video encoding sessions | 3,606 |
| A Clojure library that enables parallel computations on GPU using OpenCL | 278 |
| A software toolkit for training and rendering neural graphics primitives | 16,115 |
| A set of libraries and sample code for 3D point cloud processing using CUDA. | 584 |
| A comprehensive C++ library for building GPU-accelerated user interfaces with WYSIWYG editing tools and XML support. | 2,354 |
| Tools for streamlined mixed precision and distributed training in PyTorch | 8,460 |
| A C++17 GPU-accelerated numerical computing library with Python-like syntax | 1,229 |
| A deep learning framework that provides a flexible and expressive Python API for building and training neural networks on various platforms. | 2,729 |
| A collection of examples demonstrating various approaches to programming multiple GPUs in parallel | 575 |
| A suite of libraries implementing machine learning algorithms and mathematical primitives on NVIDIA GPUs | 4,292 |
| A build system designed to optimize monorepos and integrate well with various frameworks and tools for fast CI. | 23,951 |
| A C++ library that optimizes 3D meshes for faster rendering on GPUs. | 5,795 |
| A C++/CUDA framework for training and querying neural networks using GPUs | 3,791 |
| An implementation of a loss function used in sequence data analysis and machine learning | 4,070 |