kernel_tuner
GPU optimizer
Automates the process of optimizing performance and energy efficiency in GPU applications.
Kernel Tuner
287 stars
10 watching
50 forks
Language: Python
last commit: 9 days ago
Linked from 1 awesome list
auto-tuningautotuningccpluspluscudacuda-kernelsgpugpu-computingkernel-tunermachine-learningopenclopencl-kernelsoptimizationpythonsoftware-developmenttesting
Related projects:
Repository | Description | Stars |
---|---|---|
syne-tune/syne-tune | A tool for large-scale and asynchronous hyperparameter optimization in machine learning | 390 |
jmrichardson/tuneta | Automates optimization of technical indicators for machine learning models in finance | 413 |
gpuopen-tools/radeon_gpu_profiler | A low-level profiling tool for analyzing GPU workloads and optimizing DirectX 12 and Vulkan games on AMD GPUs. | 393 |
arm-software/libgpucounters | A utility library providing access to performance counters on Arm GPUs. | 211 |
liyanghart/hyperparameter-optimization-of-machine-learning-algorithms | Provides tools and techniques for tuning hyperparameters in machine learning models to improve performance. | 1,275 |
hyperopt/hyperopt-sklearn | Automates search for optimal parameters in machine learning algorithms. | 1,588 |
jeremymain/gpuprofiler | A tool that captures system details and resource utilization metrics to help analyze performance and size virtual GPU environments. | 286 |
autonomio/talos | A tool for automating hyperparameter experiments for machine learning models using TensorFlow and Keras | 1,625 |
rib/gputop | A tool for analyzing GPU performance and metrics in real-time, providing graphical and machine-readable data for developers | 160 |
nvidia/tensorflow | An optimized version of TensorFlow to support newer hardware and libraries for NVIDIA GPU users | 996 |
gpuopen-tools/radeon_gpu_analyzer | An offline compiler and code analysis tool for various GPU architectures and programming languages. | 420 |
alan-fgr/cullminator9000 | A high-performance software optimization tool for removing unnecessary geometric data in 3D rendering pipelines | 35 |
nebuly-ai/nos | A module to optimize GPU utilization in Kubernetes clusters through dynamic partitioning and elastic resource management. | 630 |
google/jaxopt | An open-source project providing hardware accelerated, batchable and differentiable optimizers in JAX for deep learning. | 933 |
adacore/cuda | A toolset that compiles Ada and SPARK code to NVIDIA GPUs | 18 |