 PipeCNN
 PipeCNN 
 CNN accelerator
 A tool for accelerating convolutional neural networks on Field-Programmable Gate Arrays (FPGAs) using OpenCL-based hardware design
An OpenCL-based FPGA Accelerator for Convolutional Neural Networks
1k stars
 73 watching
 369 forks
 
Language: C 
last commit: over 3 years ago 
Linked from   1 awesome list  
  altera-opencl-sdkdeep-learningdeep-neural-networksfpgafpga-acceleratorhardwarehlsopencl 
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | An implementation of a CNN-based method for non-blind deconvolution using Wiener filtering as a pre-processing step. | 22 | 
|  | An OpenCL implementation of Caffe, a mainstream DNN framework. | 518 | 
|  | A platform for exploring and evaluating deep learning hardware acceleration using a full-system simulation approach | 828 | 
|  | Fast and scalable neural network inference framework for FPGAs. | 770 | 
|  | A Python implementation of a Convolutional Neural Network from scratch using NumPy for building CNNs from scratch | 577 | 
|  | A framework that accelerates deep neural networks in web browsers using optimized models and GPU acceleration. | 1,978 | 
|  | An implementation of fully convolutional networks in Chainer, a deep learning framework. | 218 | 
|  | An implementation of convolutional neural networks in Matlab, providing GPU-enabled vectorized processing for image recognition and processing tasks. | 136 | 
|  | A .NET library that enables developers to run complex applications on various hardware platforms without writing additional C kernel code. | 177 | 
|  | A Torch implementation of a novel neural network architecture designed to improve the generalization ability of deep image classification models. | 129 | 
|  | An accelerator designed to speed up spiking neural networks by integrating synaptic weights on a single chip. | 132 | 
|  | A comprehensive hardware design stack for accelerating deep learning models | 258 | 
|  | An extension to Torch7's neural network package with experimental modules and optimizations. | 97 | 
|  | A deep learning framework implementation of higher-order graph convolutional architectures and their applications | 403 | 
|  | A CUDA implementation of neural network modules and related GPU operations | 215 |