PipeCNN
CNN accelerator
A tool for accelerating convolutional neural networks on Field-Programmable Gate Arrays (FPGAs) using OpenCL-based hardware design
An OpenCL-based FPGA Accelerator for Convolutional Neural Networks
1k stars
72 watching
369 forks
Language: C
last commit: almost 3 years ago
Linked from 1 awesome list
altera-opencl-sdkdeep-learningdeep-neural-networksfpgafpga-acceleratorhardwarehlsopencl
Related projects:
Repository | Description | Stars |
---|---|---|
hyeongseokson1/cnn_deconvolution | Improves deconvolution performance using a Convolutional Neural Network | 22 |
amd/opencl-caffe | An OpenCL implementation of Caffe, a mainstream DNN framework. | 517 |
ucb-bar/gemmini | An open-source platform for developing and evaluating deep neural network hardware accelerators | 812 |
xilinx/finn | Fast and scalable neural network inference framework for FPGAs. | 747 |
ahmedfgad/numpycnn | An implementation of a convolutional neural network (CNN) using NumPy for basic classification tasks. | 570 |
mil-tokyo/webdnn | A browser-based framework for accelerating deep neural networks using WebGPU or WebGL2 | 1,979 |
wkentaro/fcn | An implementation of fully convolutional networks in Chainer, a deep learning framework. | 218 |
jimmy-ren/vcnn_double-bladed | A GPU-enabled vectorized implementation of CNNs for computer vision tasks | 136 |
deepakkumar1984/amplifier.net | A .NET library that enables developers to run complex applications on various hardware platforms without writing additional C kernel code. | 175 |
jhkim89/pyramidnet | A Torch implementation of a novel neural network architecture designed to improve the generalization ability of deep image classification models. | 129 |
sfmth/openspike | An accelerator designed to speed up spiking neural networks by integrating synaptic weights on a single chip. | 129 |
apache/tvm-vta | A comprehensive hardware design stack for accelerating deep learning models | 254 |
clementfarabet/lua---nnx | An extension to Torch7's neural network package with experimental modules and optimizations. | 97 |
benedekrozemberczki/mixhop-and-n-gcn | A deep learning framework implementation of higher-order graph convolutional architectures and their applications | 402 |
torch/cunn | A CUDA implementation of neural network modules and related GPU operations | 215 |