PipeCNN
CNN accelerator
A tool for accelerating convolutional neural networks on Field-Programmable Gate Arrays (FPGAs) using OpenCL-based hardware design
An OpenCL-based FPGA Accelerator for Convolutional Neural Networks
1k stars
73 watching
369 forks
Language: C
last commit: almost 3 years ago
Linked from 1 awesome list
altera-opencl-sdkdeep-learningdeep-neural-networksfpgafpga-acceleratorhardwarehlsopencl
Related projects:
Repository | Description | Stars |
---|---|---|
hyeongseokson1/cnn_deconvolution | An implementation of a CNN-based method for non-blind deconvolution using Wiener filtering as a pre-processing step. | 22 |
amd/opencl-caffe | An OpenCL implementation of Caffe, a mainstream DNN framework. | 518 |
ucb-bar/gemmini | A platform for exploring and evaluating deep learning hardware acceleration using a full-system simulation approach | 828 |
xilinx/finn | Fast and scalable neural network inference framework for FPGAs. | 770 |
ahmedfgad/numpycnn | A Python implementation of a Convolutional Neural Network from scratch using NumPy for building CNNs from scratch | 577 |
mil-tokyo/webdnn | A framework that accelerates deep neural networks in web browsers using optimized models and GPU acceleration. | 1,978 |
wkentaro/fcn | An implementation of fully convolutional networks in Chainer, a deep learning framework. | 218 |
jimmy-ren/vcnn_double-bladed | An implementation of convolutional neural networks in Matlab, providing GPU-enabled vectorized processing for image recognition and processing tasks. | 136 |
deepakkumar1984/amplifier.net | A .NET library that enables developers to run complex applications on various hardware platforms without writing additional C kernel code. | 177 |
jhkim89/pyramidnet | A Torch implementation of a novel neural network architecture designed to improve the generalization ability of deep image classification models. | 129 |
sfmth/openspike | An accelerator designed to speed up spiking neural networks by integrating synaptic weights on a single chip. | 132 |
apache/tvm-vta | A comprehensive hardware design stack for accelerating deep learning models | 258 |
clementfarabet/lua---nnx | An extension to Torch7's neural network package with experimental modules and optimizations. | 97 |
benedekrozemberczki/mixhop-and-n-gcn | A deep learning framework implementation of higher-order graph convolutional architectures and their applications | 403 |
torch/cunn | A CUDA implementation of neural network modules and related GPU operations | 215 |