gemmini

Hardware simulator

A platform for exploring and evaluating deep learning hardware acceleration using a full-system simulation approach

Berkeley's Spatial Array Generator

GitHub

828 stars
31 watching
176 forks
Language: Scala
last commit: 6 days ago
Linked from 1 awesome list

acceleratorasicdnn

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
mil-tokyo/webdnn A framework that accelerates deep neural networks in web browsers using optimized models and GPU acceleration. 1,978
doonny/pipecnn A tool for accelerating convolutional neural networks on Field-Programmable Gate Arrays (FPGAs) using OpenCL-based hardware design 1,264
unagiootoro/ruby-dnn A Ruby-based deep learning library for building and training neural networks 46
ucb-bar/midas Automated framework for converting digital circuit designs into FPGA-accelerated simulators 98
maestro-project/frame A tool for analyzing and optimizing DNN accelerators 31
shigekikarita/grain2 A library that provides an autograd and GPGPU framework for dynamic neural networks in D. 7
yixuan/minidnn A C++ library implementing deep neural networks with good performance and modularity 399
amd/opencl-caffe An OpenCL implementation of Caffe, a mainstream DNN framework. 518
nngen/nngen Generates hardware-specific accelerator designs for neural networks 340
conan7882/googlenet-inception An implementation of a deep neural network architecture for image classification using pre-trained models and fine-tuning on the CIFAR-10 dataset. 285
mofanv/darknetz An application that runs several layers of a Deep Neural Network model in a secure environment for model privacy at the edge 86
mit-han-lab/proxylessnas Direct neural architecture search on target task and hardware for efficient model deployment 1,429
jhkim89/pyramidnet A Torch implementation of a novel neural network architecture designed to improve the generalization ability of deep image classification models. 129
namisan/mt-dnn A PyTorch package implementing multi-task deep neural networks for natural language understanding 2,238
torch/cunn A CUDA implementation of neural network modules and related GPU operations 215