gemmini
Hardware simulator
A platform for exploring and evaluating deep learning hardware acceleration using a full-system simulation approach
Berkeley's Spatial Array Generator
828 stars
31 watching
176 forks
Language: Scala
last commit: 6 days ago
Linked from 1 awesome list
acceleratorasicdnn
Related projects:
Repository | Description | Stars |
---|---|---|
mil-tokyo/webdnn | A framework that accelerates deep neural networks in web browsers using optimized models and GPU acceleration. | 1,978 |
doonny/pipecnn | A tool for accelerating convolutional neural networks on Field-Programmable Gate Arrays (FPGAs) using OpenCL-based hardware design | 1,264 |
unagiootoro/ruby-dnn | A Ruby-based deep learning library for building and training neural networks | 46 |
ucb-bar/midas | Automated framework for converting digital circuit designs into FPGA-accelerated simulators | 98 |
maestro-project/frame | A tool for analyzing and optimizing DNN accelerators | 31 |
shigekikarita/grain2 | A library that provides an autograd and GPGPU framework for dynamic neural networks in D. | 7 |
yixuan/minidnn | A C++ library implementing deep neural networks with good performance and modularity | 399 |
amd/opencl-caffe | An OpenCL implementation of Caffe, a mainstream DNN framework. | 518 |
nngen/nngen | Generates hardware-specific accelerator designs for neural networks | 340 |
conan7882/googlenet-inception | An implementation of a deep neural network architecture for image classification using pre-trained models and fine-tuning on the CIFAR-10 dataset. | 285 |
mofanv/darknetz | An application that runs several layers of a Deep Neural Network model in a secure environment for model privacy at the edge | 86 |
mit-han-lab/proxylessnas | Direct neural architecture search on target task and hardware for efficient model deployment | 1,429 |
jhkim89/pyramidnet | A Torch implementation of a novel neural network architecture designed to improve the generalization ability of deep image classification models. | 129 |
namisan/mt-dnn | A PyTorch package implementing multi-task deep neural networks for natural language understanding | 2,238 |
torch/cunn | A CUDA implementation of neural network modules and related GPU operations | 215 |