tvm-vta

Deep Learning Accelerator

A comprehensive hardware design stack for accelerating deep learning models

Open, Modular, Deep Learning Accelerator

GitHub

258 stars

40 watching

73 forks

Language: Scala

Last commit: over 2 years ago

Linked from 1 awesome list

hardware, machine-learning, tensor, tvm, vta

tvm.apache.org/

Backlinks from these awesome lists:

aolofsson/awesome-opensource-hardware

Related projects:

Repository	Description	Stars
nvdla/hw	The NVDLA project provides hardware designs and tools for building deep learning inference accelerators.	1,763
vlang/vtl	A C library providing an n-dimensional tensor data structure and linear algebra routines	148
doonny/pipecnn	A tool for accelerating convolutional neural networks on Field-Programmable Gate Arrays (FPGAs) using OpenCL-based hardware design	1,264
homles11/igcv3	An implementation of an efficient deep neural network architecture	189
eaplatanios/tensorflow_scala	A Scala API for TensorFlow's deep learning functionality	939
vict0rsch/deep_learning	A collection of tutorials and resources on implementing deep learning models using Python libraries such as Keras and Lasagne.	426
jnhwkim/nips-mrn-vqa	This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework.	39
acceleratehs/accelerate-llvm	Compiles Accelerate code to LLVM IR and executes it on CPUs or NVIDIA GPUs	159
google/cfu-playground	A framework for designing and evaluating custom processor instructions to accelerate machine learning tasks on FPGAs.	476
coreylowman/dfdx	A deep learning library for Rust with GPU acceleration and an ergonomic API.	1,754
vlgiitr/dmn-plus	A PyTorch implementation of an improved question answering architecture with dynamic memory networks and attention mechanisms	64
vlfeat/autonn	An API wrapper around MatConvNet that adds automatic differentiation for easy deep learning prototyping and research	89
uber/petastorm	Enables training and evaluation of deep learning models from Apache Parquet datasets in various machine learning frameworks	1,805
intel/intel-extension-for-tensorflow	Enables heterogeneous high-performance computing on Intel CPUs and GPUs for deep learning workloads	323
mit-han-lab/proxylessnas	Direct neural architecture search on target task and hardware for efficient model deployment	1,429