tvm-vta
Deep Learning Accelerator
A comprehensive hardware design stack for accelerating deep learning models
Open, Modular, Deep Learning Accelerator
258 stars
40 watching
73 forks
Language: Scala
last commit: 10 months ago
Linked from 1 awesome list
hardwaremachine-learningtensortvmvta
Related projects:
Repository | Description | Stars |
---|---|---|
| The NVDLA project provides hardware designs and tools for building deep learning inference accelerators. | 1,763 |
| A C library providing an n-dimensional tensor data structure and linear algebra routines | 148 |
| A tool for accelerating convolutional neural networks on Field-Programmable Gate Arrays (FPGAs) using OpenCL-based hardware design | 1,264 |
| An implementation of an efficient deep neural network architecture | 189 |
| A Scala API for TensorFlow's deep learning functionality | 939 |
| A collection of tutorials and resources on implementing deep learning models using Python libraries such as Keras and Lasagne. | 426 |
| This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework. | 39 |
| Compiles Accelerate code to LLVM IR and executes it on CPUs or NVIDIA GPUs | 159 |
| A framework for designing and evaluating custom processor instructions to accelerate machine learning tasks on FPGAs. | 476 |
| A deep learning library for Rust with GPU acceleration and ergonomic API. | 1,754 |
| A PyTorch implementation of an improved question answering architecture with dynamic memory networks and attention mechanisms | 64 |
| An API wrapper around MatConvNet that adds automatic differentiation for easy deep learning prototyping and research | 89 |
| Enables training and evaluation of deep learning models from Apache Parquet datasets in various machine learning frameworks | 1,805 |
| Enables heterogeneous high-performance computing on Intel CPUs and GPUs for deep learning workloads | 323 |
| Direct neural architecture search on target task and hardware for efficient model deployment | 1,429 |