tvm-vta
Deep Learning Accelerator
A comprehensive hardware design stack for accelerating deep learning models
Open, Modular, Deep Learning Accelerator
258 stars
40 watching
73 forks
Language: Scala
last commit: over 1 year ago
Linked from 1 awesome list
hardwaremachine-learningtensortvmvta
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | The NVDLA project provides hardware designs and tools for building deep learning inference accelerators. | 1,763 |
| | A C library providing an n-dimensional tensor data structure and linear algebra routines | 148 |
| | A tool for accelerating convolutional neural networks on Field-Programmable Gate Arrays (FPGAs) using OpenCL-based hardware design | 1,264 |
| | An implementation of an efficient deep neural network architecture | 189 |
| | A Scala API for TensorFlow's deep learning functionality | 939 |
| | A collection of tutorials and resources on implementing deep learning models using Python libraries such as Keras and Lasagne. | 426 |
| | This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework. | 39 |
| | Compiles Accelerate code to LLVM IR and executes it on CPUs or NVIDIA GPUs | 159 |
| | A framework for designing and evaluating custom processor instructions to accelerate machine learning tasks on FPGAs. | 476 |
| | A deep learning library for Rust with GPU acceleration and ergonomic API. | 1,754 |
| | A PyTorch implementation of an improved question answering architecture with dynamic memory networks and attention mechanisms | 64 |
| | An API wrapper around MatConvNet that adds automatic differentiation for easy deep learning prototyping and research | 89 |
| | Enables training and evaluation of deep learning models from Apache Parquet datasets in various machine learning frameworks | 1,805 |
| | Enables heterogeneous high-performance computing on Intel CPUs and GPUs for deep learning workloads | 323 |
| | Direct neural architecture search on target task and hardware for efficient model deployment | 1,429 |