onnxruntime-training-examples
Transformer accelerator
Accelerates training of large transformer models by providing optimized kernels and memory optimizations.
Examples for using ONNX Runtime for model training.
314 stars
44 watching
62 forks
Language: C#
last commit: about 1 month ago Related projects:
Repository | Description | Stars |
---|---|---|
microsoft/onnxruntime-inference-examples | Repository providing examples for using ONNX Runtime (ORT) to perform machine learning inferencing. | 1,223 |
microsoft/onnxruntime | A cross-platform, high-performance machine learning accelerator | 14,864 |
microsoft/megatron-deepspeed | Research tool for training large transformer language models at scale | 1,914 |
alrevuelta/connxr | An embedded device-friendly C ONNX runtime with zero dependencies | 195 |
bigscience-workshop/megatron-deepspeed | A collection of tools and scripts for training large transformer language models at scale | 1,339 |
emergentorder/onnx-scala | An API and backend for running ONNX models in Scala 3 using typeful, functional deep learning and classical machine learning. | 138 |
hermanussen/compiletimemethodexecutiongenerator | Allows executing code during compilation to improve runtime performance | 20 |
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,161 |
microsoft/mpnet | Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning. | 288 |
uber/neuropod | A unified interface to run deep learning models from multiple frameworks using C++ and Python. | 936 |
mit-han-lab/data-efficient-gans | Improves GAN training efficiency by incorporating data augmentation | 1,283 |
german-nlp-group/german-transformer-training | Trains German transformer models to improve language understanding | 23 |
fastnlp/cpt | A pre-trained transformer model for natural language understanding and generation tasks in Chinese | 481 |
kraiskil/onnx2c | Generates C code from ONNX files for efficient neural network inference on microcontrollers | 230 |
soumith/imagenet-multigpu.torch | A toolkit for training neural networks on the ImageNet dataset using multiple GPUs. | 402 |