onnxruntime-training-examples

Transformer accelerator

Accelerates training of large transformer models by providing optimized kernels and memory optimizations.

Examples for using ONNX Runtime for model training.

GitHub

314 stars
44 watching
62 forks
Language: C#
last commit: about 1 month ago

Related projects:

Repository Description Stars
microsoft/onnxruntime-inference-examples Repository providing examples for using ONNX Runtime (ORT) to perform machine learning inferencing. 1,223
microsoft/onnxruntime A cross-platform, high-performance machine learning accelerator 14,864
microsoft/megatron-deepspeed Research tool for training large transformer language models at scale 1,914
alrevuelta/connxr An embedded device-friendly C ONNX runtime with zero dependencies 195
bigscience-workshop/megatron-deepspeed A collection of tools and scripts for training large transformer language models at scale 1,339
emergentorder/onnx-scala An API and backend for running ONNX models in Scala 3 using typeful, functional deep learning and classical machine learning. 138
hermanussen/compiletimemethodexecutiongenerator Allows executing code during compilation to improve runtime performance 20
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,161
microsoft/mpnet Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning. 288
uber/neuropod A unified interface to run deep learning models from multiple frameworks using C++ and Python. 936
mit-han-lab/data-efficient-gans Improves GAN training efficiency by incorporating data augmentation 1,283
german-nlp-group/german-transformer-training Trains German transformer models to improve language understanding 23
fastnlp/cpt A pre-trained transformer model for natural language understanding and generation tasks in Chinese 481
kraiskil/onnx2c Generates C code from ONNX files for efficient neural network inference on microcontrollers 230
soumith/imagenet-multigpu.torch A toolkit for training neural networks on the ImageNet dataset using multiple GPUs. 402