onnxruntime-training-examples

Transformer accelerator

Accelerates training of large transformer models by providing optimized kernels and memory optimizations.

Examples for using ONNX Runtime for model training.

GitHub

317 stars
44 watching
61 forks
Language: C#
Last commit: 3 months ago

Related projects:

Repository | Description | Stars
microsoft/onnxruntime-inference-examples | Repository providing examples for using ONNX Runtime (ORT) to perform machine learning inferencing | 1,243
microsoft/onnxruntime | A cross-platform, high-performance machine learning accelerator | 14,990
microsoft/megatron-deepspeed | Research tool for training large transformer language models at scale | 1,926
alrevuelta/connxr | An embedded device-friendly C ONNX runtime with zero dependencies | 196
bigscience-workshop/megatron-deepspeed | A collection of tools and scripts for training large transformer language models at scale | 1,342
emergentorder/onnx-scala | An API and backend for running ONNX models in Scala 3 using typeful, functional deep learning and classical machine learning | 138
hermanussen/compiletimemethodexecutiongenerator | Allows executing code during compilation to improve runtime performance | 20
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture | 2,167
microsoft/mpnet | Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning | 288
uber/neuropod | A unified interface to run deep learning models from multiple frameworks using C++ and Python | 937
mit-han-lab/data-efficient-gans | Improves GAN training efficiency by incorporating data augmentation | 1,286
german-nlp-group/german-transformer-training | Trains German transformer models to improve language understanding | 23
fastnlp/cpt | A pre-trained transformer model for natural language understanding and generation tasks in Chinese | 482
kraiskil/onnx2c | Generates C code from ONNX files for efficient neural network inference on microcontrollers | 234
soumith/imagenet-multigpu.torch | A toolkit for training neural networks on the ImageNet dataset using multiple GPUs | 402