gpt-neox
An implementation of model-parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
7k stars
122 watching
995 forks
Language: Python
last commit: 6 days ago
Linked from 1 awesome list
deepspeed-library, gpt-3, language-model, transformers