gpt-neox

An implementation of model-parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries.

GitHub

7k stars
122 watching
995 forks
Language: Python
Last commit: 6 days ago
Linked from 1 awesome list

deepspeed-library, gpt-3, language-model, transformers

Backlinks from these awesome lists: