Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

GitHub

2k stars
24 watching
343 forks
Language: Python
last commit: 10 days ago
Linked from 1 awesome list


Backlinks from these awesome lists: