Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
2k stars
24 watching
343 forks
Language: Python
last commit: 10 days ago
Linked from 1 awesome list
Ongoing research training transformer language models at scale, including: BERT & GPT-2