flash-attention

Fast and memory-efficient exact attention

GitHub

14k stars
115 watching
1k forks
Language: Python
last commit: 7 days ago
Linked from 1 awesome list


Backlinks from these awesome lists: