Submitted by fxmarty t3_z1titt in MachineLearning
visarga t1_ixd4ks5 wrote
Does it include Flash Attention?
fxmarty OP t1_ixd5yaq wrote
I believe it does not in PyTorch 1.13. However if you try PyTorch nightlies there is support for FlashAttention and MemoryEfficientAttention. Example notebook: https://colab.research.google.com/drive/1eCDJ4pql8102J_BtGSyjCRJwLp3TTN_h . Digging into the source code of PyTorch we indeed see them.
However, this is only limited to inference for now, but given that there is work from PyTorch's team to include this natively, I would expect to see support for training in the future!
Viewing a single comment thread. View all comments