Viewing a single comment thread. View all comments

manOnPavementWaving t1_j4fi6f0 wrote

Mediocre first year IT students can do that. But no way it's writing an efficient flash attention kernel without having seen one before.

3