[P] Up to 12X faster GPU inference on Bert, T5 and other transformers with OpenAI Triton kernels
Submitted by pommedeterresautee t3_ydqmjp on October 26, 2022 at 6:10 AM in MachineLearning (40 comments, score: 352)
Sylv__ t1_itu2oar wrote on October 26, 2022 at 9:28 AM (score: 6)
Impressive work! Thank you for open-sourcing it.

pommedeterresautee OP t1_itu6k3n wrote on October 26, 2022 at 10:22 AM (score: 5)
Thank you! If you try it, don't hesitate to share your feedback with us.