Viewing a single comment thread. View all comments

mgostIH t1_ir7euf7 wrote

You can apply it on the top call of your matrix mul and do everything inside the standard way, you still gain the efficiency since these algorithms also work in block matrix form.

−2

Ulfgardleo t1_ir7m5md wrote

Is it? I could not see from the paper whether they assume non-commutative multiplication in their small matrix optimization.

//Edit: they do a 4x4 block matrix, but the gains are less than 5% over the existing Strassen algorithm.

1