Ulfgardleo t1_ir7lytl wrote
Reply to comment by neanderthal_math in [R] Discovering Faster Matrix Multiplication Algorithms With Reinforcement Learning by EducationalCicada
All Standard unless very large. Atlas is just picking different kernels that "only" change order of operations to maximize CPU utilization.
Red-Portal t1_ir7xeyo wrote
The funny thing is that the lesson of ATLAS and OpenBLAS was that, matrix multiplication optimized to the assembly level by humans is still the best way to squeeze out performance.
Viewing a single comment thread. View all comments