Never tried those, so I couldn't say. I guess it would.
Even so, creating all the abstractions needed to implement even regular matrix multiplication in Spiral in a generic fashion took me two months, so I'd consider that a good enough exercise.
You could do it a lot faster by specializing for specific matrix sizes, like in NVIDIA's CUDA samples repo, but then you'd miss the opportunity to do the tensor magic I did in the playlist.
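To make "specializing" concrete, here's a minimal CUDA sketch (illustrative only, not the actual NVIDIA samples code): the matrix size and tile size are baked in as compile-time template parameters, so the compiler can fully unroll the inner loop and drop all bounds checks. That's exactly what a generic implementation has to give up.

```cuda
// Minimal sketch of a size-specialized tiled matmul. All names here are
// hypothetical; the point is that N and TILE are compile-time constants.
#include <cstdio>
#include <cuda_runtime.h>

// Tiled matmul for square matrices whose size N is a multiple of TILE.
// Because both are template parameters, the trip counts are known at
// compile time: no bounds checks, and the inner loop unrolls completely.
template <int N, int TILE>
__global__ void matmul_fixed(const float* A, const float* B, float* C) {
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;
    float acc = 0.0f;

    for (int t = 0; t < N / TILE; ++t) {
        // Stage one TILE x TILE block of A and B into shared memory.
        As[threadIdx.y][threadIdx.x] = A[row * N + t * TILE + threadIdx.x];
        Bs[threadIdx.y][threadIdx.x] = B[(t * TILE + threadIdx.y) * N + col];
        __syncthreads();

        #pragma unroll
        for (int k = 0; k < TILE; ++k)
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();
    }
    C[row * N + col] = acc;
}

int main() {
    constexpr int N = 1024, TILE = 16;
    size_t bytes = N * N * sizeof(float);
    float *A, *B, *C;
    cudaMallocManaged(&A, bytes);
    cudaMallocManaged(&B, bytes);
    cudaMallocManaged(&C, bytes);
    for (int i = 0; i < N * N; ++i) { A[i] = 1.0f; B[i] = 2.0f; }

    dim3 block(TILE, TILE), grid(N / TILE, N / TILE);
    matmul_fixed<N, TILE><<<grid, block>>>(A, B, C);
    cudaDeviceSynchronize();

    // Every element of C should be 2 * N = 2048.
    printf("C[0] = %f (expect %f)\n", C[0], 2.0f * N);
    cudaFree(A); cudaFree(B); cudaFree(C);
    return 0;
}
```

A generic version has to take the sizes at runtime and handle the ragged edges, which is where most of the extra abstraction work goes.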