Until I found this:
https://www.youtube.com/@algorithmicsimplicity
Instantly clicked. Both convolution and transformer networks.
EDIT: for the purpose of visualization, I highly recommend following channel: https://www.youtube.com/watch?v=eMXuk97NeSI&t=207s
It nicely explains and shows concepts of stride, features, window size, input to output size relation - in convolutional NN
Until I found this:
https://www.youtube.com/@algorithmicsimplicity
Instantly clicked. Both convolution and transformer networks.
EDIT: for the purpose of visualization, I highly recommend following channel: https://www.youtube.com/watch?v=eMXuk97NeSI&t=207s
It nicely explains and shows concepts of stride, features, window size, input to output size relation - in convolutional NN