I think the FLOPS comparison you’ve presented is not fair: for NVIDIA it is “tensor” FLOPS, not generic float multiplication (whose throughput is about 10 times lower), while for Intel it is generic float multiplication.
So for the i9 the number would be higher if FMA operations were used, no?
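To make the FMA point concrete, here is a minimal sketch in C using AVX2/FMA intrinsics (a toy loop with made-up values, not a real benchmark; assumes compilation with something like gcc -O2 -mfma). The point is just the accounting: each fused multiply-add does a multiply and an add per lane, so peak figures quoted assuming FMA are double the multiply-only number.

```c
#include <immintrin.h>
#include <stdio.h>

int main(void) {
    /* Hypothetical operands, chosen only so the loop isn't optimized away. */
    __m256 a   = _mm256_set1_ps(1.0001f);
    __m256 b   = _mm256_set1_ps(0.9999f);
    __m256 acc = _mm256_setzero_ps();

    const long iters = 100000000L;
    for (long i = 0; i < iters; ++i) {
        /* One _mm256_fmadd_ps = 8 multiplies + 8 adds = 16 FLOPs,
           versus only 8 FLOPs for a bare _mm256_mul_ps. */
        acc = _mm256_fmadd_ps(a, b, acc);
    }

    float out[8];
    _mm256_storeu_ps(out, acc);
    printf("result: %f, FLOPs counted: %ld\n", out[0], iters * 16L);
    return 0;
}
```

Note this single dependency chain on acc measures FMA latency rather than throughput; a real peak-FLOPS benchmark would keep several independent accumulators in flight. The sketch only illustrates why FMA doubles the counted FLOPs.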
It doesn’t make sense. Why is it fair to compare matrix multiplication with generic float operations? It should be either matrix multiplication versus matrix multiplication, or generic float versus generic float.