Can someone a lot smarter than me give a basic explanation as to why something like this can run at a respectable speed on a CPU, whereas Stable Diffusion is next to useless there? (That is to say, 10-100x slower, whereas I have not seen GPU-based LLaMA go 10-100x faster than the demo here.) I had assumed similar algorithms were at play.
Quantization is the answer here. Running the large models on a CPU at 16-bit precision (which in practice means 32-bit, since most CPUs lack native FP16 support) would be really slow: inference is dominated by memory bandwidth, so shrinking the weights to 4-8 bits cuts both the bytes moved and the memory footprint by 4-8x.
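To make the idea concrete, here is a minimal NumPy sketch of symmetric per-tensor int8 quantization. Real CPU inference stacks (e.g. llama.cpp's GGML formats) use more elaborate schemes such as 4-bit weights with per-block scales, but the principle is the same: store a small integer plus a scale, and reconstruct an approximate float on the fly.

```python
import numpy as np

def quantize_int8(w: np.ndarray):
    """Map float32 weights to int8 plus one scale factor for the whole tensor."""
    scale = np.abs(w).max() / 127.0          # largest weight maps to +/-127
    q = np.round(w / scale).astype(np.int8)  # integer codes, 1 byte each
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float32 weights from the integer codes."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal(4096).astype(np.float32)

q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

print(w.nbytes)   # 16384 bytes as float32
print(q.nbytes)   # 4096 bytes as int8: a 4x reduction
print(np.max(np.abs(w - w_hat)))  # rounding error bounded by scale / 2
```

The memory saving is exactly the ratio of the element sizes (4x for int8 vs. float32, 8x for 4-bit codes), and on bandwidth-bound CPU inference that translates almost directly into throughput.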