The flattened image patch of width and height PxP pixels gets multiplied with a ...

		blueblisters on Aug 25, 2022 \| parent \| context \| favorite \| on: Imagen: An AI system that creates photorealistic i... The flattened image patch of width and height PxP pixels gets multiplied with a learnable matrix of dimension P^2xD where D is the size of the patch embedding. In other words, it’s a linear transformation that reduces the dimensionality of the image patch.