People seem to have missed the point of what this is.
This is not "work out what the prompt is for an image"
Instead it lets you give the model a "thing" (as an image) and then use that thing in your prompts.
So for example they give it a picture of a statue and name it "S", and then can say "Elmo sitting in the same pose as S" and it correctly generates it.
This is not "work out what the prompt is for an image"
Instead it lets you give the model a "thing" (as an image) and then use that thing in your prompts.
So for example they give it a picture of a statue and name it "S", and then can say "Elmo sitting in the same pose as S" and it correctly generates it.