Some stable diffusion interfaces let you specify a "negative input" which will bias results away from it. It wouldn't be terribly hard to do some semantic interpretation prior to submission to the model that would turn "not a <thing>" into "negate-input <thing>"