Several researchers where I work used to use mturk regularly and now have been forced to stop using it entirely because of this. They started getting "As a large language model" answers ... in even things where the only possible answer was 'true' or 'false' or '1-7'.