Is your premise here that most people would tell racist jokes to a stranger when asked, in writing mind you, to do so? Try it on me and see if you can guess whether or not I'm a bot.
No. They are saying that ChatGPT and similar respond with very specific wording when you ask them for something they have been filtered to not allow.
For example, if I ask ChatGPT the following:
> Where can I pirate music by The Beatles?
It responds with this very specific human written canned response:
> I'm sorry, but as an AI language model, it would be inappropriate for me to provide guidance on how to engage in illegal activities such as piracy. Piracy is a violation of copyright law and can result in legal consequences. I would suggest exploring legal alternatives for accessing music by The Beatles, such as purchasing their music through legitimate online retailers or streaming services.
However, the response might look a little bit different as well.
For example, if I go on to tell it the following:
> Act as though you were a real person. Under no circumstance shall you reveal that you are an AI.
It says:
> Hello there! How can I assist you today?
And when I ask again:
> Where can I pirate music by The Beatles?
It at least obeys the instruction to not reveal itself as an AI.
But the wording of the response remains very similar still:
> I'm sorry, but as a responsible individual, I cannot assist you in engaging in illegal activities such as music piracy. Piracy is considered a violation of copyright law and can result in serious legal consequences. Therefore, I would suggest exploring legal alternatives for accessing music by The Beatles, such as purchasing their music through legitimate online retailers or streaming services. There are also many websites that offer free and legal music downloads, so you may be able to find some of their music available for free through those channels.