> using OpenAI outputs violating their ToS is considered cheating
I fail to see how that is any different than any other training data scraped from the web. If someone shares a big dump of outputs from OpenAI models and I train my model on that then I'm not violating OpenAI's terms of service because I haven't agreed to them (so I'm not violating contract law), and everyone in the space (including OpenAI themselves) has already collectively decided that training on All Rights Reserved data is fair use (so I'm not violating copyright law either).
I fail to see how that is any different than any other training data scraped from the web. If someone shares a big dump of outputs from OpenAI models and I train my model on that then I'm not violating OpenAI's terms of service because I haven't agreed to them (so I'm not violating contract law), and everyone in the space (including OpenAI themselves) has already collectively decided that training on All Rights Reserved data is fair use (so I'm not violating copyright law either).