We're currently planning to let app creators charge users, similar to Roblox/Robux if you're familiar, with that revenue shared between us and the app creator.
Kind of. Those that are explicitly trained to do that with consistent formats will do it better. They'll also save you the extra tokens needed to explain the format/method of interacting with functions. But yeah, you can simulate this with any recent model and enough explanation.
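For anyone curious what "enough explanation" might look like, here's a rough sketch, not any vendor's actual API. It spells out a JSON calling convention in the prompt and then checks the model's reply against it. The `TOOLS` registry, the `get_weather` function, and the reply format are all made up for illustration, and the actual model call is left out.

```python
import json

# Hypothetical tool registry; the name and schema are illustrative only.
TOOLS = {
    "get_weather": {
        "description": "Look up the current weather for a city.",
        "parameters": {"city": "string"},
    },
}

def build_system_prompt(tools):
    """Describe the calling convention in plain text.

    This text is the extra token cost a function-calling-trained model avoids.
    """
    lines = [
        "You can call functions. To call one, reply with ONLY a JSON object:",
        '{"function": "<name>", "arguments": {...}}',
        "Available functions:",
    ]
    for name, spec in tools.items():
        params = json.dumps(spec["parameters"])
        lines.append(f"- {name}: {spec['description']} Parameters: {params}")
    return "\n".join(lines)

def parse_call(reply, tools):
    """Return (name, arguments) if the reply is a valid call, else None."""
    try:
        data = json.loads(reply.strip())
    except json.JSONDecodeError:
        return None  # plain-text answer or malformed JSON
    if not isinstance(data, dict):
        return None
    name = data.get("function")
    args = data.get("arguments", {})
    if not isinstance(name, str) or name not in tools or not isinstance(args, dict):
        return None
    return name, args
```

You'd send `build_system_prompt(TOOLS)` as the system message and run `parse_call` on whatever comes back. A model without that training is more likely to wrap the JSON in chatter or drift from the format, which is why the parser falls back to `None` instead of assuming the reply is well-formed.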
Agree with not using mocks and following PRY especially. Oh man - it's so easy to want to make code so neat and clean with perfect reusable classes and tests, but honestly, it's more effort than it's worth in 99% of cases.