I wouldn't be so sure. Combine current capability levels with the larger context windows, and these models can probably already point out most of the problems in a piece of code.
I recently fed a very large file into GPT-4 and it handed me a few serious bugs that I hadn't noticed after a few self-reviews.