It's funny, because I wouldn't consider the comment they highlight in their post a nitpick.
Something that has an impact on the long-term maintainability of code is definitely not nitpicky, and in the majority of cases "define a type" fits this category, since it makes refactors and extensions MUCH easier.
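To illustrate with a rough TypeScript sketch (the names are made up, it's just the shape of the argument):

  // Without a named type, every call site repeats the shape, and adding
  // a field means hunting down every signature by hand:
  function notifyUser(id: string, email: string, locale: string) {
    console.log(`notify ${id} at ${email} (${locale})`);
  }

  // With a defined type, extending it is a one-line change and the
  // compiler points at every place that needs updating:
  interface User {
    id: string;
    email: string;
    locale: string;
    // timezone: string;  // adding this later is a single edit here
  }

  function notifyUserTyped(user: User) {
    console.log(`notify ${user.id} at ${user.email} (${user.locale})`);
  }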
On top of that, I think the approach they went with is a huge mistake. The same comment can be a nitpick on one CR but crucial on another, so clustering them is destined to produce false positives and false negatives.
I'm not sure I'd want to use a product to review my code when 1) I cannot customize the rules, and 2) the rules chosen by the creators seem poor.
To be honest, I wouldn't want to use any AI-based code reviewer at all. We have one at work (FAANG, so something with a large dedicated team behind it), and it has not once produced a useful comment but has been factually wrong many times.
1) This is an ego problem. Whoever is doing the development cannot handle being called out on certain software architecture / coding mistakes, so it becomes "nitpicking".
2) The software shop has a "ship faster, cut corners" culture, in which case they might as well turn off the AI review bot.
I've found it's nice to talk to an LLM about personal issues because I know it's not a real person judging me. Maybe if the comments were kept private between the tool and the dev, it'd be more of a coaching tool that didn't feel like criticism?
This goes both ways. I worked for a company where the majority of PR comments were of the "Well, _I_ wouldn't do it this way" form. In some cases, the "way" they were complaining about was taken directly from examples in the language's library docs.
In one specific case, a PR was held up from merging because I used "regexen" as the plural of "regex" instead of "regexes". IN A COMMENT. <eye roll>
This does not address the issue raised in iLoveOncall's third paragraph: "the same comment can be a nitpick on one CR but crucial on another..." In "attempt 2", you say that "the LLM's judgment of its own output was nearly random", which raises questions that go well beyond nitpicking, up to whether the current state of the art in LLM code review is fit for much more than ticking the box that says "yes, we are doing code review."