Google CEO says more than a quarter of the company's new code is created by AI

S0y · 2024-10-30T02:15:49 1730254549

asdfman123 · 2024-10-31T00:34:48 1730334888

I work for Google, and I just got done with my work day. I was just writing I guess what you'd call "AI generated code."

But the code completion engine is basically just good at finishing the lines I'm writing. If I'm writing "function getAc..." it's smart enough to complete to "function getActionHandler()", and maybe suggest the correct arguments and a decent jsdoc comment.

So basically, it's a helpful productivity tool but it's not doing any engineering at all. It's probably about as good as Copilot, maybe slightly worse. (I haven't used it recently though.)

NotAnOtter · 2024-10-31T08:55:14 1730364914

I also worked at Google (until last Friday). Agree with what you said. My thoughts are:

1. This quote is clearly meant to exaggerate reality, and they are likely including things like fully automated CL/PR's which have been around for a decade as "AI generated".

2. I've stated before that if a team of 8 using things like Copilot is as productive as a team of 10 without it, it's fair to say "AI replaced 2 engineers", in my opinion. More importantly, tech leaders would be making this claim if it were true. Copilot and its clones have been around long enough now for the evidence to be in, and no one is stating "we've replaced X% of our workforce with AI". Therefore my claim (by 'denying the consequent') is that using Copilot does not materially accelerate development.

ahmedfromtunis · 2024-10-31T10:55:48 1730372148

> no one is stating "we've replaced X% of our workforce with AI"

Even if that's been happening, I don't think it would be politically savvy to admit it.

In today's social climate claiming to replace humans with AI would attract the wrong kind of attention from politicians (during an election year) and from the public in general.

This would be even more unwise to admit for a company like Google, which is an "AI producer". They may reserve such language for closed meetings with potential customers during sales pitches, though.

whywhywhywhy · 2024-10-31T11:52:14 1730375534

> and from the public in general

Don't think the public will be that concerned about people in Google's salary bracket losing their jobs.

jl6 · 2024-10-31T16:07:31 1730390851

It’s a disservice to the public to assume they aren’t capable of understanding why AI job losses might be concerning even if they aren’t directly impacted. Most people aren’t so committed to class warfare that they will root for the apocalypse as long as it stomps a rich guy first.

wavewrangler · 2024-10-31T17:08:10 1730394490

You mean poor person. As long as it stomps a poor person. The rich don’t have a habit of getting stomped. They direct other poor people to stomp their contemporaries. The poor don’t have a chance.

whatshisface · 2024-10-31T17:45:10 1730396710

I don't think a lot of people realize how few people are "rich" in the sense of not being impacted by the labor market, or how virtually all of them are retirees. CFOs aren't looking forward to a massive shift in the labor market for accountants any more than CPAs. Warren Buffett has a "job," he writes those letters for BH and oversees the firm's investments at a high level... and most of the people who live off of investments have children in the workforce. Even most people whose children live off of their investments have kids in the (nonprofit) workforce.

tehjoker · 2024-10-31T16:21:53 1730391713

Software engineers and grocery store workers are in different income brackets, but in the same class (labor/proletarian). It is managers, executives, and investors that are in the capitalist class. Class is determined by your relationship to production.

barrkel · 2024-10-31T17:30:04 1730395804

Software engineer salaries and stock compensation can be enough to shift alignment somewhat, especially after many years of capital accumulation.

tehjoker · 2024-10-31T17:42:10 1730396530

if you make the majority of your earnings from passive income or you do not need to work to live you are more part of the leisure class

barrkel · 2024-10-31T18:05:10 1730397910

Two things: capitalists don't not work; and if you have a sizeable portfolio, you may not need to work and may earn plenty of passive income, yet still work because you add more value at the margin working than fiddling with stock allocations or angel investing or whatnot (vs index funds etc.).

datavirtue · 2024-11-01T11:41:48 1730461308

It's easy to get a capitalist to come out of retirement. Most of the time you just have to ask them to take a look at your business. Before you know it they accept a board position and shortly thereafter they are running point as President.

tehjoker · 2024-11-03T04:36:45 1730608605

For an illustrated example, you can watch Succession

DAGdug · 2024-11-04T01:07:45 1730682465

I’ve switched from manager to IC and vice-versa a few times at FAANG. Didn’t strike me as moving between the capitalist and proletariat classes, lol!

ytss · 2024-10-31T12:01:18 1730376078

The public might be concerned, though, that if they are being replaced, many in other positions at other companies will soon be replaced as well.

darth_avocado · 2024-10-31T15:21:28 1730388088

That’s not how the mind works. People cheered when Elon fired 80% of the Twitter staff. No one cares when people with high paying jobs suffer.

mmcdermott · 2024-10-31T16:00:02 1730390402

The people who cheered about the firing of 80% of the Twitter staff largely believed (rightly or wrongly) that they were being adversely affected by them. While Google may be seen with more wariness in tech circles, I don't think the average person believes that Google is actively harming them (again, rightly or wrongly).

ahmedfromtunis · 2024-10-31T20:43:59 1730407439

These aren't the same types of events. In Twitter's case, it was a one-off act, caused by one-off circumstances. With Google, it'd be more of a precursor to a new trend that might soon take root and impact me or those I care about.

almatabata · 2024-10-31T19:10:41 1730401841

I think Twitter is an outlier because people already hated the employees for various reasons.

For example, they thought that Twitter had a bloated workforce because of videos like this (https://www.youtube.com/watch?v=buF4hB5_rFs).

And a lot of people heavily disagreed with how they handled moderation. You can take things like the Hunter Biden laptop suppression or, in the funny category, getting banned for saying "learn to code" (https://reason.com/2019/03/11/learn-to-code-twitter-harassme...).

Take a random company without controversies and you will find less vitriol about its employees getting fired.

pjmlp · 2024-10-31T16:18:10 1730391490

No one cares about the impact of supermarket self-checkout on employees until their own employer does something similar.

alsetmusic · 2024-11-03T01:57:39 1730599059

I care as a consumer who hates standing in long lines. My former bank branch had thirteen teller stations and two tellers. This wasn't on a bad day. This was for years.

whatshisface · 2024-10-31T17:37:53 1730396273

People in Google salary brackets get jobs at Google-1 salary brackets, pushing junior people at Google-1 to Google-2, all the way down to IT departments at non-tech firms. This impacts everybody who's in the industry or capable of switching.

ahmedfromtunis · 2024-10-31T20:41:15 1730407275

Why would the general public care about Google employees? Google is, however, a major SaaS provider. And people might start to worry that their employer is soon going to buy a subscription to whatever it is that Google used to automate jobs.

wbl · 2024-10-31T16:39:17 1730392757

The bank tellers didn't go away: they just became higher paid and higher skilled when cash management was no longer the job.

burningChrome · 2024-10-31T16:50:15 1730393415

>> Even if that's been happening, I don't think it would be politically savvy to admit it.

When I was working in RPA (robotic process automation) about 7 years ago, we were explicitly told not to say "You can reduce your team size by having us develop an automation that handles what they're doing!"

Even back then we were told to talk about how RPA (and by proxy AI) empowers your team to focus on the really important things. Automation just reduces the friction to getting things done. Instead of doing 4 hours of mindless data input or moving folders from one place to the other, automation gives you back those four hours so your team can do something sufficiently more important and focus on the bigger picture stuff.

Some teams loved the idea. Other leaders were skeptical and never adopted it. I spent the majority of those three years trying to sell them on the idea that automation was good, and very little time actually coding. It's interesting seeing the paradigm shift and seeing this stuff everywhere now.

aleph_minus_one · 2024-10-31T17:08:28 1730394508

> Even back then we were told to talk about how RPA (and by proxy AI) empowers your team to focus on the really important things.

As a non-politically savvy person ;-) I have a feeling that this is a similarly dangerous message, since what prevents many teams from focusing on really important things is often far too long meetings with managers and similar "important" stakeholders.

ethbr1 · 2024-10-31T19:06:24 1730401584

The reason you don't lead with headcount reduction is two-fold.

1. Almost every business has growing workload. That means reassigning good employees and not hiring new headcount, not firing existing headcount. Unipurpose, low-value offshore teams are the only ones who get cut (e.g. doing "{this} for every one of {these}" work).

2. Most operational automation is impossible to build well without deep process expertise from the SME currently performing it. If you fire that person immediately after automating their task, what do you think the next SME tells you, when you need their help?

Successfully scaling operational automation programs therefore relies on avoiding additional headcount (aka improving their volume:employee ratio) and on value measurement (FTE-equivalent time savings) to justify and measure them.

lenerdenator · 2024-10-31T14:09:55 1730383795

> I don't think it would be politically savvy to admit it.

Would it be? Do they care?

Sam Altman's been talking about how GenAI could break capitalism (maybe not the exact quote, but something similar), and these companies have been pushing out GenAI products that could obviously and easily be used to fake photographic or video evidence of things that have occurred in the real world. Elon's obsessed with making an AI that's trained to be a 20-year-old male edgelord from the sewer pits of the internet.

Compared to those things, "we've replaced X% of our workforce with AI" is absolutely anodyne.

agentultra · 2024-10-31T16:01:08 1730390468

100%.

Altman tells anyone who will listen to him that monopolies are the only path to success in business. He has a lot riding on making sure everyone is addicted to AI and that he’s the one selling the shovels.

Google isn’t far off.

Most capitalists have this fantasy that they can reduce their labour expenses with AI and continue stock buy-backs and ever-increasing executive payouts.

What sucks is that they rely on class divisions so that people don’t feel bad when the “overpaid” software developers get replaced. Problem is that software developers are also part of the proletariat and creating these artificial class divisions is breaking up the ability to organize.

It’s not AI replacing jobs, it’s capital holders. AI is just the smoke and mirrors.

ahmedfromtunis · 2024-10-31T20:46:16 1730407576

Sam's company is not a multi-trillion dollar behemoth that employs hundreds of thousands and has a practical (near-)monopoly on huge swaths of the digital economy.

rty32 · 2024-10-31T13:21:41 1730380901

> I don't think it would be politically savvy to admit it.

Depends on who you ask.

If Trump wins and Elon Musk actually gets a new job, they would be bragging about replacing humans with AI all day long. And corporates are going to love it.

Not sure about what voters think though. But the fact that most of these companies are in California, New York etc means that it barely matters.

petre · 2024-10-31T16:46:54 1730393214

Yup, just like full self-driving and ending the war in Ukraine in 24 hours.

sfink · 2024-10-31T18:10:24 1730398224

I find the boast about ending the war to be reasonably likely -- if it is clear the US is switching sides in the conflict, a negotiated capitulation could happen pretty quickly.

In a similar vein, solving world hunger is closer today than it's ever been. The previous best hope was global thermonuclear war, but honestly that would leave enough survivors as to be mostly ineffective, and much more likely to have the opposite result. Severe climate change has a better shot at fully eliminating [human] hunger.

ulfw · 2024-10-31T16:38:46 1730392726

Corporates will soon have to realise the hard reality that when masses of humans have been replaced there won't be masses of humans with salaries to buy said corporate's goods anymore.

datavirtue · 2024-11-01T11:50:43 1730461843

AI is socialism, and it's unstoppable. People are trying to stop progress and go back to the old days. Nothing about the universe permits this.

A new economy is forming and there is nothing that can stop it without causing major, unintended fallout.

burningChrome · 2024-10-31T18:09:57 1730398197

>> they would be bragging about replacing humans with AI all day long.

Has either bragged about this at all?

The only thing I've heard floated is Musk running a "government efficiency commission", which I just assumed meant he would be looking for ways to gut a lot of the never-ending, never-dying government programs. I've never heard him say the commission's goal was to replace people with AI.

https://www.newsnationnow.com/politics/2024-election/trump-m...

The former president said such an audit would be to combat waste and fraud and suggested it could save trillions for the economy.

As the first order of business, Trump said that this commission will develop an action plan to eliminate fraud and improper payments within six months.

datavirtue · 2024-11-01T11:47:37 1730461657

Trump and Musk will get bored quickly if elected. Once in office your power is checked.

tjahg · 2024-10-31T13:56:57 1730383017

[flagged]

lenerdenator · 2024-10-31T14:11:58 1730383918

That would be the way someone with no real awareness of the philosophies and realities of the two parties in the US would see it. And to be fair, that's a good description of a large chunk of the American electorate.

But you can't have a guy who literally used to relieve himself into a golden toilet take over your party and be anything but the party of big business and billionaires.

Thorrez · 2024-10-31T15:17:34 1730387854

>Despite widespread rumors, there is no verified evidence that Trump actually owns a gold toilet.

https://royaltoiletry.com/does-trump-have-a-gold-toilet-unpa...

lenerdenator · 2024-10-31T17:03:13 1730394193

Fair enough.

Still a guy who operated multiple luxury hotel and golf course properties that would laugh a working man out the front door if he asked for an affordable room.

onion2k · 2024-10-31T09:25:12 1730366712

> no one is stating "we've replaced X% of our workforce with AI"

That's only worth doing if you're trying to cut costs though. If the company has unmet ambitions there's no reason to shrink the headcount from 10 to 8 and have the same amount of output when you can keep 10 people and have the output of 12 by leveraging AI.

hyperpape · 2024-10-31T10:06:47 1730369207

Almost all the big tech companies have had layoffs over the past several years. I think it’s safe to say cost cutting is very much part of their goal.

lupire · 2024-10-31T11:46:17 1730375177

But the specific roles being laid off are arbitrary, and the overall headcount-reduction goal is driven by macroeconomic factors (I'm being generous there), not by new efficiencies.

Note the difference between "cost cutting" (do less, to lower cost) and "efficiency" (do the same, but at lower cost).

theptip · 2024-10-31T17:00:04 1730394004

The goal of these cost cutting initiatives is not an absolute reduction in cost, but a relative one. They needed to show an improvement in operating margin, ie % of revenue spent on engineers.

If your engineers become 20% more efficient then your margins are better and your problem is solved. (Indeed if you have tech that can make any engineer 20% more efficient then you are back in the game of hiring as many as you can find, as long as each added engineer brings in enough additional revenue.)

ktnaWA · 2024-10-31T12:02:51 1730376171

Thanks, that is how I read the announcement. The powers that be decided that there must be some quota to be fulfilled, and magically that quota was fulfilled.

AI engineers will not yet get a Nobel prize for putting everyone out of work.

pj_mukh · 2024-10-31T14:01:26 1730383286

"we've replaced X% of our workforce with AI"

Most likely what is actually happening is that the X% of workforce you would lay off is being put to other projects and Google in general can take on X% more projects for the same labor $$. So there is no real reason to make that particular "replaced" statement.

Sparkyte · 2024-10-31T18:27:48 1730399268

Google has to sell its AI somehow. The problem is that businesses will see this and want to cut headcount because they go, "Well I guess AI can do it for freeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee!". Nope, no way is it writing code for free.

wcoenen · 2024-10-31T16:34:33 1730392473

> including things like fully automated CL/PR's which have been around for a decade

I haven't seen this yet so I'm intrigued. Is this a commercial product, or internal tooling?

OkGoDoIt · 2024-10-31T17:12:37 1730394757

I’m assuming this refers to things analogous to dependabot on GitHub where maybe it automatically updates a library version reference and runs the tests and creates a PR if everything seems good, or similarly for fixing style issues or other stuff that is pretty trivial and has good test coverage.

When you maintain an open source project on GitHub you will occasionally get some open source automated bot that submits a PR to do things like this without you even asking, and I’m sure there’s plenty more you can sign up for or implement yourself.

I wouldn’t really call it AI, but it is automated. I agree with the parent comment that a journalist trying to push an angle would probably lump it in as AI in order to make the number seem larger.

NotAnOtter · 2024-11-02T19:30:50 1730575850

It's common at most mega-corps like Google. For example, if a utility function in an internal library was deprecated and replaced with a different function that has the same functionality, a team might write a script which generates hundreds or thousands of PRs to make the migration to the new function.

You don't want a single PR that does that, because that would affect thousands of projects, and if something goes wrong with a single one, the whole PR needs to be rolled back.
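A minimal sketch of that kind of migration script, in Python. Everything in it is hypothetical: the names `old_util` and `new_util`, the assumption that the first path component identifies a project, and the in-memory file map standing in for a real repository. It only shows the shape of the approach: apply the same mechanical rewrite everywhere, then group the edits into one change per project so that each can be reviewed and rolled back on its own.

```python
import re
from collections import defaultdict

# Hypothetical names: the deprecated helper and its drop-in replacement.
OLD_NAME = "old_util"
NEW_NAME = "new_util"

# Match the old name only as a whole identifier that is being called.
CALL_PATTERN = re.compile(r"\b" + re.escape(OLD_NAME) + r"(?=\s*\()")


def rewrite_source(source: str) -> str:
    """Replace calls to the deprecated function with the new one."""
    return CALL_PATTERN.sub(NEW_NAME, source)


def plan_changes(files: dict[str, str]) -> dict[str, dict[str, str]]:
    """Group rewritten files by project (assumed: first path component).

    Returns {project: {path: new_source}} with only the files that changed,
    so each project can become its own independently revertible change.
    """
    changes: dict[str, dict[str, str]] = defaultdict(dict)
    for path, source in files.items():
        updated = rewrite_source(source)
        if updated != source:
            project = path.split("/", 1)[0]
            changes[project][path] = updated
    return dict(changes)


if __name__ == "__main__":
    # Toy stand-in for a repository checkout.
    repo = {
        "search/a.py": "x = old_util(1)\n",
        "search/b.py": "y = 2\n",
        "maps/c.py": "z = old_util(3)\n",
    }
    for project, edits in sorted(plan_changes(repo).items()):
        print(project, sorted(edits))
```

A real tool would parse the code rather than use a regex, and would open one PR per entry in the returned plan; both are left out here.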

nlehuen · 2024-10-31T09:09:59 1730365799

I also work at Google and I agree with the general sentiment that AI completion is not doing engineering per se, simply because writing code is just a small part of engineering.

However in my experience the system is much more powerful than you described. Maybe this is because I'm mostly writing C++ for which there is a much bigger training corpus than JavaScript.

One thing the system is already pretty good at is writing entire short functions from a comment. The trick is not to write:

  function getAc...

But instead:

  // This function smargls the bleurgh
  // by flooming the trux.
  function getAc...

This way the completion goes much farther and the quality improves a lot. Essentially, use comments as the prompt to generate large chunks of code, instead of giving minimum context to the system, which limits it to single line completion.
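To make the pattern concrete, here is an illustration in Python rather than C++ or JavaScript. The leading comment is the part a person would type. The body was written by hand for this example to show the kind of short function such a comment pins down; it is not output from Google's internal tool or from any other assistant, and the function name and behaviour are made up.

```python
import re
from collections import Counter


# Return the n most frequent words in text, ignoring case and punctuation,
# ordered from most to least frequent; ties are broken alphabetically.
def top_words(text: str, n: int) -> list[str]:
    # Lowercase first so "The" and "the" count as the same word.
    words = re.findall(r"[a-z0-9']+", text.lower())
    counts = Counter(words)
    # Sort by descending count, then alphabetically for a stable order.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:n]]


if __name__ == "__main__":
    print(top_words("The cat and the hat. The end, and that is all.", 2))
```

The comment spells out the tie-breaking and normalisation rules, which is exactly the context a one-line `def top_...` prefix would not carry.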

Aachen · 2024-10-31T10:48:19 1730371699

This type of not having to think about the implementation, especially in a language that we've by now well-established can't be written safely by humans (including by Google's own research into Android vulnerabilities if I'm not mistaken), at least with the current level of LLM, worries me the most

Time will tell whether it outputs worse, equal, or better quality than skilled humans, but I'd be very wary of anything it suggests beyond obvious boilerplate (like all the symbols needed in a for loop) or naming things (function name and comment autocompletes like the person above you described)

munksbeer · 2024-10-31T14:50:57 1730386257

> worries me the most

It isn't something I worry about at all. If it doesn't work and starts creating bugs and horrible code, the best places will adjust to that and it won't be used or will be used more judiciously.

I'll still review code like I always do and prevent bad code from making it into our repo. I don't see why it's my problem to worry about. Why is it yours?

Aachen · 2024-10-31T15:51:33 1730389893

Because I do security audits

Functional bugs in edge cases are annoying enough, and I seem to run into these regularly as a user, but there's yet another class of people creating edge cases for their own purposes. The nonchalant "if it doesn't work"... I don't know whether that confirms my suspicion that not all developers are aware of (as a first step; let alone control for) the risks

twoWhlsGud · 2024-10-31T16:35:32 1730392532

And especially if it generates bugs in ways different from humans - human review might be less effective at catching it...

xp84 · 2024-11-02T16:13:57 1730564037

It generates bugs in pretty similar ways. It’s based on human-written code, after all.

Edge cases will usually be the ones to get through. Most developers don’t correctly write tests that exercise the limits of each input (or indeed have time to both unit test every function that way, and integration test to be sure the bigger stories are correctly working). Nothing about ai assist changes any of this.

(If anybody starts doing significant fully unsupervised “ai” coding they would likely pay the price in extreme instability so I’m assuming here that humans still basically read/skim PRs the same as they always have)
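For what "tests that exercise the limits of each input" can look like, here is a small Python sketch. The `page_bounds` function is a made-up example, not code from any project discussed here; the point is the list of boundary cases, which applies equally to hand-written and AI-assisted code.

```python
def page_bounds(total: int, page: int, page_size: int) -> tuple[int, int]:
    """Return the half-open [start, end) index range for a 1-based page."""
    if total < 0:
        raise ValueError("total must be non-negative")
    if page < 1:
        raise ValueError("page is 1-based")
    if page_size < 1:
        raise ValueError("page_size must be positive")
    start = min((page - 1) * page_size, total)
    end = min(start + page_size, total)
    return start, end


def check_limits() -> None:
    # Typical case.
    assert page_bounds(10, 1, 3) == (0, 3)
    # Last page only partially filled.
    assert page_bounds(10, 4, 3) == (9, 10)
    # Total is an exact multiple of the page size.
    assert page_bounds(9, 3, 3) == (6, 9)
    # One page past the end: an empty range, not an error.
    assert page_bounds(9, 4, 3) == (9, 9)
    # Empty input.
    assert page_bounds(0, 1, 3) == (0, 0)
    # Invalid inputs are rejected.
    for bad in [(10, 0, 3), (10, 1, 0), (-1, 1, 3)]:
        try:
            page_bounds(*bad)
        except ValueError:
            pass
        else:
            raise AssertionError(f"expected ValueError for {bad}")


if __name__ == "__main__":
    check_limits()
```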

mbfg · 2024-10-31T16:11:49 1730391109

Except that no one trusts Barney down the hall that has stack overflow open 24/7. People naturally trust AI implicitly.

caeril · 2024-10-31T15:19:07 1730387947

It's worrying, yes, but we've had stackoverflow copy-paste coding for over a decade now already, which has exactly the same effects.

This isn't a new concern. Thoughtless software development started a long time ago.

Aachen · 2024-10-31T15:59:23 1730390363

As a security consultant, I think I'm aware of security risks all the time, also when I'm developing code just as a hobby in spare time. I can't say that I've come across a lot of stackoverflow code that was unsafe. It happened (like unsafe SVG file upload handling advice) and I know of analyses that find it in spades, but I personally correct the few that I see (got enough stackoverflow rep to downvote, comment, or even edit without the user's approval though I'm not sure I've ever needed that) and the ones found in studies may be in less-popular answers that people don't come across as often because we should be seeing more of them otherwise, both personally and in the customer's code

So that's not to say there is nothing to be concerned about on stackoverflow, just that the risk seems manageable and understood. You also nearly always have to fit it to your own situation anyway. With the custom solutions from generative models, this is all not yet established and you're not having to customise (look at) it further if it made a plausible-looking suggestion

Perhaps this way of coding ends up introducing fewer bugs. Time will tell, but we all know how many wrong answers these things generate in text as well as what they were trained on, giving grounds for worry—while also gathering experience, of course. I'm not saying to not use it at all. It's a balance and something to be aware of

I also can't say that I find it to be thoughtless when I look for answers on stackoverflow. Perhaps as a beginning coder, you might copy bigger bits? Or without knowing what it does? That's not my current experience, though

miki123211 · 2024-10-31T10:22:23 1730370143

This is a good idea even outside of Google, with tools like copilot and such.

Often when I don't know exactly what function / sequence of functions I need to achieve a particular outcome, I put in a comment describing what I want to do, and Copilot does the rest. I then remove the comment once I make sure that the generated code actually works.

I find it a lot less flow-breaking than stackoverflow or even asking an LLM.

It doesn't work all of the time, and sometimes you do have to Google still, but for the cases it does work for, it's pretty nice.

Aachen · 2024-10-31T10:44:44 1730371484

Why remove the comment that summarises the intent for humans? The compiler will ignore your comment anyway, so it's only there for the next human who comes along and will help them understand the code

miki123211 · 2024-11-01T19:39:56 1730489996

Because the code, when written, is usually obvious enough.

Something like:

  query = query.orderBy(field: "username", Ordering.DESC)

Doesn't need an explanation, but when working in a language I don't know well, I might not remember whether I'm supposed to call orderBy on the query or on the ORM module and pass query as the argument, whether the kwarg is called "field" or "column", whether it wants a string or something like `User.name` as the column expression, how to specify the ordering and so on.

randomdata · 2024-10-31T13:01:15 1730379675

Like he says, the "comment" describes what he wants to do. That's not what humans are interested in. The human already knows "what he wants to do" when they read the code. It's the things like "why did he want to do this in the first place?" that is lacking in the code, and what information is available to add in a comment for the sake of humans.

Remember, LLMs are just compilers for programming languages that just so happen to have a lot of similarities with natural language. The code is not the comment. You still need to comment your code for humans.

JohnFen · 2024-10-31T14:29:34 1730384974

> Like he says, the "comment" describes what he wants to do. That's not what humans are interested in.

When I'm maintaining other people's code, or my own after enough time has gone by, I'm very interested in that sort of comment. It gives me a chance to see if the code as written does what the comment says it was intended to do. It's not valuable for most of the code in a project, but is incredibly valuable for certain key parts.

You're right that comments about why things were done the way they were are the most valuable ones, but this kind of comment is in second place in my book.

mithametacs · 2024-10-31T16:56:33 1730393793

Or for something that needs, like, a quick mathematical lemma or a worked example. A comment on the "what" is fantastic.

qwertox · 2024-10-31T11:16:12 1730373372

It's often unnecessarily verbose. If you read a comment and glance at the code that follows, you'll understand what it is supposed to do. But the comment you're giving as an instruction to an LLM usually contains information which will then be duplicated in the generated code.

Aachen · 2024-10-31T11:34:09 1730374449

I see. It might still be better to have a verbose comment than no comment at all, as well as a marker of "this was generated" so (by the age of the code) you have some idea of what quality the LLM was in that year and whether to proofread it once more or not.

lupire · 2024-10-31T11:52:40 1730375560

External comments are API usage comments. LLM prompts are also an implementation proposal.

Implementation comments belong inside the implementation, so they should be moved over if not deleted.

cryptonym · 2024-10-31T10:58:53 1730372333

Next human will put the code in a prompt and ask what it does. Chinese Whispers.

Aachen · 2024-10-31T11:31:10 1730374270

I tried making a meme some months ago with exactly this idea, but for emails. One person would tell an LLM "answer that I'm fine with either option" and send a 5 KB email; the recipient would then get the automatic summary function to tell them (in a good case) "they're happy either way" or (in a bad case) "they don't give a damn". It didn't really work, too complex for meme format as far as my abilities went, but yeah, the bad translator effect is something I'm very much expecting from people who use an LLM without disclosing it.

_heimdall · 2024-10-31T12:39:12 1730378352

If someone is going to use an LLM to send me an email, I'd much rather they just send me the prompt directly. For the LLM message to be useful the prompt would have included all the context and details anyway; I don't need an LLM to make it longer and sound more "professional" or polite.

Aachen · 2024-10-31T14:43:10 1730385790

That is actually exactly my unstated point / the awareness I was hoping to achieve by trying to make that meme :D

mithametacs · 2024-10-31T16:58:49 1730393929

Not necessarily. Your prompt could include instructions to gather information from your emails and address book to tell your friend about all the relevant contacts you know in the shoe industry.

_heimdall · 2024-11-01T01:36:11 1730424971

Well that sounds reasonable enough. My only request is that you send me the prompt and let me decide if I want to comply...informed consent!

alexxys · 2024-11-06T00:48:24 1730854104

This meme already exists ;) https://www.reddit.com/r/ChatGPT/comments/128nwi2/bullet_poi...

rty32 · 2024-10-31T13:34:35 1730381675

Wow, I love good, original programming jokes like these, even the ideas of the jokes. I used to browse r/ProgrammerHumor frequently, but it is too repetitive -- mostly recycled memes, and there is rarely anything new.

This is one that I really liked: https://www.reddit.com/r/ProgrammerHumor/comments/l5gg3t/thi...

lupire · 2024-10-31T11:50:51 1730375451

(No need to Orientalize to defamiliarize, especially when a huge fraction of the audience is Chinese, so Orientalizing doesn't defamiliarize. Game of Whispers or Telephone works fine.)

protomolecule · 2024-10-31T13:09:46 1730380186

Do the Chinese call it English Whispers?

tessierashpool · 2024-10-31T15:29:29 1730388569

Chinese-Americans, at least, call it a game of Telephone, like everyone else in the English-speaking world except for the actual English.

We call it “Telephone” because “Chinese Whispers” not only sounds racist, it is also super confusing. You need a lot of cultural context to understand the particular way in which Chinese whispers would be different from any other set of whispers.

tessierashpool · 2024-11-13T18:54:40 1731524080

I happened to re-read this, and to be clear, I'm not Chinese-American. the "we" there means "everyone else in the English-speaking world except for the actual English."

ahoka · 2024-10-31T14:08:21 1730383701

It’s all Greek to them.

cryptonym · 2024-11-04T08:30:55 1730709055

Pardon my French.

jappgar · 2024-10-31T11:17:41 1730373461

I can guarantee you there is more publicly accessible javascript in the world than C++.

Copilot will autocomplete entire functions as well, sometimes without comments or even after just typing "f". It uses your previous edits as context and can assume what you're implementing pretty well.

infecto · 2024-10-31T12:01:22 1730376082

I can guarantee you that the author was referencing code within Google. That is, their tooling is trained off internal code bases. I am imagining c++ dwarfs javascript.

lupire · 2024-10-31T11:48:27 1730375307

Google does not write much publicly available JavaScript. They wrote their own special flavor. (Same for any huge legacy operation.)

bilekas · 2024-10-31T12:08:53 1730376533

Can we get some more info on what you're referring to?

jkaptur · 2024-10-31T14:54:25 1730386465

They're probably talking about Closure Compiler type annotations [0], which never really took off outside Google, but (imo) were pretty great in the days before TypeScript. (Disclosure: Googler)

0. https://github.com/google/closure-compiler/wiki/Annotating-J...

cryptonym · 2024-10-31T11:06:18 1730372778

I find writing code to be almost relaxing, plus that's really a tiny fraction of dev work. Not too excited about potential productivity gains based purely on authoring snippets. I'd find it much more interesting for boosting maintainability, robustness and other quality metrics (not focusing on the quality of AI output, but the actual quality of the code base).

xp84 · 2024-11-02T16:00:20 1730563220

I frequently use copilot and also find that writing comments like you do, to describe what I expect each function/class/etc to do gives superb results, and usually eliminates most of the actual coding work. Obviously it adds significant specification work but that’s not usually a bad thing.

michaelbuckbee · 2024-10-31T11:23:55 1730373835

I don't work at Google, but I do something similar with my code: write comments, generate the code, and then have the AI tooling create test cases.

AI coding assistants are generally really good at ramping up a base level of tests, which you can then direct them to extend with more specific scenarios.

tomhallett · 2024-10-31T14:04:21 1730383461

Has anyone made a coding assistant which can do this based off audio which I’m saying out loud while I’m typing (interview/pairing style), so instead of typing the comment I can just say it?

hecanjog · 2024-10-31T15:15:54 1730387754

I had some success using this for basic input, but never took it very far. It's meant to be customizable for that sort of thing though: https://talon.wiki/quickstart/getting_started/ (Edit: just the voice input part)

alickz · 2024-10-31T11:15:33 1730373333

Comment Driven Programming might be interesting, as an offshoot of Documentation Driven Programming

gniv · 2024-10-31T11:47:25 1730375245

That's pretty nice. Does it write modern C++, as I guess it's expected?

nlehuen · 2024-11-01T16:11:36 1730477496

Yes it does. Internally Google uses C++20 (https://google.github.io/styleguide/cppguide.html#C++_Versio...) and the model picks the style from training, I suppose.

atoav · 2024-10-31T04:59:46 1730350786

So this is basically the google CEO saying "a quarter of our terminal inputs is written by a glorified tab completion"?

asdfman123 · 2024-10-31T06:03:32 1730354612

Yes. Most AI hype is this bad. They have to justify the valuations.

remus · 2024-10-31T07:41:20 1730360480

"tab completion good enough to write 25% of code" feels like a pretty good hit rate to me! Especially when you consider that a good chunk of the other 75% is going to be the complex, detailed stuff where you probably want someone thinking about it fairly carefully.

rantallion · 2024-10-31T08:32:12 1730363532

The problem being that the time spent fixing the bugs in that 25% outweighs the time saved. Now that tools like Copilot are being widely used, studies are showing that they do not in fact boost productivity. All claims to the contrary seem to be either anecdotal or marketing fluff.

https://www.techspot.com/news/104945-ai-coding-assistants-do...

pawelmurias · 2024-10-31T10:45:29 1730371529

The AI tab completion is >100000% better than the coding assistants: it just saves you typing and doesn't introduce new bugs you need to fix, instead of writing buggy shitty code from a text description.

red_admiral · 2024-10-31T11:38:19 1730374699

As far as I know, LLMs are a genuine boost for junior developers, but still not close to what senior/principal engineers get up to.

makestuff · 2024-10-31T13:25:14 1730381114

I have around 7 YOE, and I have found LLMs useful for very specific questions about syntax whenever I am working in a new language. For example, I needed to write some typescript recently and asked it how can I make a type that does X.

It is not as good with questions about API documentation for popular java libraries though and it will just hallucinate APIs/method names.

If I ask it a generic question like "how can I create a class in Java to invoke this API and store the data in this database" it is pretty useless. I'm sure I could spend more time giving it a better prompt but at that point I can just write the code myself.

Overall they are a better search engine for stackoverflow, but the LLMs are not really helping me code 30% faster or whatever the latest claim is.

_heimdall · 2024-10-31T12:44:10 1730378650

It'd be interesting to know how much of Google's code is written by junior engineers. I can't imagine 25% of the code is from juniors, at which point Google's CEO is either exaggerating what he considers LLM-generated code or more than just juniors are using it.

I agree with your take though, it does seem helpful to juniors but not beyond that (yet), and this OP stat seems dubious unless juniors are doing a big portion of the work.

red_admiral · 2024-10-31T11:37:06 1730374626

"rm re[TAB]" to remove a file called something like "report-accounting-Q1_2024.docx" is really helpful, especially when it adds quotes as required, but not exciting enough to get me out of bed any earlier in the morning.

I feel it's a bit like the old "measuring developer productivity in LoC" metric.

As I hinted at in another comment, in Java if you had a "private String name;" then the following:

    /**
     * Returns the name.
     * @return The name.
     */
    public String getName() {
        return this.name;
    }

and the matching setter, are easy enough to generate automatically and you don't need an LLM for it. If AI can do that part of coding a bit better, sure it's helpful in a way, but I'm not worried about my job just yet (or rather, I'm more worried about the state of the economy and other factors).
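To illustrate how little machinery that kind of generation needs, here is a minimal sketch of a template-based accessor generator. It is not any particular IDE's implementation; the function name `generate_accessors` and the regex are assumptions made up for this example, and it only handles simple declarations like the one quoted above.

```python
import re

# Matches simple declarations such as "private String name;".
# Generic types like List<String> are deliberately out of scope for this sketch.
FIELD_RE = re.compile(r"private\s+(\w+)\s+(\w+)\s*;")


def generate_accessors(field_decl: str) -> str:
    """Return Java getter and setter source for one private field declaration."""
    match = FIELD_RE.fullmatch(field_decl.strip())
    if match is None:
        raise ValueError(f"unsupported field declaration: {field_decl!r}")
    type_name, field = match.groups()
    suffix = field[0].upper() + field[1:]  # name -> Name
    getter = (
        f"/**\n * Returns the {field}.\n * @return The {field}.\n */\n"
        f"public {type_name} get{suffix}() {{\n"
        f"    return this.{field};\n"
        f"}}\n"
    )
    setter = (
        f"/**\n * Sets the {field}.\n * @param {field} The {field}.\n */\n"
        f"public void set{suffix}({type_name} {field}) {{\n"
        f"    this.{field} = {field};\n"
        f"}}\n"
    )
    return getter + "\n" + setter


print(generate_accessors("private String name;"))
```

The whole thing is string substitution driven by one regular expression, which is roughly why this category of boilerplate was automated long before LLMs.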

Maxion · 2024-10-31T07:43:13 1730360593

For me it's really goddam satisfying having good autocomplete, especially when you are just writing boilerplate lines of code to get the code into a state where you actually get to work on the fun stuff (the harder problems).

amelius · 2024-10-31T07:56:34 1730361394

Also if your code gets sent to someone else's cloud?

infecto · 2024-10-31T12:08:37 1730376517

I don't care. The vast majority of code written in the private space is garbage and not unique. Products are usually not won because of the code.

Would I send the source of a trading algo or chatgpt to a third party, probably not but those are the outliers. The code for your xyz SAAS does not matter.

I am probably an outlier in that I don't really care what corpus an LLM trains off of. If it's available in the public space, go for it.

mewpmewp2 · 2024-10-31T08:19:10 1730362750

Have you ever had your code repository hosted by Github, Bitbucket, Gitlab or similar?

If so, all your code is sent to cloud.

amelius · 2024-10-31T11:29:16 1730374156

Answer: yes, some code. But other code I and my company like to keep private.

mewpmewp2 · 2024-10-31T11:44:36 1730375076

Where exactly is the repo hosted if there is one?

cesarb · 2024-10-31T13:19:00 1730380740

It's common for companies to have something like self-hosted GitHub Enterprise or self-hosted GitLab hidden behind the company's VPN.

mewpmewp2 · 2024-10-31T14:52:23 1730386343

But where is the box where it's hosted? Is it in-house?

_heimdall · 2024-10-31T12:47:08 1730378828

There are alternatives out there for self-hosted git. I have a Gitea instance running on a mini PC at home for my own projects.

mewpmewp2 · 2024-10-31T14:53:57 1730386437

Do you have backups of that as well? If something were to happen to your mini pc would you lose your code?

_heimdall · 2024-10-31T18:29:43 1730399383

Great question, yeah I do. Right now it backs up to a separate NAS on my home network. Every once in a while I'll copy the most important directories onto a microSD card backup, but it's usually going to be at least a few weeks out of date.

amelius · 2024-10-31T12:59:56 1730379596

Own servers.

mewpmewp2 · 2024-10-31T14:53:16 1730386396

Do they manage their own servers? I wonder what proportion of companies would have in house servers managed by themselves.

amelius · 2024-10-31T17:04:07 1730394247

They are colocated in a data center and you need physical keys to access the rack.

red_admiral · 2024-10-31T11:41:00 1730374860

Internally hosted gitlab instances are a thing.

mewpmewp2 · 2024-10-31T11:44:03 1730375043

They are, but frequently the boxes where they are hosted are in AWS or similar. Or do companies frequently have actual in-house servers for this purpose?

red_admiral · 2024-10-31T16:20:29 1730391629

Not in house, but in a "segmented" part of the cloud that comes with service level agreements and access control and restrictions on which countries the data can be hosted in and compliance procedures etc. etc.

An extreme example of this would be the AWS GovCloud for government/military applications.

keybored · 2024-10-31T10:30:15 1730370615

25% is a great win if you are prone to RSI. And for quicker feedback. But in terms of the overarching programming goal? Churning out code is a small part of it.

Code is often a liability.

shombaboor · 2024-10-31T14:00:22 1730383222

It would be funny if they had a metric for how much code is completed by CTRL+V

unglaublich · 2024-10-31T07:24:27 1730359467

Yes, isn't that the essential idea of industrialization and automation?

OtherShrezzing · 2024-10-31T08:38:36 1730363916

I think the critique here is that the AI currently deployed at Google hasn't meaningfully automated this user's life, because most IDEs already solved "very good autocomplete" more than a decade ago.

tormeh · 2024-10-31T12:50:03 1730379003

LLM autocomplete is on an entirely different level. It's not comparable to traditional autocomplete and mostly does not even compete with traditional autocomplete. LLM autocomplete will sometimes write entire blocks of code for you, with surprising skill. I often wonder how it knew what I wanted. It also generates some wrong code from time to time, but that's well worth it.

randomdata · 2024-10-31T13:14:03 1730380443

> LLM autocomplete is on an entirely different level.

Which is how they've surpassed 25% in new code, as compared to the 10% (made up number, but clearly non-zero) in the past. But incremental improvement, is all.

busterarm · 2024-10-31T14:33:13 1730385193

glorified, EXPENSIVE tab completion.

walthamstow · 2024-10-31T15:05:50 1730387150

I assume you're referring to the compute/energy used to run the completion?

busterarm · 2024-10-31T15:27:03 1730388423

to train the model

mmmpetrichor · 2024-10-31T06:24:42 1730355882

Yeah, but he wants people to hear "reduce headcount by 25% if you buy our shit!"

mewpmewp2 · 2024-10-31T08:21:19 1730362879

How do you know that? You are creating this false sense of expectations and hype yourself.

I am going to argue the contrary. If AI increases productivity 2x, it opens up as many new use cases that previously didn't seem worth doing for their cost. So overall there will just be more work.

JimDabell · 2024-10-31T09:48:14 1730368094

> I am going to argue the contrary. If AI increases productivity 2x, it opens up as many new use cases that previously didn't seem worth doing for their cost. So overall there will just be more work.

This is the entire history of the computing industry. We’ve been automating our work away for decades and it just creates more demand.

mewpmewp2 · 2024-10-31T10:20:01 1730370001

Yeah, this is only side projects, but I've been spending pretty much all of my free time now on side projects, largely because I feel much faster building them with LLMs and it has a compounding motivational effect. I also see so many use cases and work left to do, even with AI, the possibilities almost overwhelm me.

Well I do freelancing as well besides my usual day to day work, and that's also where direct benefits apply, and I'm getting more and more work, overwhelmingly so.

pawelmurias · 2024-10-31T10:46:50 1730371610

[flagged]

binkHN · 2024-10-31T11:48:21 1730375301

I wouldn't call it genius tab completion. Unfortunately, more than half of the time that the "genius" produces the code, I'm wasting my time reviewing code that is incorrect.

_psrj · 2024-10-31T11:07:40 1730372860

I'm sorry but I don't understand how people say LLMs are simply "tab completion".

They allow me to do much more than that thanks to all the knowledge they contain.

For instance, yesterday I wanted to write a tool that transfers any large file that is still being appended to, to multiple remote hosts, with a fast throughput.

By asking Claude for help I obtained exactly what I want in under two hours.

I'm no C/C++ expert yet I have now a functional program using libtorrent and libfuse.

By using libfuse my program creates a continuously growing list of virtual files (chunks of the big file).

A torrent is created to transfer the chunks to remote hosts.

Each chunk is added to the torrent as it appears on the file system thanks to the BEP46 mutable torrent feature in libtorrent.

On each receiving host, the program rebuilds the large file by appending new chunks as soon as they are downloaded through the torrent.

Now I can transfer a 25GB file (and growing) to 15 hosts as it is being written to.

Before LLM this would have taken me at least four days as I did not know those libraries.

LLMs aren't just parrots or tab completers, they actually contain a lot of useful knowledge and they're very good at explaining it clearly.
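The chunk-and-reassemble part of the design described above can be sketched in a few lines. This is only a rough illustration and not the commenter's program: it swaps plain files in a directory for the libfuse virtual files and the libtorrent transport, and every name here (`emit_new_chunks`, `append_ready_chunks`, `demo`, the `chunk-NNNNNNNN` naming, the tiny chunk size) is a hypothetical choice for the example.

```python
import tempfile
from pathlib import Path

CHUNK_SIZE = 4  # deliberately tiny so the demo is easy to follow


def emit_new_chunks(source: Path, chunk_dir: Path, next_index: int,
                    chunk_size: int = CHUNK_SIZE) -> int:
    """Sender side: write every complete chunk not emitted yet.

    Returns the index of the next chunk to emit. A trailing partial chunk
    is left alone until the source file has grown enough to complete it.
    """
    size = source.stat().st_size
    with source.open("rb") as f:
        while (next_index + 1) * chunk_size <= size:
            f.seek(next_index * chunk_size)
            (chunk_dir / f"chunk-{next_index:08d}").write_bytes(f.read(chunk_size))
            next_index += 1
    return next_index


def append_ready_chunks(chunk_dir: Path, target: Path, next_index: int) -> int:
    """Receiver side: append chunks to the target strictly in index order."""
    with target.open("ab") as out:
        while True:
            chunk = chunk_dir / f"chunk-{next_index:08d}"
            if not chunk.exists():
                break  # the next chunk has not arrived yet
            out.write(chunk.read_bytes())
            next_index += 1
    return next_index


def demo() -> bytes:
    """Grow a file in three steps and rebuild it chunk by chunk."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        source, target = root / "big.bin", root / "copy.bin"
        chunk_dir = root / "chunks"
        chunk_dir.mkdir()
        source.write_bytes(b"")
        sent = received = 0
        for piece in (b"abcdef", b"ghij", b"kl"):
            with source.open("ab") as f:
                f.write(piece)  # the writer keeps appending
            sent = emit_new_chunks(source, chunk_dir, sent)
            received = append_ready_chunks(chunk_dir, target, received)
        return target.read_bytes()


print(demo())
```

A real tool would also need a final flush for the last partial chunk and some way to tell receivers that the file is complete; both are left out here.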

qwertox · 2024-10-31T11:24:55 1730373895

> By asking Claude for help I obtained exactly what I want in under two hours.

Did you use it in your editor or via the chat interface in the browser? Because they are two different approaches, and the one in the editor is mostly a (pretty awesome) tab completion.

When I tell an LLM to "create a script which does ..." I won't be doing this in the editor, even if copilot does have the chat interface. I'll be doing this in the browser because there I have a proper chat topic to which I can get back later, or review it.

_psrj · 2024-10-31T11:38:59 1730374739

I did not use copilot or cursor. I used the Claude interface. I'm planning to setup a proper editor tool such as Cursor as I believe they got much better lately. Last time I tried was 2023 and it was kind of a pain in the butt.

qwertox · 2024-10-31T11:55:38 1730375738

I tried Cursor this month but even though it is much better than copilot, it also tries to do too much. And both of them fail regularly at generating proper autocompletions, which makes Cursor a bigger annoyance because it messes up your code quite often, which copilot doesn't do. Cursor is too aggressive.

But using copilot as a better autocomplete is really helpful and well worth the subscription. Just while typing as well as giving it more precise instructions via comments.

It's like a little helper in the editor, while the ChatGPT/Claude in the browser are more like "thinking machines" which can generate really usable code.

_psrj · 2024-10-31T12:28:15 1730377695

good to know, thanks

lupire · 2024-10-31T11:55:17 1730375717

That's fine for your quick hack that is probably a reimplementation of an existing program you can't find.

But it's not a production quality implementation of a new need.

pizzafeelsright · 2024-10-31T15:51:44 1730389904

I am of the strong opinion most problems were solved 20-40 years ago and that most code written today is reimplementation using different languages.

That I have shipped production code using LLMs in languages I did not study, approved by seasoned SWEs, is evidence that an acceleration is happening.

_psrj · 2024-10-31T12:26:37 1730377597

It's a knowledge base that can explain the knowledge it returns when you ask, how is that not useful in a professional environment for production code?

I mean if you assume all devs are script kiddies who simply copy paste what they find on google (or ChatGPT without asking for explanations) then yeah it's never gonna be useful in a prod setting.

Also you're very wrong to believe every technical need or combination of libraries has already been implemented in open source before.

rty32 · 2024-10-31T13:42:50 1730382170

True, but hey, even if it's not production code, it may be an ad-hoc thing that never gets pushed to production, or it may be code reviewed by C++ experts and improved to production quality. At the very least, someone saved four days with it, and could use the time for something else, maybe something they are expert at. Isn't that still good?

mdavid626 · 2024-11-01T06:30:20 1730442620

Most of the time saving time is just an illusion. When that code needs to be changed, people will spend more than 4 days debugging and understanding it. The mental model of it was written by AI. It can make sense or not at all. You’ll figure it out after 4 days.

_psrj · 2024-11-01T09:46:28 1730454388

The code is 2 files of 80 lines each and is very clear. There's no way any software developer needs 4 days to understand what it does.

Moreover Claude can explain the functions used very clearly (if you're too lazy to jump to definition in your editor)

LLMs are becoming actually useful to developers new to a language. Just as Google was 20 years ago.

mdavid626 · 2024-11-02T07:28:30 1730532510

People talk about completely different things. The article was about Google using LLMs to generate code, not people making 80 lines with them at home. There is a huge difference. I don’t see any problem with the latter, but with the former there are many problems.

znpy · 2024-10-31T11:26:00 1730373960

That sounds like a great idea, are you going to open source that?

_psrj · 2024-10-31T11:40:06 1730374806

I think I will. I don't have time to maintain additional software for other people right now, but I'm definitely planning on open sourcing it when I get time.

znpy · 2024-10-31T12:35:08 1730378108

Yeah i see your point.

However i think that you might open source the thing with a disclaimer of no maintenance. Whoever is willing to maintain it can just fork it and move along.

bitcharmer · 2024-10-31T14:56:27 1730386587

> thanks to all the knowledge they contain

This is what's problematic with modern "AI". Most people inexperienced with it, like the parent commenter, will uncritically assume these LLMs possess "knowledge". I find this the most dangerous and prevalent assumption. Most people are oblivious to how bad LLMs are.

_psrj · 2024-10-31T15:14:35 1730387675

I know exactly how bad the output they give is, because I ask for output that I can understand, debug and improve.

People misusing tools doesn't make the tools useless or bad. Especially since LLM designers never claimed the compressed information inside models is spotless or 100% accurate, or based on logical reasoning.

Any serious engineer with a modicum of knowledge about neural networks knows what can or can't be done with the output.

OnionBlender · 2024-10-31T04:41:03 1730349663

Do people find these AI auto complete things helpful? I was trying the XCode one and it kept suggesting API calls that don't exist. I spent more time fixing its errors than I would have spent typing the correct API call.

_kidlike · 2024-10-31T07:58:15 1730361495

I really really dislike the ones that get in your way. Like I start typing something and it injects random stuff (yes in the auto-complete colors). I have a similar feeling to when you hear your voice back in a phone: completely disabling your thought process.

In IntelliJ, thankfully, you can disable that part of the AI and keep the part that you trigger only when you want something from it.

frereubu · 2024-10-31T09:50:48 1730368248

> I have a similar feeling to when you hear your voice back in a phone: completely disabling your thought process.

This is a fantastic description of how it disturbs my coding practice which I hadn't been able to put into words. It's like someone is constantly interrupting you with small suggestions whether you want them or not.

gtirloni · 2024-10-31T10:58:22 1730372302

This is it. I have a picture in my mind and then it puts 10 lines of code in front of me that my brain can't ignore. By the time I'm done reviewing that, it's already tainted my idea.

mu53 · 2024-10-31T04:48:01 1730350081

I find the simpler engines work better.

I want the end of the line completed with focus on context from the working code base, and I don't want an entire 5 line function completed with incomplete requirements.

It is really impressive when it implements a 5 line function correctly, but it's like hitting the lottery.

ncruces · 2024-10-31T08:27:00 1730363220

I particularly like the part where it suggests changes to pasted code.

When I copy and paste code, very often it needs some small changes (like changing all xs to ys and at the same time widths to heights).

It's very good at this, and does the right thing the vast majority of the time.
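For comparison, the purely mechanical version of that paste-and-rename step can be sketched as a single-pass identifier swap. This is a hypothetical helper (`swap_identifiers` is a name invented here), not what any assistant does internally, and it only matches whole words, so names like `max_x` are left untouched.

```python
import re


def swap_identifiers(code: str, mapping: dict[str, str]) -> str:
    """Replace whole-word identifiers according to mapping in one pass.

    Doing all replacements in a single regex pass means x -> y and
    width -> height cannot interfere with each other.
    """
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, mapping)) + r")\b")
    return pattern.sub(lambda m: mapping[m.group(1)], code)


pasted = "right = rect.x + rect.width"
print(swap_identifiers(pasted, {"x": "y", "width": "height"}))
```

What a rule like this cannot do is notice that `right` should probably become `bottom`, which is the kind of contextual adjustment the comment credits the AI suggestions with.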

It's also good with test code. Test code is supposed to be explicit, and not very abstracted (so someone only mildly familiar with a codebase that's looking at a failing test can at least figure the cause). This means it's full of boilerplate, and a smart code generator can help fill that in.

andyjohnson0 · 2024-10-31T10:07:05 1730369225

Visual Studio "intellisense" has always been pretty good for me. Seemed to make good guesses about my intentions without doing anything wild. It seemed to use ad hoc rules and patterns, but it worked and then got out of the way.

Then it got worse a couple of years ago when they tried some early-stage AI approach. I turned it off. I expect that next time I update VS it'll have got substantially worse and it will have removed the option for me to disable it.

nobleach · 2024-10-31T17:56:05 1730397365

Agreed, the old Visual Basic, Visual C++, Borland Delphi, Visual C# experiences were how I dove into the deep end of several languages back in the late 90's/early 2000's. Things were VERY discoverable at that point. Obviously a deeper understanding of a language is necessary for doing real work, but noodling around just trying to get a feel for what can be done, is a great way to get started.

mcintyre1994 · 2024-10-31T07:11:51 1730358711

I like Cursor, it seems very good at keeping its autocomplete within my code base. If I use its chat feature and ask it to generate new code that doesn’t work super well. But it’ll almost always autocomplete the right function name as I’m typing, and then infer the correct parameters to pass in if they’re variables and if the function is in my codebase rather than a library. It’s also unsurprisingly really good at pattern recognition, so if you’re adding to an enum or something it’ll autocomplete that sensibly too.

I think it’d be more useful if it was clipboard aware though. Sometimes I’ll copy a type, then add a param of that type to a function, and it won’t have the clipboard context to suggest the param I’m trying to add.

qeternity · 2024-10-31T08:32:20 1730363540

I really like Cursor but the more I use it the more frustrated I get when it ends up in a tight loop of wanting to do something that I do not want to do. There doesn’t seem to be a good way to say “do not do this thing or things like it for the next 5 minutes”.

M4v3R · 2024-10-31T06:17:39 1730355459

It probably depends on the tool you use and on the programming language. I use Supermaven autocomplete when writing Typescript and it’s working great, it often feels like it’s reading my mind, suggesting what I would write next myself.

vbezhenar · 2024-10-31T07:41:46 1730360506

I mostly use one-line completes and they are pretty good. Also I really like when Copilot generates boilerplate like

    if err != nil {
      return fmt.Errorf("Cannot open settings: %w", err)
    }

I_AM_A_SMURF · 2024-10-31T06:27:30 1730356050

I use the one at G and it's definitely helpful. It's not revolutionary, but it makes writing code less of a headache when I kinda know what that method is called but not quite.

skybrian · 2024-10-31T05:15:19 1730351719

I often delete large chunks of it unread if it doesn't do what I expected. It's much like copy and paste; deleting code doesn't take long.

card_zero · 2024-10-31T06:25:53 1730355953

So your test is "seems to work"?

skybrian · 2024-10-31T06:30:51 1730356251

No, what I meant is that, much like when copying code, I only keep the generated source code if it's written the way I would write it.

(By "unread" I meant that I don't look very closely before deleting if it looks weird.)

And then write tests. Or perhaps I wrote the test first.

card_zero · 2024-10-31T06:44:56 1730357096

Oh, if the AI doesn't do what you expected, got it.

binkHN · 2024-10-31T11:53:21 1730375601

Right now my opinion is that they're 60% unhelpful, so I largely agree with you. Sometimes I'll find the AI came up with a somewhat better way of doing something, but the vast majority of the time it does something wrong or does something that appears right, but it's actually wrong and I can only spot it with a somewhat decent code review.

guappa · 2024-10-31T07:52:40 1730361160

I suspect that if you work on trivial stuff that has been asked on stackoverflow countless times they work very nicely.

OnionBlender · 2024-10-31T16:14:42 1730391282

This is what I've been noticing. For C++ and Swift, it makes pretty unhelpful suggestions. For Python, its suggestions are fine.

Swift is especially frustrating because it will hallucinate the method name and/or the argument names (since you often have to specify the argument names when calling a method).

guappa · 2024-11-04T10:41:36 1730716896

Ah I've had it hallucinate non-existing methods in python rather often.

Or when I say I need to do something, it invents a library that conveniently happens to just do that thing and writes code to import and use it. Except there's no such library of course.

0points · 2024-10-31T07:43:16 1730360596

No, not at all.

"classic" intellisense is reliable, so why introduce random source in the process?

4lb0 · 2024-10-31T15:08:41 1730387321

I use Codeium in NeoVim and yes I find it very helpful. Of course, it is not 100% error free, but even when it has errors, most of the time it is easier for me to fix them than to write it from scratch.

sharpy · 2024-10-31T05:38:05 1730353085

Often yes. There were times when writing unit tests was just me naming the test case, with 99% of the test code auto-generated from the existing code and the name.

simne · 2024-10-31T10:56:50 1730372210

Looks like the model is not trained well. In my experience, after making a few projects (2 seems to be enough), even an older Xcode managed to give good suggestions in well over 50% of cases.

karmasimida · 2024-10-31T08:13:08 1730362388

It is useful in our use case.

Realtime tab completion is good at some really mundane things within the current file.

You still need a chat model, like Claude 3.5, to do more exploratory things.

DecoySalamander · 2024-10-31T14:05:37 1730383537

I was evaluating it for a month and caught myself regularly switching to an IDE with non-AI intellisense because I wanted code that actually works.

mdavid626 · 2024-10-31T07:52:37 1730361157

No, not at all. It’s just the hype. It doesn’t replace engineering.

saagarjha · 2024-10-31T07:39:48 1730360388

The one Xcode has is particularly bad, unfortunately.

myworkinisgood · 2024-10-31T09:21:43 1730366503

Copilot is very good.

cryptica · 2024-10-31T06:43:46 1730357026

This is my experience as well. LLMs are great to boost productivity, especially in the hands of senior engineers who have a deep understanding of what they're doing because they know what questions to ask, they know when it's safe to use AI-generated code and they know what issues to look for.

In the hands of a junior, AI can create a false sense of confidence and it acts as a technical debt and security flaw multiplier.

We should bring back the title "Software engineer" instead of "Software developer." Many people from other engineering professions look down on software engineers as "Not real engineers" but that's because they have the same perspective on coding as typical management types have. They think all code is equal, it's unavoidable spaghetti. They think software design and architecture doesn't matter.

The problems a software engineer faces when building a software system are the same kinds of problems that a mechanical or electrical engineer faces when building any engine or system. It's about weighing up trade-offs and making a large number of nuanced technical decisions to ultimately meet operational requirements in the most efficient, cost-effective way possible.

alxjrvs · 2024-10-31T04:05:20 1730347520

In my day to day, this still remains the main way I interact with AI coding tools.

I regularly describe it as "The best snippet tool I've ever used (because it plays horseshoes)".

tomcam · 2024-10-31T05:19:49 1730351989

Horseshoes? As in “close enough”?

ttul · 2024-10-31T06:17:08 1730355428

Or, as in, “Ouch, man! You hit my foot!”

goykasi · 2024-10-31T06:21:15 1730355675

As long as hand grenades aren't introduced, I could live with that.

DanHulton · 2024-10-31T06:49:49 1730357389

Honestly, I don't think "close only counts in horseshoes, hand grenades, and production code" will ever catch on...

alxjrvs · 2024-10-31T15:17:34 1730387854

This is why I frame it as a "snippets" plugin, rather than a Code generation tool.

I would be very confused if someone told me that they uncritically used the generated code from a snippet program with no manual input or understanding, and I feel the same with Copilot. At best, it suggests an auto-complete that I read and interpret before accepting.

The closest I come to "code generation" is during test writing, where occasionally I will let the description generate some setup, but only in tests where there are a broad number of examples to follow, and I am still going to end up re-writing a decent chunk of it based on personal example. I would not "let it write the test suite for me" and then trust the green, and I suspect that would easily fail code review (though it would be an interesting experiment...).

Obviously your comment was a good goof and well made, but it does speak to a little bit of the disconnect between what is being touted as an "AI coding tool" and how I, a person who makes react native apps to pay my rent, actually use the dang thing (i.e., "A pretty good snippets plugin"). Is my code 'AI generated'? I wouldn't call it that, but who can say definitively? We're in a fun new semantic world now.

davedx · 2024-10-31T08:31:56 1730363516

I'm working on a CRM with a flexible data model, and ChatGPT has written most of the code. I don't use the IDE integrations because I find them too "low level" - I work with GPT more in a sort of "pair programming" session: I give it high level, focused tasks with bits of low level detail if necessary; I paste code back and forth; and I let it develop new features or do refactorings.

This workflow is not perfect but I am definitely building out all the core features way faster than if I wrote the code myself, and the code is in quite a good state. Quite often I do some bits of cleanup, refactorings, making sure typings are complete myself, then update ChatGPT with what the code now looks like.

I think what people miss is there are dozens of different ways to apply AI to your day-to-day as a software engineer. It also helps with thinking things through, architecture, describing best practices.

littlestymaar · 2024-10-31T08:52:26 1730364746

I share your sentiment. I've written three apps where I've used language models extensively (a different one for each: ChatGPT, Mixtral and Llama-70B), and while I agree that they were immensely helpful in terms of velocity, there are a bunch of caveats:

- it only works well when you write code from scratch; context length is too short to be really helpful for working on an existing codebase.

- the output code is pretty much always broken in some way, and you need to be accustomed to doing code reviews to use them effectively. If you trusted the output and had to debug it later, it would be a painfully slow process.

Also, I didn't really notice a significant difference in code quality: even the best model (GPT-4) writes code that doesn't work, and I find it much more efficient to use open models on Groq due to the really fast inference. Looking at ChatGPT slowly typing is really annoying (I didn't test o1 and I have no interest in doing so because of its very low throughput).

davedx · 2024-10-31T09:16:23 1730366183

> context length is too short to be really helpful for working on an existing codebase.

This is kind of true: my approach is I spend a fairly large amount of time copy-pasting code from relevant modules back and forth into ChatGPT so it has enough context to make the correct changes. Most changes I need to make don't need more than 2-3 modules though.

> the output code is pretty much always broken in some way, and you need to be accustomed to doing code reviews to use them effectively.

I think this really depends on what you're building. Making a CRM is a very well-trodden path, so I think that helps? But even when it came to asking ChatGPT to design and implement a flexible data model, it did a very good job. Most of the code it's written has worked well. I'd say maybe 60-70% of the code it writes I don't have to touch at all.

The slow typing is definitely a hindrance! Sometimes when it's a big change I lose focus and alt-tab away, like I used to do when building large C++ codebases or waiting for big test suites to run. So that aspect saps productivity. Conversely though I don't want to use a faster model that might give me inferior results.

littlestymaar · 2024-10-31T09:57:34 1730368654

> approach is I spend a fairly large amount of time copy-pasting code from relevant modules back and forth into ChatGPT

It can work, but what a terrible developer experience.

> I'd say maybe 60-70% of the code it writes I don't have to touch at all

I used it to write web apps, so the ratio was even higher I'd say (maybe 80-90% of the code didn't need any modification), but the app itself wouldn't work at all if I didn't make those 10% changes. And you really need to read 100% of the code because you won't know upfront where those 10% will be.

> The slow typing is definitely a hindrance! Sometimes when it's a big change I lose focus and alt-tab away, like I used to do when building large C++ codebases or waiting for big test suites to run.

Yeah exactly, it's xkcd 303 but with “AI processing the response” instead of “compiling”. Having instant responses was a game changer for me in terms of focus, hence productivity.

> I don't want to use a faster model that might give me inferior results

As I said earlier, I didn't really feel the difference in quality so the switch was without drawbacks.

chrisjj · 2024-10-31T09:52:08 1730368328

> I'd say maybe 60-70% of the code it writes I don't have to touch at all.

...yet. Bugs can take time to surface.

michaelteter · 2024-10-31T10:29:16 1730370556

And this is equally true whether the code was entirely written by a human or not.

chrisjj · 2024-11-04T18:28:03 1730744883

... except "not" delivers this "the output code is pretty much always broken in some way".

creesch · 2024-10-31T10:41:40 1730371300

> Also, I didn't really notice a significant difference in code quality: even the best model (GPT-4) writes code that doesn't work,

Interesting, personally I have noticed a difference. Mostly in how well the models pick up small details and context. Although I do have to agree that the open Llama models are generally fairly serviceable.

Recently I have tended to lean towards Claude Sonnet 3.5 as it seems slightly better. Although that does differ per language as well.

As far as them being slow, I haven't really noticed a difference. I use them mostly through the API with Open WebUI and the answers come quickly enough.