The art of programming and why I won't use LLM

kqr · 2024-08-25T18:17:28 1724609848

> it is sad to me just how much people are trying to automate away programming and delegating it to a black box

I take it you're not using a compiler to generate machine code, then?

Scratch that, I guess you're not using a modern microprocessor to generate microcode from a higher-level instruction set either?

Wait, real programm^Wartists use a magnetised needle and a steady hand.

Programming has always been about finding the next black box that is both powerful and flexible. You might be happy with the level of abstraction you have settled on, but it's just as arbitrary as any other level.

Even the Apollo spacecraft programmers at MIT had a black box: they offloaded the weaving of core rope memory to other people. Programming is not necessarily about manually doing the repetitive stuff. In some sense, I'd argue that's antithetical to programming -- even if it makes you feel artistic!

bayindirh · 2024-08-25T21:18:19 1724620699

The thing is, all these stacks are built by people and verified against specifications. When they failed to perform the way they should, we fixed them.

Plus, all the parts are deterministic in this stack. Their behavior is fixed for a given input, and all the parts are interpretable, readable, verifiable and observable.

LLMs are none of that. They are stochastic probability machines, which are nondeterministic. We can't guarantee their output's correctness, and we can't fix them to guarantee correct output. They are built on tons of (unethically sourced) data, which has no correctness and quality guarantees.

Some people will love LLMs, and/or see programming as a task/burden they have to complete. Some of us love programming for the sake of it, and earn money by doing it that way, too.

So putting LLMs to the same bucket with a deterministic, task specific programming tool is both wrong, and disservice to both.

I'm also strongly against LLMs, not because of the tech, but because of how they are trained and how their shortcomings are hid and they're put forward as the "oh the savior of the woeful masses, and the silver bullet of all thy problems", and it's neither of them.

LLMs are just glorified tech demos which shows what stochastic parrots can pose as accomplishing when you feed the whole world to them.

jinen83 · 2024-08-26T04:16:25 1724645785

I would argue thats an LLM spec --> Generate probabilistic output with a degree of confidence on the output nearing p(1). IMHO End users are supposed to not take the output of these machines as is but rather iterate on top and finish their task in lesser time.

red75prime · 2024-08-25T21:40:19 1724622019

> We can't guarantee their output's correctness

We can (for programming, at least): run the output thru theorem prover, ensure that proof is constructive, the Curry-Howard correspondence guarantees that you can turn the output into a correct program. It doesn't guarantee that formal properties of the program correspond to the informal problem statement. But even people occasionally make such errors (a provably correct program doesn't do what we wanted it to do).

> and we can't fix them to guarantee correct output

Same thing with other systems capable of programming. That is people.

You just can't make a system that guarantees correct transformation from an informal problem statement into a formally correct implementation. "Informal" implies that there's wiggle room for interpretation.

No, it doesn't mean that current LLMs are ready to replace programmers, it also doesn't mean that ML models of 2030s will not be able to.

bamboozled · 2024-08-25T22:05:36 1724623536

This doesn’t mean the LLM is deterministic. It means it can spoof the right answer give enough time.

red75prime · 2024-08-25T22:21:29 1724624489

Indeterminism is not always bad. A probabilistic Turing machine is more powerful than a Turing machine, for example (BPP complexity class is a superset of P).

bamboozled · 2024-08-26T14:38:49 1724683129

Yeah, didn't mean to imply something was bad or good here, just that it's not a purely deterministic thing.

YeGoblynQueenne · 2024-08-26T11:34:56 1724672096

>> We can (for programming, at least): run the output thru theorem prover, ensure that proof is constructive, the Curry-Howard correspondence guarantees that you can turn the output into a correct program. It doesn't guarantee that formal properties of the program correspond to the informal problem statement. But even people occasionally make such errors (a provably correct program doesn't do what we wanted it to do).

That sounds very ambitious. Automated theorem provers are real sticklers for complete specifications in a formal language and can't parse natural language at all, but when you generate code with an LLM all you have in terms of a specification is a natural language prompt (that's your "informal problem statement"). In that case what exactly is the prover going to prove? Not the natural language prompt it can't parse!

The best you can do if you start with a natural language specification, like an LLM prompt, is to verify that the generated program compiles, i.e. that it is correct syntactically. As to semantic correctness, there, you're on your own.

Edit: I'm not really sure whether you're talking about syntactic or semantic correctness after all. Which one do you mean?

>> You just can't make a system that guarantees correct transformation from an informal problem statement into a formally correct implementation. "Informal" implies that there's wiggle room for interpretation.

Note that in program synthesis we usually make a distinction between complete and incomplete specifications ("problem statements") not formal and informal. An incomplete specification may still be given in a formal language. And, for the record, yes, you can make a system that guarantees that an output program is formally consistent with an incomplete specification. There exist systems like that already. You can find a bit about this online if you search for "inductive program synthesis" but the subject is spread over a wide literature spanning many fields so it's not easy to get a clear idea about it. But, in general, it works and there are approaches that give you strong theoretical guarantees of semantic correctness.

red75prime · 2024-08-26T13:49:54 1724680194

> Which one do you mean?

Ah, I said "theorem prover", I should have said "proof verifier". What I meant is something like DeepMind's AlphaProof with an additional step of generating a formal specification from a natural language description of the problem. In this way we get a semantically correct program wrt the formal specification. But with current generation of LLMs we probably won't get anything for non-trivial problems (the LLM won't be able to generate a valid proof).

> Note that in program synthesis we usually make a distinction between complete and incomplete specifications

Program synthesis begins after you can coherently express an idea of what you want to do. And getting to this point might involve a ton of reasoning that will not go into a program synthesis pipeline. That's what I mean when I say an "informal problem statement": some brain dumps of half-baked ideas that doesn't even constitute an incomplete specification because they are self-contradictory (but you haven't noticed it yet).

LLMs can help here by trying to generate some specification based on a brain dump.

YeGoblynQueenne · 2024-08-26T21:43:20 1724708600

>> What I meant is something like DeepMind's AlphaProof with an additional step of generating a formal specification from a natural language description of the problem.

That's even more ambitious. From DeepMind's post on AlphaProof:

First, the problems were manually translated into formal mathematical language for our systems to understand.

https://deepmind.google/discover/blog/ai-solves-imo-problems...

DeepMind had to resort to this manual translation because LLMs are not reliable enough, and natural language is not precise enough, to declare a complete specification of a formal statement, like a program or a mathematical problem (as in AlphaProof) at least not easily.

I think you point that out in the rest of your comment but you say "the LLM won't be able to generate a valid proof" where I think you meant to say "a valid specification". Did I misunderstand?

>> Program synthesis begins after you can coherently express an idea of what you want to do.

That's not exactly right. There are two kinds of program synthesis. Deductive program synthesis is when you have a complete specification in a formal language and you basically translate it to another language, just like with a compiler. That's when you "coherently express an idea of what to do". Inductive program synthesis is when you have an incomplete specification, consisting of examples of program behaviour, usually in the form of example pairs of the inputs and outputs of the target program, but sometimes program traces (like debug logs), abstract syntax trees, program schemas (a kind of rich program template) etc.

Input-output examples are the simplest case. Today, if you can express your problem in terms of input-output examples there are approaches that can synthesize a program that is consistent with the examples. You don't even need to know how to write that program yourself.

red75prime · 2024-08-27T08:19:05 1724746745

> where I think you meant to say "a valid specification". Did I misunderstand?

What do you mean when you say "a valid specification"? There are known algorithms to check validity of a proof. How do you check that specification is valid? People are inspecting it and agree that "yes, it seems to be expressing what was intended to be expressed in the natural language", or, "no, this turn of phrase needs to be understood in a different way" or some such. Today there's no other system that can handle this kind of a task besides humans (who are fallible) and LLMs (that are much more fallible).

That is deciding that specification is valid cannot be done without human involvement. I left that part out and focused on what we can mechanistically check (that is validity of a proof).

So, no, I didn't mean "a valid specification". And, yes, I don't think that today's LLMs would be good at producing specifications that would be deemed valid by a consensus of experts.

> Today, if you can express your problem in terms of input-output examples there are approaches that can synthesize a program that is consistent with the examples

In a limited domain with agreed-upon rules of generalization? Sure. In general? No way. The problem of generalizing from a limited number of examples with no additional restrictions is ill-defined.

And the problem "generalize as an expert would do" is in the domain of AI.

QuantumGood · 2024-08-26T07:11:54 1724656314

Curry–Howard correspondence = direct relationship between computer programs and mathematical proofs

malux85 · 2024-08-25T23:11:05 1724627465

> "oh the savior of the woeful masses, and the silver bullet of all thy problems"

Who said that? Everyone I've talked to warns about their shortcomings (including their creators) and even the platform where I use them has a warning plastered right under the input box saying "ChatGPT can make mistakes. Check important info."

skybrian · 2024-08-25T22:25:15 1724624715

Yep. LLM’s don’t guarantee any interesting mathematical properties. That’s up to you.

This why you should write good tests, review the code, and don’t approve anything you don’t understand.

It’s not a reason to reject pretty good autocomplete, though.

YeGoblynQueenne · 2024-08-25T19:03:46 1724612626

>> I take it you're not using a compiler to generate machine code, then?

The dismissive glibness of your comment makes me wonder if it's worth it trying to point out the obvious error in the analogy you're making. Compilers translate, LLMs generate. They are two completely different things.

When you write a program in a high-level language and pass it to a compiler, the compiler translates your program to machine code, yes. But when you prompt an LLM to generate code, what are you translating? You can pretend that you are "translating natural language to code" but LLMs are not translators, they're generators, and what you're really doing is providing a prefix for the generated string. You can generate strings form an LLM with an empty prefix; but try asking a compiler to compile an empty program.

>> Even the Apollo spacecraft programmers at MIT had a black box: they offloaded the weaving of core rope memory to other people.

You're referring to core rope memory:

https://en.wikipedia.org/wiki/Core_rope_memory

There is no "black box" here. Programmers created the program and handed it over to others to code it up. That's like hiring someone to type your code for you at a keyboard, following your instructions to do so. You have to stretch things very far to see this as anything like compilation.

Also, really, compilers are not black boxes. Just because most people treat them as a scary unknowable thing doesn't mean that's what they are. LLms are "black boxes" because no matter how much we peer at their weights, arrays of numerical values, there's nothing we can ... er ... glean from them. They're incomprehensible to humans. Not so the code of a compiler. Even raw binary is comprehensible, with some experience.

CJefferson · 2024-08-25T19:09:38 1724612978

I recently had used an LLM to convert a lot of Python to Rust. It got it 99% right, and it took me a short while to fix the compile-time errors, and carefully check the tests weren't broken (as I trusted the code worked when the tests passed).

Is that "compiling" or "translating"? Lots of people use language to C "compilers".

shepherdjerred · 2024-08-25T20:53:53 1724619233

Compilation is generally deterministic with strict semantics.

LLMs are great, but they are the opposite of a compiler (in a good way)

YeGoblynQueenne · 2024-08-26T10:24:02 1724667842

Translating Python to Rust is translation, prompting an LLM for new Python or Rust code is generation.

creesch · 2024-08-25T19:11:20 1724613080

Both of them are black boxes though by the author's own definition.

YeGoblynQueenne · 2024-08-26T10:24:25 1724667865

Which definition do you mean?

szundi · 2024-08-25T21:36:04 1724621764

Exactly

elric · 2024-08-25T19:09:32 1724612972

I get what you're trying to say, but I don't entirely agree. Raising levels of abstraction is generally a good thing. But up until now, those have mostly been deterministic. We can be mostly confident that the compiler will generate correct machine code based on correct source code. We can be mostly confident that the magnetised needle does the right thing.

I don't think this is true for LLMs. Their output is not deterministic (up for discussion). Their weights and the sources thereof are mostly unknown to us. We cannot really be confident that an LLM will produce correct output based on correct input.

Wowfunhappy · 2024-08-25T20:41:06 1724618466

I agree with you but I want to try to define the language better.

It's not that LLMs aren't deterministic, because neither are many compilers.

It's also not that LLMs produce incorrect output, because compilers do that to, sometimes.

But when a compiler produces the wrong output, it's because either (1) there's a logic error in my code, or (2) there's a logic error in the compiler†, and I can drill down and figure out what's going on (or enlist someone to help me) to fix the problem.

Let's say I tell an LLM to write a algorithm, and it produces broken code. Why didn't my prompt work? How do I fix it? Can anyone ever actually know? And what did I learn from the experience?

---

† Or I guess there could be a hardware bug. Whatever. I'm going to blame the compiler because it needs to produce bytes that work on my silicon regardless of whether the silicon makes sense.

shafyy · 2024-08-25T21:19:18 1724620758

Compilers are deterministic

codebje · 2024-08-25T22:09:51 1724623791

This is in general only true for either trivial toy compilers or ones which have gone to lengths to have reproducible builds. GCC for instance uses a randomised branch prediction model in some circumstances.

shafyy · 2024-08-27T21:09:14 1724792954

Ok, but my understanding is that that they are mostly deterministic. And that there are initiatives like Reproducible Builds (https://reproducible-builds.org) that try to move even more in that direction.

Wowfunhappy · 2024-08-28T00:41:00 1724805660

But what does "mostly" mean? You can compile the same code twice and literally get two different binaries. The bits don't match.

Sure, those collections of bits tend to do exactly the same thing when executed, but that's is in some sense a subjective evaluation.

---

Szundi said in a sibling comment that I was "completely [missing] the point on purpose" by bringing up compiler determinism. I think that's fair, but it's also why I opened my post by saying "I agree [with the parent], but I want to try to define the language better." Most compilers in use today are literally not deterministic, but they are deterministic in a different sense, which is useful as a comparison point to LLMs. Well, which sense? What is the fundamental quality that makes a compiler more predictable?

I'd like to try to find the correct words, because I don't think we have them yet.

shafyy · 2024-09-04T17:12:30 1725469950

I'm not an compiler expert, not by far. But my understanding is that if you compile the same code on the same machine for the same target, you'll get the same bits. Only minor things like timestamps that are sometimes introduced might differ. In this sense, maybe they are not deterministic. But I think it's fair to classify them as "determinstic" compared to LLMs.

szundi · 2024-08-25T21:31:50 1724621510

Arguing with compilers about LLM determinism is not really adequate as an analogy, completely misses the point on purpose

mxkopy · 2024-08-25T19:39:11 1724614751

I’d say it’s not only determinism, but also the social contract that’s missing.

When I’m calling ‘getFirstChar’ from a library, me and the author have a good understanding of what the function does based on a shared context of common solutions in the domain we’re working in.

When you ask ChatGPT to write a function that does the same, your social contract is between you and untold billions of documents that you hope the algorithm weights correctly according to your prompt (we should probably avoid programming by hope).

You could probably get around this by training on your codebase as the corpus, but until we answer all the questions about what that entails it remains, well, questionable.

LoganDark · 2024-08-25T20:32:34 1724617954

> we should probably avoid programming by hope

I use Cursor at work, which is basically VSCode + LLM for code generation. It's a guess and check, basically. Plenty of people look up StackOverflow answers to their problem, then verify that the answer does what they want. (Some people don't verify but those people are probably not good programmers I guess.) Well, sometimes I get the LLM to complete something, then verify the code is completed is what I would have written (and correct it if not). This saves time/typing for me in the long run even if I have to correct it at times. And I don't see anything wrong with this. I'm not programming by hope, I'm just saving time.

layer8 · 2024-08-25T21:10:54 1724620254

This increases the time you spend proofing other’s work (tedious) versus time you spend developing a solution in code (fun). Also, if the LLM output is correct 95% of the time, one tends to get more sloppy with the checking, as it will feel unnecessary most of the time.

LoganDark · 2024-08-26T01:58:30 1724637510

> This increases the time you spend proofing other’s work (tedious) versus time you spend developing a solution in code (fun).

I find that I don't use it as much for generating code as I do for automating tedious operations. For example, moving a bunch of repeating-yourself into a function, then converting the repeating blocks into function calls. The LLM's really good at doing that quickly without requiring me to perform dozens of copy-paste operations, or a bunch of multi-cursor-fu.

Also, I don't use it to generate large blocks of code or complicated logic.

szundi · 2024-08-25T21:35:43 1724621743

Just what I was thinking about lately, what if LLMs are not 95% precise, but 99,95%. After like 50-100 checks you find nothing, and you just dump the whole project to be implemented - and there come the bugs.

However ... your colleagues just do the same.

We'll see how this unfolds. As for now the industry seems to be a bit stuck at this level. Big models too expensive to train for marginal gains, smaller are getting better but doesn't help this. Until some one new idea comes in how LLMs should work, we won't see the 99.95% anyway.

bubaumba · 2024-08-26T03:39:23 1724643563

one idea is obvious: multi-model approach. it partially done today for safety checks. the same can be done for correctness. one model produces result, different model only checks the correctness. optionally several results, second model checks correctness and selects the best. this is more expensive, but should give better final output. not sure, this may have been already done.

layer8 · 2024-08-25T22:20:32 1724624432

Yeah, I’m more worried about the middle ground that would make software quality (even) worse than it is today.

bubaumba · 2024-08-26T03:24:07 1724642647

> We can be mostly confident that the compiler will generate correct machine code based on correct source code.

Recently got email about gcc 14.2, they fixed some bugs in it. Can we trust it now, these could be the last bugs. But before that it was probably a bad idea to trust. No, even compiler's output requires extensive testing. Usually it's done at once, just final result of coding and compilation.

> Their output is not deterministic

yes.

> Their weights and the sources thereof are mostly unknown to us

Some of them are known. Does it make you feel better. There are too many weights, so you are not able to track its 'thinking' anyway. There are some tools which sort of show something. Still doesn't help much.

> We cannot really be confident that an LLM will produce correct output based on correct input

No, we can't. But it's so useful when it works. I'm using it regularly for small utilities and fun pictures. Even though it can give outright wrong answers for relatively simple math questions. With explanations and full confidence.

krageon · 2024-08-25T19:31:08 1724614268

For the average programmer the infinite layers of abstraction, libraries and middleware isn't deterministic either. The LLMs actually honest to god being probabilistic estimators doesn't change anything about what they produce or how they see their own stuff.

nsonha · 2024-08-26T01:05:06 1724634306

> We cannot really be confident that an LLM will produce correct output based on correct input.

There are 2 things at play here, one is LLM with human in the loop, in which it's just a tool for programmers to do the same thing they have been doing, and the other is LLM as black box automaton. For the former, it's not a problem that the tool is undeterministic, we are double checking the results and add our manual labour anyway. The fact that a tool can fail sometimes is an unsurprising fact of engineering.

I think the criticism in this chain of comment applies more to the latter, but even it always has values to non-tech people, just like how no-code approaches are, however shitty it looks to us software enfineers.

elric · 2024-08-26T07:44:44 1724658284

I don't know. Programming with an LLM turns every line of code into legacy code that you have to maintain and debug and don't fully grok because you didn't write it yourself.

nsonha · 2024-08-26T09:30:04 1724664604

If it's in your PR then you wrote it, no one should be approving code they do not understand whether that's from AI or googling. Nothing changes there.

quaintdev · 2024-08-25T19:17:35 1724613455

What if the we can't do abstraction anymore. I mean we certainly can but we will loose ability to configure tiny details of system.

So other path forward could very well be LLMs as they can save lot of time with writing boilerplate code

computerex · 2024-08-25T19:14:03 1724613243

So what if the output is stochastic? LLM's have self consistency, so you can repeat the inference several times and pick the most frequent output.

dazilcher · 2024-08-25T19:32:56 1724614376

Most frequent output does not imply correctness, LLMs often are confidently wrong.

They can't even perform basic arithmetic (which is not surprising since they operate at the syntactic level, oblivious to any semantic rules), yet people seem to think offloading more complex tasks with strict correctness requirements is a good idea. Boggles the mind tbh.

saulpw · 2024-08-25T19:18:00 1724613480

And you can pay in time and/or $$ for the privilege of having to do do this extra unnecessary work.

rstat1 · 2024-08-25T18:24:07 1724610247

Saying a compiler, and an "AI" hallucination model are even remotely the same is a pretty huge stretch.

lovethevoid · 2024-08-25T18:36:47 1724611007

They aren't saying they're the same, I'm not sure how you got that interpretation. It's very clear they're highlighting the hypocrisy that arises from claiming to be against automating away aspects of programming while relying on tools that do exactly that for you - only being Ok with it as long as they aren't called "AI".

advael · 2024-08-25T22:00:49 1724623249

The crux of why this is a bad analogy is that everyone talking about "automating" things with LLMs is misusing the word "automation". A machine can automate a repetitive manual task. A computer can automate the operation of machinery. A machine instruction set is an abstraction on top of circuitry that can automate the labor of extrapolating the logic physically executed by that circuitry into human-comprehensible routines. In the same way, a programming language implementation (e.g. a compiler) can somewhat automate programming (in the sense that it uses higher levels of abstraction to describe the same thing, saving labor while keeping determinism). What do these things have in common? We can reliably make them approach deterministic behavior. In the case of compilers, completely and reliably and transparently so. Just because you haven't bothered to read what a compiler is doing doesn't mean someone can't verify what it's doing. Physical machines are less reliable, but we have reliable ways to test them, reliable error margins, reliable failure modes, reliable variance. When you are on a stack of abstractions like a programming language on top of a compiler on top of transistors on top of a machine, an error at the top of that stack can have a lot of implications. A tool that probabilistically generates code is not automation. We have no guarantees about how and when it will get things wrong, how much this will happen, and what kinds of things it will get wrong. We have no way to audit their results that will generalize to every problem upstream of them. We have no way to reliably measure improvement in consistency, let alone improve that margin of error reliably. The entire idea that this is an automation at all is nonsense.

conradev · 2024-08-25T18:39:04 1724611144

They’re extremely different. But you can certainly use them together. Thus, new tools:

https://ai.meta.com/research/publications/meta-large-languag...

disconcision · 2024-08-25T18:35:14 1724610914

how do you figure? people were making directly analogous arguments about compilers back in the day. (not trying to argue that they are 'the same', but their is definitely a spectrum of code generation methods, with widely varying genres of guarantees, suiting a widely varying range of use cases)

trueismywork · 2024-08-25T18:27:28 1724610448

Well, AI models don't have to hallucinate.

KeplerBoy · 2024-08-25T18:32:20 1724610740

All current LLMs do though.

thaumasiotes · 2024-08-25T18:31:49 1724610709

True, but LLMs do.

yes_man · 2024-08-25T18:40:47 1724611247

I get the point that they are in different magnitudes of unknown but the analogy is still pretty good when it comes to the median programmer, who has no idea what goes on within either one. And if you argue that compilers are ultimately deterministic, that same argument technically holds for an LLM as well.

The biggest difference to me is that we have humans that claim they can explain why compilers work the way they do. But I might as well trust someone who says the same about LLMs, because honestly I have no way to verify if they speak the truth. So I am already offloading a lot of burden of proof about the systems I work on to others. And why does this ”other” need to be a human.

milemi · 2024-08-25T19:03:54 1724612634

This is like saying “I don’t understand how airplanes fly, so I’ll happily board an airplane designed by an LLM. The reality is determined by how much I know about it.”

yes_man · 2024-08-25T19:59:43 1724615983

No, the other way around. I am saying it is not a smart take to say ”a safe airplane cannot be built if LLMs were used in the process in any way, because reasons”. The safety of the airplane (or more generally the outcome of any venture) can be measured in other ways than leaning on some rule that you cannot use an LLM for help at any stage because they are not always correct

csallen · 2024-08-25T18:23:35 1724610215

Thank you for saying this. It's always baffled me that people will decry ChangeX as unnatural and wrong when it happens in their lifetime, but happily build their lives upon NearlyIdenticalChangeY so long as it came before them.

JimDabell · 2024-08-25T18:51:20 1724611880

“I've come up with a set of rules that describe our reactions to technologies:

1. Anything that is in the world when you’re born is normal and ordinary and is just a natural part of the way the world works.

2. Anything that's invented between when you’re fifteen and thirty-five is new and exciting and revolutionary and you can probably get a career in it.

3. Anything invented after you're thirty-five is against the natural order of things.”

― Douglas Adams, The Salmon of Doubt: Hitchhiking the Galaxy One Last Time

stavros · 2024-08-25T18:57:53 1724612273

This is so accurate I'm offended.

pottspotts · 2024-08-25T18:30:41 1724610641

We think of ourselves more intelligent than the generation before us, and more wise than the ones after.

danielbln · 2024-08-25T19:52:44 1724615564

Makes you wonder if that sort of dynamic between generations is ever going to be something that can be overcome. Maybe if humanity cures aging.

throwaway22032 · 2024-08-25T18:41:49 1724611309

I don't think that this is a fair comparison because at some point the nature of the craft actually does change.

To give an analogy, a carpenter might be happy with hand tools, happy with machine tools, happy with plywood, and happy with MDF. For routine jobs they may be happy to buy pre-fabbed cabinets.

But for them to employ an apprentice (AI in this example) and outsource work to them - suddenly they are no longer really acting as a carpenter, but a kind of project manager.

edit: I agree that LLMs in their current state don't really fundamentally change the game - the point I am trying to make is that it's completely understandable that everyone has their own "stop" point. Otherwise, we'd all live in IKEA mansions.

kqr · 2024-08-25T18:49:26 1724611766

Running state of the art LLMs for programming is nowhere near project management. At least in my experience, all LLMs are really good at is dumping plausible tokens quickly. They can't think, design, or handle tradeoffs intelligently.

They help me with the keyboard work, not any of the actual programming.

jclulow · 2024-08-25T18:52:45 1724611965

An apprentice is another person performing the same kind of work as the carpenter. That's fundamentally different from using an LLM, which is not a person and does not function like a person.

Whether you think LLMs are spectacularly worthwhile or odious and destructive, it's crucial not to classify them as being a person instead of a software tool.

fuzztester · 2024-08-25T19:12:59 1724613179

yep, I would call this the anthropomorphisation of llms. undesirable, just as any other kind of anthropomorphisation is.

kif · 2024-08-25T18:54:11 1724612051

This is a very poor analogy. It's not a matter of abstractions, it's a matter of getting someone or something else to do the work, while you mostly watch and fix any errors you're able to catch.

rodrigosetti · 2024-08-25T19:11:22 1724613082

This is a qualitatively different kind of abstraction. All other abstractions still require the programmer to express the solution in a formal language, while LLMs are allowing the user to express the solution in natural language. It's no longer programming, but much more like talking to a programmer as a manager.

milemi · 2024-08-25T18:28:52 1724610532

You can learn how compilers work and understand how they do what they do. Nobody understands what’s in those billions of parameters, and no one ever will.

fuzztester · 2024-08-25T19:08:03 1724612883

why are there so many parameters in the first place? and was it humans who generated so many? seems like a very big job for a human to do, or even a team of humans to do.

disclaimer: I know next to nothing about llms. and I'm not that interested to learn about them. just asking casually.

Jensson · 2024-08-25T20:00:20 1724616020

> why are there so many parameters in the first place?

Because parsing and writing human language in a natural way is extremely complex.

> and was it humans who generated so many?

No, it is generated using an algorithm that tries to predict the next word in human written text using the words that comes before it. It ingests basically all the text on the internet to do this, without that much text the LLM performs horribly.

jdiff · 2024-08-25T19:59:42 1724615982

They're not manually generated or anything, it's just a setting. Too few and the model doesn't have enough flexibility to capture complex patterns. Too many and the model can just memorize the data you train it on rather than capturing the patterns driving it.

eternauta3k · 2024-08-25T18:23:52 1724610232

There is a difference between those abstraction levels: with LLMs you are moving into the realm of management.

mypalmike · 2024-08-25T18:38:55 1724611135

That was precisely the goal of many high level languages.

layer8 · 2024-08-25T21:17:34 1724620654

…which all failed for that reason.

sifar · 2024-08-26T04:45:51 1724647551

The question is about the abstraction being understandable and predictable. All the examples you have follow that, LLMs throw that out of the window.

>> Scratch that, I guess you're not using a modern microprocessor to generate microcode from a higher-level instruction set either?

Hell, I design gate level logic -> map it to instructions -> use them in C for the very LLMs and can fully understand[0] every aspect of it (if it doesn't behave as expected, that is a bug) but I cannot fathom or predict how the LLMs behave when i use them even though I know their architecture and implementation.

[0] Admittedly I treat the tools I use during the process, like cad tools, compiler, as black boxes, however I know that if I want to or the need arises, I can debug/understand them.

Palomides · 2024-08-25T18:26:25 1724610385

there's a strict contract between the programmer and the compiler (the language spec)

MarkusQ · 2024-08-25T18:37:39 1724611059

Which would be relevant if either side respected it.

In practice, compilers frequently have bugs and programmers even more frequently make use of "what the compiler actually does" rather than adhering to the language specification -- to the point where the de facto spec for many languages is "what the canonical implementation does".

And the specifications change over time.

kibwen · 2024-08-25T18:43:16 1724611396

The frequency with which compiler implementations functionally diverge from language specifications is dwarfed, by many orders of magnitude, by the frequency with which LLMs generate provably nonsensical code in response to a prompt.

To wit, a compiler diverging from the specification is so relatively rare that people will get angry about it and demand that it be fixed, while an LLM spewing creative nonsense is so accepted and par for the course that complaining about that fact is met with a shrug and "well, what did you expect?"

layer8 · 2024-08-25T21:20:38 1724620838

LLMs don’t have a canonical output that could serve as a specification. And if they had, we wouldn’t consider that a satisfactory specification at all.

lagrange77 · 2024-08-25T19:58:55 1724615935

I guess we are currently in the special situation, that we as human programmers can understand the output of a coding LLM. That's because programming languages are designed to be human readable. And we had an incentive to learn those languages.

I imagine that machine learning powered coding will evolve to an even blacker box, than it is today: It will transform requirements to CPU instructions (or GPU instructions, netlists, ...). Why bother to follow those indirections, that are just convenience layers for those weak carbon units (urgh)?

Simultaneously, automation will likely lead to fewer skilled programmers in the future, because there will be fewer incentives to become one.

Together those effects could lead to a situation where we are condemned to just watch.

kalind · 2024-08-25T20:21:48 1724617308

So what do you do when the LLM creates a bug in a multi-million CPU instruction program it generated and you can't prompt it to fix the bug?

lagrange77 · 2024-08-25T20:35:05 1724618105

Yeah, good question. You could ask the same question for today's or near future coding LLMs used by non programmers.

amelius · 2024-08-25T20:02:07 1724616127

>> delegating it to a black box

> I take it you're not using a compiler to generate machine code, then?

An LLM is much, much closer to a "black box" than a compiler ...

In fact, an LLM is pretty much a black box even to the people who created it.

w0m · 2024-08-25T22:07:39 1724623659

Reminds me long ago of working with a guy that did everything in C when we were rewriting things in Perl . Yes, his stuff was faster. Yes, it was also buggier, harder to debug, and took 3x as long to write for similar levels of functionality (it wasn't speed dependent code by any stretch.)

limit499karma · 2024-08-26T12:28:50 1724675330

> black box

Actually the opacity of the abstraction layer is the core of the issue. First we note that opacity is a measure in both the inner-workings of the 'box' and orthogonally (in context of LLMs) a measure of deterministic outcome.

Programming, it is asserted by some of us, is exactly the act of instructing a deterministic 'black' box.

Peer-coding with an LLM is the act of cajoling a mechanism to hopefully consistently produce the input to a sensible "blackbox". It is not programming, it is getting help. Now if the help was 100% reliable, we could discuss programming the helper.

The other day I had a vision of the future AI whisperer in the corporate setting. They wear capes of varied colors and possibly sport a wand. "It's an art you see".

freetanga · 2024-08-25T19:49:50 1724615390

By that light, the artist that is painting using store-bought pigments instead of hand made, or brushing paints instead of reallocating molecules, or even better, atoms, is also using a black box.

I think OP has a point, and is about guiding the design, then overall structure and dynamic of the code. Nobody expects to write in Assembler or not using libraries, but making a concious decision on the design.

I have met few programmers, but many coders. For them coding is a job, and generally they don’t care about overall architecture, algorithm efficiency, and code elegance is limited to syntax-coloring-themes in their editor. I respect their existence, but generally they are building on top off somobody else’s effort.

mgaunard · 2024-08-25T21:01:38 1724619698

The transformations you're referring to are fully deterministic and guaranteed to be correct.

LLMs provide statistically probable answers with no guarantee of correctness. It takes more time to review LLM code than it takes to write it correctly from scratch.

pajeets · 2024-08-25T18:24:46 1724610286

We can argue the same thing for artists. Who wrote the algorithms for your favorite Photo/Image editor? Who created the image formats and standards, infrastructures for you to be able to push binary files to millions of people?

archagon · 2024-08-26T06:22:54 1724653374

I think it would be a mistake to classify an LLM as a level of abstraction. You cannot use it as a stable black-box foundation to build on top of.

toprerules · 2024-08-25T20:39:49 1724618389

Your argument is a false dichotomy and thus a logical fallacy. Not all steps forward in abstraction or code generation are necessary good steps and have to be considered on their own merits. If LLMs are indeed superior, then you should be able to articulate their merits without condescending to fallacious attacks.

discreteevent · 2024-08-25T18:56:36 1724612196

HN commenter: Samuel, why don't you use an LLM to write this play you are working on?

Beckett: What?

HN commenter: Well it's just like when you decided to work in French instead of English. Your art was no less because of it. Now you can use an LLM instead of French. It will be so much quicker.

habosa · 2024-08-25T18:24:14 1724610254

While the base of your argument is true, it’s also a bit dishonest. LLMs are significantly different than any of these other abstractions because they can’t be reasoned about or meaningfully examined/debugged. They’re also the first of these advances which anyone has claimed would eliminate the need for programmers at all. I don’t believe the C compiler was meant to do my whole job for me.

kqr · 2024-08-25T18:31:19 1724610679

Cobol and other early high-level languages were designed with the intention of allowing businesspeople to write their own programs so programmers wouldn't be needed. Some people really believed that!

evilduck · 2024-08-25T19:16:56 1724613416

SQL too.

lukeschlather · 2024-08-25T18:36:38 1724610998

I'd really like to have everything written in Rust, not C. Rust does a lot of verification, verification that is very hard to understand. I'd like to be able to specify a function with a bunch of invariants about the inputs and outputs and have a computer come up with some memory-safe code that satisfies all those invariants and is very optimized, and also have a list of alternative algorithms (maybe you discard this invariant and you can make it O(nLog(n)) instead of O(n^2), maybe you can make it linear in memory and constant in time or vice versa...)

Maybe you can't examine what the LLM is doing, but as things get more advanced we can generate code to do things, and also have it generate executable formal proofs that the code works as advertised.

monooso · 2024-08-25T19:16:28 1724613388

I agree with the second part of your argument, regarding the assertion that LLMs may eventually replace programmers.

However, I don't understand your claim that an LLM acting as a programming assistant "...can’t be reasoned about or meaningfully examined/debugged."

I type something, and Copilot or whatever generates code which I can then examine directly, and choose to accept or reject. That seems much easier to reason about than what's happening inside a compiler, for example.

trueismywork · 2024-08-25T18:28:27 1724610507

Current LLMs, we do not know that we cannot have future LLMs which can be almost formal. Think mathematics written in English and LaTeX

card_zero · 2024-08-25T18:37:26 1724611046

If using an LLM meant carefully crafting a complex, precise, formal prompt that specified only one possible output, I might be interested. But then I wonder if the prompt would be very much shorter.

Thinking about it, this depends on which differences we consider aspects of the output program, and which ones we consider trivial differences that don't count. If you say "build an RPG about dragons with a party of magic using heroes" and the LLM spits one out, you reached a level of abstraction where many choices relating to taste and feeling and atmosphere (and gameplay too) are waved aside as trivial details. You might extend the prompt to add a few more, but the whole point of creating a program this way is not to care about most of the details of the resulting experience. Those can be allowed to be generic and bland, right? Unless you care about leaving your personal touch on, say, all of them.

nyrikki · 2024-08-25T20:02:41 1724616161

Things constructivist mathematics cannot do:

1) Prove Addition of natural numbers.

2) Prove two real numbers are equal.

RNNs are only TC with infinite precision and unlimited resources, once you have finite precision they are very limited.

LLMs do not have recursion at all and can't even emulate finite automations. In fact soft attention can only emulate TC_0

Feed forward networks are effectively DAGs and with soft attention, DAGs built with AND,OR, NOT, and threshold circuits.

One of the state of the art inference in code methods is bi-abduction, probably best described here.

https://fbinfer.com/docs/separation-logic-and-bi-abduction/

But this localization makes it computationally possible, and has limits.

The qualification and frame problems, combined with the very limited computational power of transformers is another lens.

LLMs being formalized doesn't solve the problem. Fine tuning and RAG can help with domain specificity, but hallucinations are a fundamental feature of LLMs, not a bug.

Either a use case accepts the LLM failure mode (competent, confident, and inevitably wrong) or another model must be found.

Gödel showed us the limits of formalization, unless we find he was wrong, that won't change.

trueismywork · 2024-08-26T00:14:20 1724631260

Thanks for your insightful comment. I'll read the links later.

I had just assumed that RNNs were TC, didn't think of limitation put on by bounded precision since I assumed that any bounded precision could be compensated by growing memory module.

So, after your comment, I went literature searching and I found this: https://papers.nips.cc/paper_files/paper/2021/hash/ef452c63f...

I haven't read it yet. But if it is true, then RNN would seem to be TC

nyrikki · 2024-08-26T00:41:10 1724632870

Note from the open review of that paper.

> As discussed in the paper and pointed out by the reviewer, the growing memory module is non-differentiable, and so it cannot be trained directly by SGD. We acknowledge this observation.

Two stack FSA/RNN are interesting, but as of now, not usable in practice.

spencerchubb · 2024-08-25T18:27:57 1724610477

I don't buy that you're actually examining compiled programs. Very few people do. Theoretically you could, but the whole point of the compiler is to find optimizations that you wouldn't think of yourself.

kibwen · 2024-08-25T18:37:29 1724611049

The point of an optimizing compiler is to find optimizations which, crucially, are semantics-preserving. This is the contract that we have with compilers, is the reason that we trust them to transform our code, and is the reason why people get up in arms every time some C compiler starts leveraging undefined behavior in new and exciting ways.

We have no such contract with LLMs. The comparison to compilers is highly mistaken, and feels like how the cryptocurrency folks used to compare cryptocurrency to gestures vaguely "the internet" in an attempt to appropriate legitimacy.

bena · 2024-08-25T18:33:38 1724610818

No, the point of the compiler is to translate code into machine instruction.

Yes, it can optimize things for you, but that is not its purpose.

spencerchubb · 2024-08-26T01:25:49 1724635549

Okay I'll revise my statement

A big feature of compilers is to find optimizations you wouldn't think of. I tried to make the point that compiled output is typically not read by humans

Jensson · 2024-08-25T19:56:54 1724615814

> I don't buy that you're actually examining compiled programs. Very few people do

I take it you don't write C, C++, or any language at that level? It is very common to examine compiled programs to ensure the compiler made critical optimizations. I have done that many times, there are plenty of tools to help you do that.

DougN7 · 2024-08-25T18:30:40 1724610640

I think you’re assuming your reference is the correct one. I can’t reason about the assembly language that the compiler spits out, the microcode in the CPU kernel or any of the electronics on the motherboard. That anyone can or not doesn’t change things in my opinion. It’s an arbitrary distinction to say _this_ abstraction is uniquely different in this very specific way.

grey-area · 2024-08-25T18:33:30 1724610810

Compilers are deterministic.

LLMs are not.

yodon · 2024-08-25T18:38:00 1724611080

>Compilers are deterministic

They seem that way, until you're tasked with getting a repeatable, idempotent build out of a non-trivial build system.

marcinzm · 2024-08-25T18:39:54 1724611194

LLMs are deterministic if you force a seed or disable sampling. They however do not guarantee that small input changes will cause small output changes.

Jensson · 2024-08-25T20:01:13 1724616073

> LLMs are deterministic if you force a seed or disable sampling

Not with todays GPU's, you would need to run it locally with special GPU settings or run it on your CPU to ensure it is deterministic.

DougN7 · 2024-08-25T22:01:30 1724623290

So? Compilers compile and LLMs do not. Compilers use linkers and LLMs do not. Arbitrary distinctions don’t means “this time it’s different”.

grey-area · 2024-09-05T13:43:52 1725543832

So as the OP said, all the parts are deterministic in this stack. Their behavior is fixed for a given input, and all the parts are interpretable, readable, verifiable and observable.

This is entirely different from LLMs which are opaque even to their designers, and have unpredictable flaws and hallucinations, they are probability machines based on what data they have been exposed to, which means they are not a reliable way to generate programs.

Maybe one day we'll fix this, but the current generation is not very useful for programming because of this.

carapace · 2024-08-25T22:06:00 1724623560

This is completely wrong.

Compilers are complex programs fraught with bugs. Modern microprocessors are hideously complex devices fraught with bugs. But at least we understand them in principle and practice.

LLMs are nonsense generators, you need a second device that can recognize correct programs to use them effectively. Only humans can do that all-important second part.

> Programming has always been about finding the next black box that is both powerful and flexible.

That's the opposite of programming. Programming is the art and science of developing reliable algorithms. You can treat programs as black boxes only after you're sure that they work correctly. Otherwise you're just engaged in a kind of cargo cult.

That the scary thing about the LLM fad: so many people seem so willing to abdicate their responsibility to actually think.

minkles · 2024-08-25T18:24:25 1724610265

Um those black boxes have determinism.

kqr · 2024-08-25T18:26:30 1724610390

Not at all. See the cries for reproducible builds, exploits like Spectre etc.

fire_lake · 2024-08-25T18:37:58 1724611078

You are precisely right but practically wrong here

tpoacher · 2024-08-25T21:14:10 1724620450

I 100% agree with you. Whenever I see artists buy their brushes I cringe. Real artists don't draw anything until they've grown a tree and raised horses to obtain the raw materials (wood and horse hair) to make their first brush.

Using a bought brush to paint and generating a painting via a prompt are basically identical.

/s

jodhpurcapital · 2024-08-25T18:22:21 1724610141

Nailed it.

sweeter · 2024-08-25T18:54:53 1724612093

If LLMs were actually good for programming, I would consider it, but they just aren't. Especially when we are talking about "assistants" and stuff like that. I feel like I live in an alternate reality when it comes to the AI hype. I have to wonder if people are just that bad at programming or if they have a financial incentive here.

There are a handful of cases where LLMs are useful, mainly because Google is horrifically bad at bringing up useful search results, it can help in that regard... or when you can't find the right words to describe a problem.

What I would like to see out of an AI tool, is something that gobbles up the documentation to another programming tool or language, and spits it back out when it is relevant, or some context aware question and answers like "where in the code base does XYZ originate" or w/e. the difference is having a tool that assists me VS having a tool spit out a bunch of garbage code. Its the difference between using a tool, and being used by a tool

norir · 2024-08-25T20:24:33 1724617473

> I have to wonder if people are just that bad at programming or if they have a financial incentive here.

I have similar feelings to you, but I want to be careful about making assumptions. That being said, I see so many people making hyperbolic claims about the productivity gains of llms and a huge amount (though not all) of the time, they are doing low value work that betrays their inexperience and/or lack of ability.

I have yet to see a good example of where an llm invented a novel solution to an important problem in programming. Until that happens -- and I'm not saying it won't -- I remain extremely skeptical about the grandiose claims. This is particularly true of the companies selling llm products who make vague claims about productivity benefits. Who is more productive, the person who solves the most leet code problems in a month or the person who implements a new compiler in the same time frame? The former will almost surely have the most lines of code, but they have done nothing of direct value. I point this out because of how often productivity is measured in lines of code and/or time to complete a problem with a known solution.

So for me, when people brag about how much more productive they are with llms, I wonder, ok, well what are you building? I feel like llms are as likely going to make people build fragile bridges to nowhere at scale as anything truly revolutionary.

creesch · 2024-08-25T21:05:03 1724619903

I am not expecting novel solutions from LLMs. I purely use them as a tool in my tool belt.

Some examples:

Deciphering spaghetti code: LLMs generally are pretty good at picking apart code blocks and generally explaining the functional parts. A while ago I was dealing with code that had lots of methods on single lines with tons of conditions. I put in in chatGPT, asked it to go over it and it gave me a point by point explanation of all the logic in there. Again, I don't expect it to be perfect here, it doesn't need to be. The way my mind works once I have the explanation I can much easier go to the single line mess and follow it along. If chatGPT messed up I will see that, but I will also be much further along already with deciphering as I would have been doing it manually.

Getting a quick start on technology, specifically if it is something I know I will only need to know once it helps me avoid tedious google searches. Instead I get a pretty decent rundown of whatever it is I need to know as well as some basics.

In short, I don't they are miraculous technologies transforming my work. But, they are pretty good at removing some of the more tedious tasks letting me focus on other things. So they do make me more productive in that aspect.

koonsolo · 2024-08-26T07:32:57 1724657577

If you are an expert in your field and are working on the same project day in day out, I don't think LLM's will offer you much.

If you are a one man shop, where you have to work with: Javascript, Typescript, Haxe, PHP, bash, WordPress, BuddyPress, npm, Selenium, Playwright, Jest, Kha, Three.js, HTML, CSS, Bootstrap, Tailwind, Nextjs, SQL, ... . And that is all next to marketing, devops, managing freelancers, ... . Well, then an LLM is a super fast and super cheap junior of everything that can quickly create something you need in seconds.

Edit: Let me make it more concrete with an example of the previous days. I registered a new domain and wanted to already have a quick landing page on there that sends people to a certain url. It would already use Nextjs & tailwind so that I can test out the setup on the server.

So I wanted to generate this quick landing page. I have a few options:

1. Do it myself, which will take some time digging into the css to style things.

2. Hire a freelancer, which is more expensive, and would take up more time.

3. Let ChatGPT generate the initial version in 3 seconds, I can suggest some changes and get a full reply in 3 second.

Same for a lot of other things that I do. Will ChatGPT help me out in a big, complex application that I'm writing? Probably not. But it sure has its uses for a lot of small things.

gandalfgreybeer · 2024-08-26T07:09:45 1724656185

> an llm invented a novel solution to an important problem in programming.

This will likely not happen and is a horrible benchmark for productivity. The advantage of using LLMs as a partner for coding is that I myself will have more time to generate “novel solutions” since a lot of the low level stuff that I still need to write can be done in less than half the time.

I never trust it to write code I can’t write for myself and things I can’t make tests or verify.

It’s not about generating more lines of code but having more time to think of the more mentally demanding stuff. It’s like having a junior developer that can work really fast but I will still need to check what it does.

As a very personal benchmark, at work I know how much time it takes me to build repeating things from scratch that can’t be automated. By my estimate, having the LLM saves me around an hour or two of coding a day. That doesn’t replace my time but those 20-40 hours a month of time saved is worth the 20 bucks on average I pay for it.

Tldr; it won’t make anything novel but it gives me more time to make things that are

BeetleB · 2024-08-25T21:25:53 1724621153

There's a large continuum between great and crap, and it sounds like you've placed a rather high bar to even consider using it.

I don't like BASH scripting. I wanted to automate a certain task and dump it in a justfile for convenient reference.

Learning BASH scripting would be a poor use of my time - I didn't value the knowledge I would gain.

Using Google to piece together everything I needed would have been very painful. Painful enough that I simply didn't bother in the past.

Asking an LLM solved the problem for me. It took about 6 iterations, because I had somewhat underspecified and the scripts it returned, while correct, had side effects I didn't like.

But even though it took several iterations it was infinitely more satisfying than the other options. Every time it failed I would explain to it what went wrong and it would amend the script.

It's like having an employee do the work for me, but much much cheaper.

That's the power of LLMs. They enable me to do things that just weren't worth the time in the past.

Would I use it for my main programming work? No. But does it increase my productivity? Definitely.

sweeter · 2024-08-26T01:51:41 1724637101

That sounds awful to me. I spent maybe 1 or 2 days reading the woolidge bash guide, and the Dylan araps bash Bible and now I have that skill forever. Sure I spent more time practicing but I can craft exactly what I want without even thinking about it. I value that knowledge. Use shellcheck and the bash lsp and that's it.

But they way you talk about makes me feel weird. It honestly sounds a little insane.

BeetleB · 2024-08-26T05:14:57 1724649297

If we're going with anecdotes, I have learned Bash scripting and zsh scripting separately at different points in my life. Both times I forgot it very quickly. My guess is that you value that knowledge and that helps you retain it. Practice of course helps.

For me, my primary shell both at work and at home is xonsh, which is Python-based, and 90+% of all shell scripting I do is in Python in that shell. Having that knowledge of Bash is not even worth 2 days for me. If I were an embedded developer or a system administrator where I often have to SSH into accounts I don't control I could value Bash more but that's not the case for me and a fairly significant percentage of software engineers. Why spend a few days learning it when I don't need it? Even in the example above, I did it for my convenience, not because I needed it.

koonsolo · 2024-08-26T07:18:57 1724656737

What kind of memory do you have? After 6 months of not using it, I would already forget more than 50% of it.

port19 · 2024-08-26T07:44:24 1724658264

Yes, but then after two days it's back to 90% and the missing 10 are rarely important

kristiandupont · 2024-08-26T05:33:46 1724650426

>But they way you talk about makes me feel weird. It honestly sounds a little insane.

Pot, meet kettle.

sweeter · 2024-08-26T17:59:09 1724695149

What? For learning bash and valuing that skill? Lol

kristiandupont · 2024-08-27T18:04:49 1724781889

No, for calling someone who doesn't "insane".

sweeter · 2024-08-27T22:02:33 1724796153

That's not what I find insane though, nor is it what I said. The fact that this person is willing to sink in a ton of time iteratively coercing an LLM to spit out something that they themselves admitted had negative side effects, and didn't work that well, and describing it like hounding a personal employee in the name of "productivity" is like 7 layers of insane to me. All to avoid spending, probably less, time learning. Like I'm baffled at what motivates people who are like this. It's gotta be money right?

canada_dry · 2024-08-26T03:21:06 1724642466

This.

I'm a former/old-school professional programmer (now a retired CIO) and I still love automating my life with an assortment of tech (e.g. python, bash, c#, c++, jscript, etc). My knowledge of these tools isn't indepth... so being able to use AI to assist with generating the base code is a godsend.

My experience using chatgpt has been quite positive overall... sometimes it comes up with elegant solutions, sometimes its dog shit - but generally it's a useful starting point.

Arn_Thor · 2024-08-25T20:30:36 1724617836

I know jack squat about programming. I could at one point do “hello world” in Python, if I recall correctly. Thanks to ChatGPT I now have scripts to make my life easier in a bunch of ways and a growing high-level understanding of how they work. I can’t program, an I won’t claim to. But I can be useful. Thanks to LLMs. (And before the catastrophists arrive: I know enough to be wary of the risks of running scripts I don’t understand, which is why I make a point of understanding how they function before I proceed)

sweeter · 2024-08-26T02:12:52 1724638372

Thats great for sure, but I still think you should take the time to learn the basics. Also, go to the r/bash subreddit and search "chatGPT" and see just how many people end up there with:

"HELP! ChatGPT destroyed my system" most commonly people want a command that will move pictures or something and the LLM spits out something feasible, they turn it into a shortcut on their mac and click it, and it is literally just 'find -type f -exec mv {} ..' with relative paths and it moves all of their critical files into random places. It is quite literally something I've seen happen at least 5 times.

There is a lot of benefit in just learning a little bit and then having the flexibility to write anything you want.

Arn_Thor · 2024-08-26T04:46:22 1724647582

I agree I should learn to do it from scratch. And I’ve been given an added motivation to do so by seeing how useful just a few lines of code can be.

And you’re not wrong. I’ve learned the benefit of versioning when working with LLMs because if you’re not careful asking it to do one change can easily break something else. It’s far from a panacea

TrackerFF · 2024-08-25T22:32:38 1724625158

I'll counter you here.

I'm starting to think that the people that moan the most about LLMs being terrible, might just be terrible at writing good queries.

Like everything else: garbage in, garbage out.

EDIT: I was not aiming this comment directly at you. But I've had a couple of devs try to convince me that tools like ChatGPT or Claude is garbage, and then use extremely short queries as proof.

"Write me a website with [list of specs]", and then when it either fails or spits out half-baked results, they go "See? It's garbage!"

On the other hand I've seen non-coders create usable tools, by breaking up the problem and inputting good queries for each of those sub-tasks.

sweeter · 2024-08-26T01:43:36 1724636616

Could be. I try to be verbose, but it also gets annoying to do. I usually just use some really handy GitHub search tricks and learn from other people's implementations.

fuzztester · 2024-08-25T19:28:45 1724614125

>If LLMs were actually good for programming, I would consider it, but they just aren't. Especially when we are talking about "assistants" and stuff like that. I feel like I live in an alternate reality when it comes to the AI hype. I have to wonder if people are just that bad at programming or if they have a financial incentive here.

solid points there.

it is surely some of both reasons. for the bad programmers, it will be the former. for those invested in llms, it will be the latter, that is financial incentives - to the tune of billions or millions or close to millions, depending upon whether you are an investor in or founder of a top llm company, or are working in such a company, or in a non-top company. it's the next gold rush, obviously, after crypto and many others before. picks and shovels, anyone?

and, more so for those for whom there are financial incentives, they will strenuously deny your statements, with all kinds of hand waving, expressions of outrage, ridicule, diversionary statements, etc.

that's the way the world goes. not with a bang but a whimper. ;)

sorry, t.s. eliot.

https://en.m.wikipedia.org/wiki/The_Hollow_Men

andybak · 2024-08-25T19:33:58 1724614438

I'm not sure whether I'm a "good programmer" or a "bad programmer" but sometimes I just want a problem to go way in the quickest way possible.

I'm not always trying to create a timeless, perfect, jewel and there is a limit to how much I want to follow every highway and byway needed to do stuff across several dozen languages, libraries, platforms and frameworks.

Some days I'm just tired.

fuzztester · 2024-08-26T20:06:01 1724702761

>I'm not sure whether I'm a "good programmer" or a "bad programmer" but sometimes I just want a problem to go way in the quickest way possible.

True. Most programmers would think the same, at times.

>I'm not always trying to create a timeless, perfect, jewel

No one is, most of the time. Only, some people try to create somewhat good things some of the time, even given constraints.

>and there is a limit to how much I want to follow every highway and byway needed to do stuff across several dozen languages, libraries, platforms and frameworks.

Who has the time to do it, unless one is independently wealthy, so don't need to work, and is programming just for fun (although many of us do it for fun, part-time at least).

Yes, my sentiments exactly, and I am sure it's that of many other programmers, too.

The abstraction upon abstraction upon abstraction (Howdy, Java, but not only it) and the combinatorial explosion of technologies X their version(iti)s, is hell - like DLL hell on Windows, except much worse.

>Some days I'm just tired.

So yeah, I hear you, dude, and feel your pain.

But the topic and argument was about whether llms reduce that pain enough to be worthwhile. I guess the answer is: different strokes for different folks.

port19 · 2024-08-26T07:47:52 1724658472

The more I hang out in these places the more I believe the hypothesis that people are just bad at programming

dkersten · 2024-08-26T14:14:38 1724681678

I asked both ChatGPT 4o and Claude 3.5 Sonnet how many letters there are in the word strawberry and both answered “There are two r’s in the word strawberry”. When I asked “are you sure?” ChatGPT listed the letters one by one and then said yes, there are indeed two. Claude apologized for the mistake and said the correct answer is one.

If the LLM cannot even solve such a simple question, something a young child can do, and confidently gives you incorrect answers, then I’m not sure how someone could possibly trust it for complex tasks like programming.

I’ve used them both for programming and have had mixed results. The code is always mediocre at BEST but downright wrong and buggy at worst. You must review and understand everything it writes. Sometimes it’s worth iteratively getting it to generate stuff and you fix it or tell it what to fix, but often I’m far quicker just doing it myself.

That’s not to say that it isn’t useful. It’s great as a tool to augment learning from documentation. It’s great at making pros and cons lists. It’s great as a rubber duck. It can be helpful to set you on a path by giving some code snippets or examples. But the code it generates should NEVER be used verbatim without review and editing, at best it’s a throwaway proof of concept.

I find them useful, but the thoughts that people use them as an alternative to knowing how to program or thinking about the problem themselves, that scares me.

msp26 · 2024-08-26T15:40:23 1724686823

Would you ask a blind person to pick between two colours for you? LLMs do not see the individual letters in the word, they work with tokens.

mckirk · 2024-08-26T15:44:38 1724687078

Sorry, but this 'benchmark question' really isn't all that useful. Asking an LLM questions that can only be answered at the letter level is like asking somebody who is red-green colorblind questions that can only be answered at the red-green level. LLMs are trained by first splitting text into tokens that comprise multiple letters, they never 'see' individual letters.

The 'confidently answering with a wrong solution' aspect is of course still a valuable insight, and yes, you need to double-check any answer you've received from an LLM. But if you've never tried GitHub Copliot, I can recommend doing so. I'd be surprised if it doesn't manage to surprise you. For me it was actually really useful to get those parts of code out of the way that are essentially just an 'exercise in typing', once you've written a comment explaining the idea. (It's also very useful to have a shortcut to quickly turn off its completions, because otherwise you end up spending more time reading through its suggestions than actual coding, in situations where you know it won't come up with the right answer.)

dkersten · 2024-08-27T04:57:11 1724734631

When asked to prove it, it spelled out the letters one by one, and still failed (ChatGPT asserted the answer is still 2, Claude “corrected” itself to 1). Only when forcing it to place a count beside each letter did it get it correct.

It’s not really about the specific question, that just highlights that it does not have the ability to comprehend and reason. It’s a prediction machine.

If it cannot decompose such a simple problem, then how can it possibly get complex programming problems that cannot be simply pattern matched to a solution correct? My experience with ChatGPT, Claude, and copilot writing code demonstrates this. It often generates code that on the surface level looks correct, but when tested it either fails outright or subtly fails.

Even things like CSS it gets wrong, producing output that on the surface seems to do what you asked but in fact doesn’t actually style it correctly at all.

Its lack of ability to understand, decompose, and reason is the problem. The fact that it’s so confident even when wrong is the problem. The fact that it cannot detect when it doesn’t know is the problem.

It generates text that has high probability of “looking” correct, not text that has a high probability of being correct. With simple questions like the one I posed, it’s obvious to us when it gets it wrong. With complex programming tasks, the solution is complex enough that it often takes significant effort to determine if it’s correct or wrong. There’s more room for it to “look” correct without “being” correct.

> But if you've never tried GitHub Copliot

I’ve used it for almost a year before I cancelled my subscription because it wasn’t adding much value. I found copilot chat a bit more useful, but ChatGPT was good enough for that. I still use ChatGPT when programming: as a tool to help with documentation (what’s the react function to do X, type questions), to rubber duck, to ask for pros and cons lists on ideas or approaches, and to get starting points. But never to write the code for me, at least not without the expectation of significant rewriting, unless it’s super trivial (but then I likely would have written it faster myself anyway).

Basically, I use it like this person does: https://news.ycombinator.com/item?id=41350207

mckirk · 2024-08-28T10:42:46 1724841766

Thanks for taking the time to answer so thoroughly :)

In that case I stand corrected, I'd just assumed you hadn't used Copilot because, to me, it was so more effective at aiding with programming that ChatGPT. But I suspect that very much depends on the use-case. I liked it a lot for e.g. writing numpy code, where I'd have had to look up the documentation on every function otherwise, or for writing database migrations by hand, where the patterns are very clear, and in those situations it felt like a huge time-saver. But for other applications it didn't help at all, or admittedly even introduced subtle bugs that were fun to find and fix.

After my free year of Copilot ran out I also didn't re-subscribe, because at this point I have too many AI-related subscriptions as it stands, but I'd definitely (carefully) use it if I had access to it via an org or an OS project.

dkersten · 2024-09-02T07:38:30 1725262710

To be completely fair, there are some things I did have success with getting code generated. For example, I made a little python script to pull fields out of TOML files and converted them to CSV (so that I could import the data into a spreadsheet). It did mostly ok on this (in that I didn’t have to edit the final code that much and it was in fact faster than writing it all myself).

But the cases where I find its code was good enough are 1) fairly easy tasks (ie I don’t need AI to do it, but it still saved some time), and 2) not that common for the type of development I’ve been mostly doing. The problem is that I’ve often wasted significant time to figure out whether or not it’s one of these tasks, so in the long run it just doesn’t feel that useful to me as a “write code for me” tool. But as I said, I do find AI a useful aid, just not to write my code for me.

Tiberium · 2024-08-25T20:20:08 1724617208

I'm curious about your experience - what specific LLMs and tools did you try? And what's your main programming language?

sweeter · 2024-08-26T02:07:17 1724638037

I tried ollama with llama 3 and 3.1, code llama, phi3, zephyr, chatgpt 4, all of what tgpt (cli tool) offers, copilot and a couple others but I don't remember. I primarily use Golang, as well as C, Zig, bash and learning Rust.

copilot annoyed the shit out of me and I barely get any useful code from LLMs. I think the most help I get from LLMs is asking things like "what is this operator mean '~uint64'" or other non-common language constructs. I primarily will just pull up open-source code that is of verifiably high quality and learn from that.

furyofantares · 2024-08-25T19:17:19 1724613439

I also find programming extremely enjoyable, my means of expression, and an art form. I have hundreds of side projects in my archive, maybe five of which have ever been used by another human. It's all for the sake of coding. Many of them are sizeable and many are not but they are almost all done as a creative outlet, for the joy of doing it or to satisfy a curiosity.

But I don't know man, I love coding with LLMs. It just opens up more things, I think on some projects I actually spend MORE time on traditional coding than I did in the past, because I used an LLM to write scripts to automate some tedious data processing required for the project. And there's also projects where the LLM gets me from 0 to 60 and then I rather quickly write the code I actually care about writing, and may or may not end up replacing all the LLM written code.

I'm sure it heavily depends on exactly what types of project interest you. The fact that LLMs and diffusion have both become fixations of mine also means I have a lot more data processing involved in lots of my projects, and LLMs are quite good at custom data processing scripts.

I suppose my suggestion to the author would be that perhaps their projects aren't amenable to LLMs in the way they want and that's fine, but don't lose hope that there are kindred spirits out there just because so many people love LLM coding; some of us are both and that may be more about what types of projects we do.

ThrowawayTestr · 2024-08-25T19:47:20 1724615240

>I also find programming extremely enjoyable

I that's a big gulf people don't appreciate. I don't enjoy programming. When I program a microcontroller for my hobby projects it's a means to an end. I would love a tool that takes in a natural language description of what I want and outputs code and LLMs are good at doing that for basic tasks.

brunooliv · 2024-08-25T18:41:34 1724611294

This is a post that just reads as if the author is still in the “honeymoon” stage of their career where programming is seen as this extremely liberating and highly creative endeavour that no other mortal can comprehend. I get the feeling and I was there too, but, writing code has always been a means to an end which is to deliver business value. Painting it as this almost abstract creative process is just… not true. While there are many ways to attack a given problem, the truth is once you factor in efficiency, correctness, requirements and the patterns your team uses then the search space of acceptable implementations reduces a lot. Learn a couple of design patterns, read a couple of blogs and chat with your team and that’s all you need. Letting an LLM write down the correct and specific ideas you tell it to based on what I wrote earlier means your free time to do code reviews, attend important meetings, align on high level aspects and help your team members all which multiply the value you deliver only through code. Let LLMs automate the code writing so I can finally program in peace, I say!

port19 · 2024-08-26T07:51:40 1724658700

I get that at some point you have to put food on the table, but why conflate the enterprisey, economical, object-oriented mess of things with your hobby?

You can, in theory, still program elegant little side projects with no pretense of business value or any customer besides, maybe, yourself.

I find that my work-coding and hobby-coding are different enough that they don't even feel like the same activity

kraftman · 2024-08-26T16:31:03 1724689863

When I write a side project I'm not doing it because I enjoy coding, I'm doing it because I enjoy problem solving, and I've thought of a potential problem that I could solve with code, and I want to prototype that idea. I'm not that fussed about elegance; ideally, I want to prototype that idea as quickly as possibly so I can see if its valid or not, so I'd much rather use any tool I can to get me there as quickly as possible.

happyraul · 2024-08-26T04:56:31 1724648191

This sounds like a nightmare to me. The last thing I want out of my work day is to attend more "important meetings" and "multiply" my value. This is the kind of thinking that makes us less human, just widgets that are interchangeable. No thanks.

williamcotton · 2024-08-25T18:26:00 1724610360

I still very much felt like I was creatively crafting this [0] project even though the entire approach used the Claude project feature. I had to hand-write some sections but for the most part I was just instructing, reading, refining, and then copying and pasting. I was the one who instructed the use of a bash parser and operating on the AST for translation between text and GUI. I was the one who instructed the use of a plugin architecture to enforce decoupling. I was the one who suggested every feature and the look of the GUI. The goal was to create an experimental UI for creating and analyzing bash pipelines. The goal was not to do a lot of typing!

These high level abstractions are where I find the most joy from programming. Perhaps for some there is still some modicum of enjoyment from writing a for loop but for most people twenty years into a career there's nothing but the feeling of grinding out the minutia.

There's still a lot of room for better abstractions when it comes to interfacing with computing devices. I'd love to write my own operating system, CLI interface, terminal, and scripting language, etc from scratch and to my own personal preferences. I don't imagine I could ever have the time to handcraft such a vast undertaking. I do imagine that within a few decades I will be able to guide a computing assistant through the entire process and with great joy!

[0] https://github.com/williamcotton/guish

danjl · 2024-08-25T18:37:58 1724611078

English, and other languages, are vague and imprecise. I've never understood why folks think they can write code "more efficiently" with a prompt rather than code? Are people willing to give up control? Let the LLM decide what is best? The same is true for generative art -- you get something, but you only have marginal control over what. I think this will always be something that is useful for the simplest things, simplest apps, simplest art, etc.. A race to the bottom for the bottom of the complexity stack. As problems become more complicated, it would take a great deal more prompt language to specify the behavior than code.

kmoser · 2024-08-25T19:02:08 1724612528

Nobody in their right mind gives up control or lets the LLM decide what's best. They take the output of the LLM as a starting point, changing it where necessary to accomplish their goals.

This is no different than starting with boilerplate code, e.g. from a tutorial or manual or other project (or, heaven forbid, an Internet search), and changing it for your specific needs.

As for imprecision, a developer is perfectly capable of writing imprecise, ambiguous, or just plain buggy code by themselves. If you think your code is better than that produced by an LLM (and I'm not saying it isn't), then by all means use it. But the fact is that an LLM, in some cases, can produce code better and/or faster than it would take some people to write themselves who don't want to spend several minutes or hours stumbling through finding the exact syntax or algorithm, let alone Googling for a good starting point.

As for the non-determinism that many people are decrying, this is no different from Googling and finding several examples, each of which is written for a particular use case, none of which ever seem to match yours exactly. Caveat emptor if you simply cut and paste without reading, just as with code produced by an LLM.

fuzztester · 2024-08-25T19:34:59 1724614499

this is one of the most perspicacious comments I have ever read about llms, with the disclaimer that i have not read a large number of comments on the subject. I have read some however.

however, referring to the first paragraph of the comment above, the problem is that many people are not in their right minds :)

danjl · 2024-08-25T19:14:55 1724613295

As I said, the LLM will be a good replacement for the simplest cases. The behaviors you describe are far more common among junior programmers working on simple apps.

BeetleB · 2024-08-25T21:31:53 1724621513

> The behaviors you describe are far more common among junior programmers working on simple apps.

Or among expert programmers who need to code in something out of their domain for a small portion of their job.

Suppose you suddenly are required to write a VBA macro for Excel for your job. It's a one off task - not something you'll do repeatedly. Do you prefer learning VBA for Excel and crafting a solution or asking the LLM and verifying it's solution by looking at the docs?

Hint: If you use the macro recorder in Excel and inspect the code you are closer to the LLM end of the spectrum.

throwup238 · 2024-08-25T18:59:30 1724612370

For most things I approach it from both sides: I give an English prompt but also provide some interface, function signatures, or even just a return type that constrains the LLMs output.

For example, I frequently have to implement Serde's traits for some serialization format or to marshal types, most recently to translate types from Rust to Qt QML's Javascript. By giving it some context (Serde traits and QT docs) I managed to do it with Claude in about an hour, which is roughly how long it would have taken me just to get up to date with the documentation if I had tried it myself.

prisenco · 2024-08-25T18:46:19 1724611579

I wonder about this as well. Go has 26 keywords. English as 170,000 active words.

I don't want to code in English.

Cordiali · 2024-08-26T11:15:59 1724670959

Most of those words are engPM packages. The trick is to not include the `architecture` package, if you're just making toast.

BeetleB · 2024-08-25T21:28:10 1724621290

Binary has only two bits. I really don't want to code in two bits.

prisenco · 2024-08-26T04:47:28 1724647648

There's a sweet spot for identifiers when writing software and it's somewhere between 2 and 170,000 but likely closer to 2.

frje1400 · 2024-08-25T19:03:26 1724612606

LLM output can't really be trusted so I need to "proof read" it and convince myself that it is correct. In the language I use every day and have a high degree of fluency, it's faster for me to simply write what's in my head than to proof read unknown code. So how can LLMs make me more productive in actual programming?

I use an LLM to generate ideas, to rubber duck, to get a lead on unknowns, and to generate boilerplate occasionally. So I do everything except replace the coding part because that's what requires the most precision, and LLMs are bad at precision. And yet, people claim massive productivity gains in specifically coding. What am I missing?

flembat · 2024-08-27T05:58:54 1724738334

It helps a lot if you use a strongly typed language with a strict compiler and get the LLM to write plenty of tests. Then you need to understand that the tests are logically correct. The LLM is also good at documenting the functions, so you can review that matches your intentions and the code as well. Also the LLM will pick functions from your own source code library to compose new programs for you. So the reuse of your own well tested code should increase confidence.

BeetleB · 2024-08-25T21:33:52 1724621632

> So how can LLMs make me more productive in actual programming?

Suppose you suddenly are required to write a VBA macro for Excel for your job. It's a one off task - not something you'll do repeatedly. Do you prefer learning VBA for Excel and crafting a solution or asking the LLM and verifying its solution by looking at the docs?

Hint: If you use the macro recorder in Excel and inspect the code you are closer to the LLM end of the spectrum.

dkersten · 2024-08-26T14:30:00 1724682600

This is basically the same conclusion I’ve come to, after 18ish months of almost daily use.

danjl · 2024-08-25T19:24:08 1724613848

Also, the errors that an LLM makes are not errors that a typical human would make. This makes reviewing their code particularly challenging.

kristiandupont · 2024-08-26T05:47:44 1724651264

I find the errors they make are often comically "human", like forgetting a premise.

umvi · 2024-08-25T18:26:43 1724610403

For me LLMs are like programming power tools. Use them wrong and you can hurt yourself. Use them right and you can accomplish far more in the same amount of time.

People that refuse to program with AI or intellisense or any other assistance are like carpenters who refuse to build furniture with power saws and power drills. Which is perfectly fine, but IMO that choice doesn't really affect the artistry of the final product

throwup238 · 2024-08-25T18:52:19 1724611939

> For me LLMs are like programming power tools. Use them wrong and you can hurt yourself. Use them right and you can accomplish far more in the same amount of time.

Fun analogy because if you're especially negligent you can injure yourself so badly you'll make programming forever more difficult than it needs to be or end your career altogether - like with a tablesaw cutting off fingers.

fuzztester · 2024-08-25T19:38:14 1724614694

intellisense is fine, but high speed chopping off of your limbs is not. the two are not comparable.

geor9e · 2024-08-25T19:55:33 1724615733

I use an LLM precisely BECAUSE I want to focus on the art. Like Davinci would use apprentices.

LLMs can do mindless drudgery just as well as I can, but in seconds instead of hours. There's nothing about remembering syntax, boilerplate code, forgetting a semicolon, googling the most common way of doing something, or combining some documentation to fill in the gaps that's even remotely "art" to me.

I never ask an LLM for what I'm artfully creating. I ask it for what I know it'll get instantly right, so I can move on to my next thought.

Bjorkbat · 2024-08-25T19:30:08 1724614208

I have a lot of different thoughts as to why using an LLM feels "off". One I've been thinking about as of late is that it feels flawed to measure productivity by code velocity, i.e. lines of code written per hour.

Like, ideally, it shouldn't really take that much code to implement a thing. I like to think of programming as writing a bunch of levers, starting with simple levers for simple jobs, incrementally ratcheting up to larger levers lifting the smaller levels. Before too long, it'll feel as though you've written a lever capable of lifting the world...or at least one that makes an otherwise wickedly difficult project reasonably manageable.

If you say that LLMs make you more productive because it allowed you to finish a project that would otherwise take forever to write, then I'm skeptical that an LLM is the best solution. I mean, it's a solution at least, but I can't help but wonder if there's a better solution.

If the problem is that you lack the understanding to take on such a project, then perhaps what we really need are better tools for understanding. I myself have found that LLMs are great for gaining a quick understanding of languages that otherwise have sparse information for beginners, but I have to wonder if perhaps there's a better way.

If, on the other hand, the problem is that writing that much code would take forever, then I have to wonder if the real solution is that we need a better way to turn programming languages into patterns (levers) and turn said patterns into larger patterns (larger levers)

A partial solution works, but only partially well, and occasionally has consequences one has to reckon with

Kiro · 2024-08-25T20:04:27 1724616267

I'm of the opposite opinion: I've started enjoying programming much more after embracing LLMs.

* They are great for overcoming procrastination. As soon as I don't feel like doing something or a task feels tedious I can just delegate it to an LLM. If it doesn't solve it outright it at least makes me overcome the initial feeling of dread for the task.

* They give me better solutions than I initially had in mind. LLMs have no problem adding laborious safeguards against edge-cases that I either didn't think of or that I assessed wouldn't be worth it if I did it manually. E.g. something that is unlikely and would normally go to the backlog instead. I've found that my post-LLM code is much more robust from the get go.

* They let me try out different approaches easily. They have no problem rewriting the whole solution using another paradigm, again and again. They are tireless.

* They let me focus on the creative parts that I enjoy. This surprised me since I've always thought of myself as someone who loves programming but it turns out that it is only a small subset of programming I love. The rest I'm happy to delegate away.

magicalhippo · 2024-08-25T21:10:31 1724620231

> This surprised me since I've always thought of myself as someone who loves programming but it turns out that it is only a small subset of programming I love.

I am the same, and why many of my personal projects end up stranded. Once I've solved the tricky bit, the rest often isn't that motivating as it's usually variations on a common theme.

I held off LLMs for a long time, but recently been playing with them. They can certainly confidently generate junk, but in most cases it's good enough. And like you say can be used as a driver to keep going. In that regard they can be useful.

willtemperley · 2024-08-25T21:32:18 1724621538

This is exactly how I use LLMs - I can automate the really boring parts. "Can you write me a Swift codable struct for the following JSON" will save my fingers and precious mental energy for the important and interesting parts.

It's like having a junior dev that doesn't complain and gets the work done immediately.

AI code suggestions as I type are however a different beast. It's easy to introduce subtle bugs when the suggestion "kinda looks right" but in fact the LLM had zero understanding of the context because it can't read my mind.

verditelabs · 2024-08-25T21:25:14 1724621114

Same, these are all great points that I find as well. LLMs have made me a way more productive programmer, but a lot of that is because I already was an alright programmer and know how to take advantage of the strengths and weaknesses of the LLM. I think your last bullet point is most poignant, using Claude 3.5 I've been able to do tons of GUI and web programming, things I absolutely despise and refuse to do if I'm writing code by hand.

I sort of understand some of the vitriol that I see on HN but it is incredibly overblown. I don't really get a lot of the criticisms. LLMs aren't deterministic? Neither are humans. LLMs write bugs they can't fix? So do humans. LLMs are only good at being junior programmer copy paste machines? So are lots of humans.

My current project is training an LLM to do superoptimization and it's working exceedingly well so far. If you asked anyone on hacker news if that's a good idea, they'd probably say no.

steveBK123 · 2024-08-25T18:31:21 1724610681

I do sometimes get the impression that there will be a generational gap in ability to code between millennials and zoomers.

We had an overdemand for devs during late ZIRP early COVID leading to bootcamps and self taught pulling a lot of untrained into the industry. Many of them have left the industry.

Add to that the whole data science bubble and it’s bursting where we had tons of degrees and job openings for sort-of-devs. Lot of those jobs are gone now too.

Don’t forget the pull of “product management” and its demise outside big tech.

Now we have hiring freezes and juniors leaning on LLMs instead of actually spending an hour trying to solve problems.

Interesting times.

TrianguloY · 2024-08-25T18:39:14 1724611154

I feel the same. I understand why others think ai is just another tool like intellisense, but for me intellisense and any other automatic refactor is a fixed algorithm that I understand and that I know exactly what it is doing, and I know that it is correct.

With ai I need to review the output but not because there may be some issues I didn't noticed, but because that may be issues the tool itself didn't noticed, so it's less of an "apply this specific change" but more of an "apply some change"

danjl · 2024-08-25T19:27:20 1724614040

With intelliSense, I disregard about 98% of the suggestions. Do people do that with generated code? Doubtful. With an LLM, even that last 2% requires more effort because it creates weird and irrational bugs that I have to review.

dakiol · 2024-08-25T18:17:05 1724609825

I do the “programming as art” in my free time, when working on my own side projects. I do it because it’s fun and I take proud on what I produce.

For work stuff? I couldn’t care less if the code comes from me, my colleagues or an LLM. As long as it works and it’s secure, we’ll ship it.

Folks, career != job.

pajeets · 2024-08-25T18:22:04 1724610124

Exactly, now replace code with AI generated art, photos, drawings, videos, music. Your employers couldn't care less if its convincing enough to ship. even better now that it only takes seconds to minutes.

We are at the cusp of creative destruction and we are only getting started. Ironically, blue collar jobs seem safe as there hasn't been a humanoid revolution and what I see in the white collar field is what blue collar workers experienced before the automation and offshoring of jobs

steveBK123 · 2024-08-25T18:37:42 1724611062

I think AI art is actually an interesting example. It’s mostly run of the mill schlock to replace clip art / stock images.

Adequate but not enjoyable. Lorem ipsum of visual art. Probably kills some basic graphic design jobs at the margin working on low budget projects.

Meanwhile big brands using big agencies will just incorporate it into their design process. McD Japan is a recent example. You still need a human with an eye and taste to be the editor.

But no one is reading or viewing AI art for pleasure. It’s all “that’s neat” (continues scrolling).

pajeets · 2024-08-25T20:51:55 1724619115

There are many erroneous assumptions based on conjectures and not enough real world metrics and evidences.

AI art is very much already being consumed and sold. It's not just "mill schlock" anymore. You won't even know you are consuming AI art.

thomashop · 2024-08-25T21:48:21 1724622501

Where are your real-world metrics and evidence?

pajeets · 2024-08-25T21:50:11 1724622611

steveBK made the claim nobody is consuming AI arts

he needs to prove that point

layer8 · 2024-08-25T21:29:17 1724621357

Why would you spend 40-ish hours per week with stuff that isn’t fun and you don’t take pride in, if you have the choice?

dakiol · 2024-08-26T13:56:54 1724680614

The chances of landing on an IT job for which I can take pride in, is 100% remote and that pays well is very low. To begin with, I dislike all the faang-like companies.

Besides, being good at programming makes it easier to deal with BS jobs that pay well, so it’s not that I suffer 40h/week.

warkdarrior · 2024-08-25T23:36:33 1724628993

Because it pays well and I work to live.

layer8 · 2024-08-26T11:32:43 1724671963

You're implying that the alternative wouldn't pay well, which seems like a non sequitur.

icambron · 2024-08-25T19:56:42 1724615802

To have all these thoughts, I think you'd have to have never really used an LLM to help you code, or to be almost comically closed-minded when you do. What they feel like when you actually use them is a combination of a better SO and a very prescient auto-completer. It does not at all feel like delegating programming work to a robot. No loss of artistry comes into play, and it's damn useful.

In an ideal world, our abstractions would be so perfect that there would be no mundane boiler-platey parts of a program; you'd use the abstractions to construct software from a high level and leave details be. But our abstractions are very far from perfect: there's all kinds of boring code you just have to write because, well, your program has to work. And generally that code is, if you look, most of your code. This because making good abstractions is really hard and constructing fresh ones is often more work than just typing out the different cases. If you think this is mistaken, I'd gently suggest you take a fresh look at your own code.

Anyway, that's where LLMs come in. They help write the boring code. They're pretty good at it in some cases, and very bad at it in others. When they're good at it, it's because what the code should do is sort of overspecified; it's clear from context what, say, this function has to do to be correct, and the LLM is able to see and understand that context, and thus generate the right code to implement it. This code is boring because it is in some vague sense unnecessary; if it couldn't be otherwise, why do you have to write it at all? Well you do, and the LLM has taken care of it for you.

You can call this work the LLM is displacing "art", but I wouldn't. It's more the detritus of art performed in a specific way, the manual process required to physically make the art given the tools available.

You could object that the LLMs will get better in the sense that not only that they will make fewer mistakes, but they will be able to take on increased scope, pushing closer to what I'd consider the "real" decisions of a program. If this happens -- and I hope it does -- then we should reevaluate our lofty opinions of ourselves as artists, or at least artists whose artistry is genuinely valuable.

Retr0id · 2024-08-25T18:18:37 1724609917

Writing an essay forces you to think about an idea intimately, acting as a tool for thought in and of itself. The way I use programming is the same.

youssefabdelm · 2024-08-25T18:30:06 1724610606

> does no one not find programming fun anymore?

Author needs to get into Bret Victor. Has no idea how much more fun he could be having.

Programming is a step on the way to access to the state space of information. When we get to that stage, programming will seem like a maze of syntax, that has its own idiosyncrasies that force you into corners or regions in the state space, just like any DAW plugin or 3D tool, or any tool at all that exists.

PUSH_AX · 2024-08-25T18:43:31 1724611411

This might be the most "zoomed in" take on programming I've ever read (where a zoomed out take understands that software usually just enables a business to do business). I almost thought it was satire.

I feel like you have to drop this kind of thinking to get anywhere past intermediate, not to mention you become a nightmare to anyone who has even a touch of pragmatism about them.