Author of HumanifyJS here! I've created an LLM-based tool specifically for this, which uses LLMs at the AST level to guarantee that the code keeps working after the unminification step:
Would it be difficult to add a 'rename from scratch' feature? I mean a feature that takes normal code (as opposed to minified code) and (1) scrubs all the user's meaningful names, (2) chooses names based on the algorithm and remaining names (ie: the built-in names).
Sometimes when I refactor, I do this manually with an LLM. It is useful in at least two ways: it can reveal better (more canonical) terminology for names (eg: 'antiparallel_line' instead of 'parallel_line_opposite_direction'), and it can also reveal names that could be generalized (eg: 'find_instance_in_list' instead of 'find_animal_instance_in_animals').
What kind of question does it ask the LLM? Giving it a whole function and asking "What should we rename <variable 1>?" repeatedly until everything has been renamed?
Asking it to do it on the whole thing, then parsing the output and checking that the AST still matches?
Does it work with huge files? I'm talking about something like 50k lines.
Edit: I'm currently trying it with a mere 1.2k-line JS file (OpenAI mode) and it's only 70% done after 20 minutes. Even if it theoretically works with a 50k LOC file, I don't think you should try.
It does work with files of any size, although it is quite slow if you're using the OpenAI API. HumanifyJS works by processing each variable name separately, which keeps the context size manageable for the LLM.
I'm currently working on parallelizing the rename process, which should give orders of magnitude faster processing times for large files.
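Roughly, the per-identifier flow looks something like this (a hypothetical sketch, not HumanifyJS's actual code; `suggestName` stands in for the real LLM call, and Babel is just one way to do the scope-aware rename):

    const parser = require("@babel/parser");
    const traverse = require("@babel/traverse").default;
    const generate = require("@babel/generator").default;

    // Placeholder for the actual LLM call; the prompt wording is an assumption.
    async function suggestName(oldName, contextSource) {
      // e.g. "Here is some JavaScript: <contextSource>. Suggest a descriptive name
      // for the identifier currently called `oldName`. Answer with one identifier."
      return "descriptiveName"; // stub
    }

    async function renameIdentifiers(source) {
      const ast = parser.parse(source);
      const bindings = [];
      traverse(ast, {
        Scopable(path) {
          // collect every binding (variables, params, functions) owned by this scope
          for (const name of Object.keys(path.scope.bindings)) {
            bindings.push({ scope: path.scope, name });
          }
        },
      });
      for (const { scope, name } of bindings) {
        const suggestion = await suggestName(name, source);
        // generateUid avoids collisions; rename() is scope-aware, so behaviour is preserved
        scope.rename(name, scope.generateUid(suggestion));
      }
      return generate(ast).code;
    }

Because each rename only needs the identifier's own scope as context, the prompts stay small no matter how big the overall file is.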
> Large files may take some time to process and use a lot of tokens if you use ChatGPT. For a rough estimate, the tool takes about 2 tokens per character to process a file:
> echo "$((2 * $(wc -c < yourscript.min.js)))"
> So for reference: a minified bootstrap.min.js would take about $0.5 to un-minify using ChatGPT.
> Using humanify local is of course free, but it may take more time, be less accurate, and may not be possible with your existing hardware.
It uses smart feedback to fix the code when the LLM occasionally hiccups. You could also have a "supervisor LLM" that asserts that the resulting code matches the specification, and gives feedback if it doesn't.
It's a shame this loses one of the most useful aspects of LLM un-minifying - making sure it's actually how a person would write it. E.g. GPT-4o directly gives the exact same code (+contextual comments) with the exception of writing the for loop in the example in a natural way:
for (var index = 0; index < inputLength; index += chunkSize) {
Comparing the ASTs is useful though. Perhaps there's a way to combine the approaches - have the LLM convert, compare the ASTs, have the LLM explain the practical differences (if any) in context of the actual implementation and give it a chance to make any changes "more correct". Still not guaranteed to be perfect but significantly more "natural" resulting code.
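A rough sketch of the AST-comparison step (using @babel/parser as an example; a real tool would produce a structural diff to feed back to the model rather than a boolean):

    const parser = require("@babel/parser");

    // Strip positions, comments and identifier names so only the tree's shape remains.
    function astShape(source) {
      const ast = parser.parse(source);
      return JSON.stringify(ast, (key, value) => {
        if (["loc", "start", "end", "comments", "leadingComments", "trailingComments", "extra"].includes(key)) {
          return undefined;
        }
        return key === "name" ? "_" : value;
      });
    }

    // True only if the LLM changed nothing beyond names, whitespace and comments;
    // false signals a structural difference worth asking the model to explain.
    function sameShape(originalSource, rewrittenSource) {
      return astShape(originalSource) === astShape(rewrittenSource);
    }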
As someone who has spent countless hours and days deobfuscating malicious JavaScript by hand (manually and with some scripts I wrote), your tool is really, really impressive. Running it locally on a high-end system with an RTX 4090 and it's great. Good work :)
how do you make an LLM work on the AST level? do you just feed a normal LLM a text representation of the AST, or do you make an LLM where the basic data structure is an AST node rather than a character string (human-language word)?
The frontier models can all work with both source code and ASTs as a result of their standard training.
Knowing this raises the question: which is better to feed an LLM, source code or ASTs?
The answer is that it really depends on the use case; there are tradeoffs. For example, keeping comments intact possibly gives the model hints to reason better. On the other hand, it can be argued that a pure AST has less noise for the model to be confused by.
There are other tradeoffs as well. For example, any analysis relating to coding styles would require the full source code.
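For what it's worth, "feeding it the AST" is usually nothing fancier than serializing the parse tree as text, e.g. (acorn used here purely as an example):

    const acorn = require("acorn");

    const source = "for (var e = 0; e < n; e += c) { p(e); }";
    const ast = acorn.parse(source, { ecmaVersion: "latest" });

    // This JSON text (or some trimmed-down rendering of it) is what would go into
    // the prompt instead of, or alongside, the raw source.
    console.log(JSON.stringify(ast, null, 2));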
On a structural level it's exactly 1:1: HumanifyJS only does renames, no refactoring. It may come up with better names for variables than the original code had, though.
Came here to say Humanify is awesome both as a specific tool and in my opinion a really great way to think about how to get the most from inherently high-temperature activities like modern decoder nucleus sampling.
JS minification is fairly mechanical and comparably simple, so the inversion should be relatively easy. It would be of course tedious enough to be manually done in general, but transformations themselves are fairly limited so it is possible to read them only with some notes to track mangled identifiers.
A more general unminification or unobfuscation still seems to be an open problem. I wrote handful of programs that are intentionally obfuscated in the past and ChatGPT couldn't understand them even at the surface level in my experience. For example, a gist for my 160-byte-long Brainfuck interpreter in C had some comment trying to use GPT-4 to explain the code [1], but the "clarified version" bore zero similarity with the original code...
> JS minification is fairly mechanical and comparably simple, so the inversion should be relatively easy.
Just because a task is simple doesn't mean its inverse need be. Examples:
- multiplication / prime factorization
- deriving / integrating
- remembering the past / predicting the future
Code unobfuscation is clearly one of those difficult inverse problems, as it can be easily exacerbated by any of the following problems:
- bugs
- unused or irrelevant routines
- incorrect implementations that incidentally give the right results
In that sense, it would be fortunate if ChatGPT could give decent results at unobfuscating code, as there is no a priori expectation that it should be able to do so. It's good that you've also checked ChatGPT's code unobfuscation capabilities on a more difficult problem, but I think you've only discovered an upper limit. I wouldn't consider the example in the OP to be trivial.
Of course, it is not generalizable! In my experience though, most minifiers do only the following (a small hand-written illustration follows the list):
- Whitespace removal, which is trivially invertible.
- Comment removal, which we never expect to recover via unminification.
- Renaming to shorter names, which is tedious to track but still mechanical. And most minifiers have little understanding of underlying types anyway, so they are usually very conservative and rarely reuse the same mangled identifier for multiple uses. (Google Closure Compiler is a significant counterexample here, but it is also known to be much slower.)
- Constant folding and inlining, which is annoying but can be still tracked. Again, most minifiers are limited in their reasoning to do extensive constant folding and inlining.
- Language-specific transformations, like turning `a; b; c;` into `a, b, c;` and `if (a) b;` into `a && b;` whenever possible. They will be hard to understand if you don't know in advance, but there aren't too many of them anyway.
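To make that list concrete, here is a hand-written before/after sketch (not real minifier output) showing the whitespace removal, comment removal and renaming on a tiny function:

    // Original:
    function chunkText(input, chunkSize) {
      var chunks = [];
      // split the input into fixed-size pieces
      for (var index = 0; index < input.length; index += chunkSize) {
        chunks.push(input.slice(index, index + chunkSize));
      }
      return chunks;
    }

    // Roughly what a minifier emits (whitespace and the comment gone, names mangled):
    // function c(a,b){var d=[];for(var e=0;e<a.length;e+=b)d.push(a.slice(e,e+b));return d}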
As a result, minified code still remains comparably human-readable with some note taking and perseverance. And since these transformations are mostly local, I would expect LLMs to be able to pick them up on their own as well.
I would say the actual difficulty varies greatly. It is generally easy if you have a good guess about what the code would actually do. It would be much harder if you have nothing to go on, but usually you should have something to start with. Much like debugging, you need a detective mindset to be good at reverse engineering, and name mangling is a relatively easy obstacle to handle at this scale.
Let me give some concrete example from my old comment [1]. The full code in question was as follows, with only whitespaces added:
Many local variables should be easy to reconstruct: b -> player, c -> removePlayer, d -> playerDiv1, e -> playerDiv2, h -> playerVideo, l -> blob (we don't know which blob it is yet though). We still don't know about non-local names including t, aj, lc, Mia and m, but we are reasonably sure that it builds some DOM tree that looks like `<ytd-player><div></div><div class="ad-interrupting"><video class="html5-main-video"></div></ytd-player>`. We can also infer that `removePlayer` would be some sort of a cleanup function, as it gets eventually called in any possible control flow visible here.
Given that `a.resolve` is the final function to be executed, even later than `removePlayer`, it will be some sort of "returning" function. You will need some information about how async functions are desugared to fully understand that (and also `m.return`), but such information is not strictly necessary here. In fact, you can safely ignore `lc` and `Mia` because it eventually sets `playerVideo.src` and we are not that interested in the exact contents here. (Actually, you will fall into a rabbit hole if you are going to dissect `Mia`. Better to assume first and verify later.)
And from there you can conclude that this function constructs a certain DOM tree, sets some class after 200 ms, and then "returns" 0 if the video "ticks" or 1 on timeout, giving my initial hypothesis. I then hardened my hypothesis by looking at the blob itself, which turned out to be a 3-second-long placeholder video and fits with the supposed timeout of 5 seconds. If it were something else, then I would look further to see what I might have missed.
I believe the person you're responding to is saying that it's hard to do automated / programmatically. Yes a human can decode this trivial example without too much effort, but doing it via API in a fraction of the time and effort with a customizable amount of commentary/explanation is preferable in my opinion.
Indeed that aspect was something I failed to get initially, but I still stand by my opinion because most of my reconstruction was local. Local "reasoning" can often be done without actual reasoning, so while it's great that we can automate the local reasoning, it falls short of the full reasoning necessary for general unobfuscation.
This is, IMO, the better way to approach this problem. Minification applies rules to transform code, if we know the rules, we can reverse the process (but can't recover any lost information directly).
A nice, constrained way to use an LLM here to enhance this solution is to ask it some variation of "what should this function be named?" and feed the output to a rename refactoring function.
You could do the same for variables, or be more holistic and ask it to rename variables and add comments (but risk the LLM changing what the code does).
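A minimal sketch of that constrained setup (Babel used as an example refactoring tool; since the model only ever supplies a name, it cannot alter the code itself):

    const parser = require("@babel/parser");
    const traverse = require("@babel/traverse").default;
    const generate = require("@babel/generator").default;

    // `suggestedName` would come from a prompt such as
    // "Given this function's body, what should it be named? Answer with one identifier."
    function applyRename(source, oldName, suggestedName) {
      const ast = parser.parse(source);
      traverse(ast, {
        Program(path) {
          // scope-aware rename: updates the declaration and every reference, nothing else
          path.scope.rename(oldName, suggestedName);
        },
      });
      return generate(ast).code;
    }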
How do we end up with you pasting large blocks of code and detailed step-by-step explanations of what it does, in response to someone noting that just because process A is simple, it doesn't mean inverting A is simple?
This thread is incredibly distracting, at least 4 screenfuls to get through.
I'm really tired of the motte/bailey comments on HN on AI, where the motte is "meh the AI is useless, amateurish answer thats easy to beat" and bailey is "but it didn't name a couple global variables '''correctly'''." It verges on trolling at this point, and is at best self-absorbed and making the rest of us deal with it.
Because the original reply missed three explicit adverbs hinting that this is not a general rule (EDIT: and also mistook my comment for being dismissive). And I believe it was not in bad faith, so I went on to give more context to justify my reasoning. If you are not interested in that, please just hide it, because otherwise I can do nothing to improve the status quo, and I personally enjoyed the entire conversation.
> As a result, minified code still remains comparably human-readable with some note taking and perseverance.
At least some of the time, simply taking it and reformatting it to be unfolded across multiple lines is useful enough to make it readable/debuggable. Fixing the bug you found is likely more complex, because you have to find where it is in the original code, which, to my eyes, isn't always easy to spot.
As a point of order, code minification != code obfuscation.
Minification does tend to obfuscate as a side effect, but that is not the goal, so reversing minification becomes much easier. Obfuscation, on the other hand, can minify code, but crucially that isn't where it starts from. As the goals of minification and obfuscation differ, reversing them takes different effort, and I'd much rather attempt to reverse minification than obfuscation.
I'd also readily believe there are hundreds/thousands of examples online of reversed code minification (or "here is code X, here is code X _after_ minification") that LLMs have ingested in their training data.
Yeah, having run some state of the art obfuscated code through ChatGPT, it still fails miserably. Even what was state of the art 20 years ago it can't make heads or tails of.
> JS minification is fairly mechanical and comparably simple, so the inversion should be relatively easy.
This is stated as if it's a truism, but I can't understand how you can actually believe this. Converting `let userSignedInTimestamp = new Date()` to `let x = new Date()` is trivial, but going the other way probably requires reading and understanding the rest of the surrounding code to see in what contexts `x` is being used. Also, the rest of the code is also minified, making this even more challenging. Even if you do all that right, it's still at best a lossy conversion, since the name of the variable could capture characteristics that aren't explicitly outlined in the code at all.
You are technically correct, but I think you should try some reverse engineering to see that it is usually possible to reconstruct much of it in spite of the amount of transformation applied. I do understand that this fact might be hard to believe without any prior.
EDIT: I think I got why some comments complain I downplayed the power of LLM here. I never meant to, and I wanted to say that the unminification is a relatively easy task compared to other reverse engineering tasks. It is great we can automate the easy task, but we still have to wait for a better model to do much more.
I have tried reconstructing minified code (I thought that would be obvious from my example). It feels like it takes just a bit less thought than it did to write the code in the first place, which is definitely not something I would classify as "comparably simple".
Because of how trivial that step is, it's likely pretty easy to just take lots of code and minify it. Then you have the training data you need to learn to generate full code from minified code. If your goal is to generate additional useful training data for your LLM, it could make sense to actually do that.
I suspect, but definitely do not know, that all the coding aspects of llms work something like this. It’s such a fundamentally different problem from a paragraph, which should never be the same as any other paragraph. Seems to me that coding is a bit more like the game of go, where an absolute score can be used to guide learning. Seed the system with lots and lots of leetcode examples from reality, and then train it to write tests, and now you have a closed loop that can train itself.
If you're able to generate minified code from all the code you can find on the internet, you end up with a very large training set. Of course in some scenarios you won't know what the original variable names were, but you would expect to be able to get something very usable out of it. These things, where you can deterministically generate new and useful training data, you would expect to be used.
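For JS specifically, generating those pairs is trivial; something along these lines (terser used as the example minifier, assuming a corpus of files already on disk):

    const fs = require("fs");
    const { minify } = require("terser");

    // Produce one (minified input, original target) training example per file.
    async function makePair(file) {
      const original = fs.readFileSync(file, "utf8");
      const { code: minified } = await minify(original, {
        mangle: true,   // shorten identifier names
        compress: true, // constant folding, inlining, statement rewriting
      });
      return { input: minified, target: original };
    }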
And I can’t understand why any reasonably intelligent human feels the need to be this abrasive. You could educate but instead you had to be condescending.
Converting a picture from color to black and white is a fairly simple task. Getting back the original in color is not easy. This is of course due to data lost in the process.
Minification works in the same way. A lot of information needed for understanding the code is lost. Getting back that information can be a very demanding task.
But it is not much different from reading through badly documented code without any comments or meaningful names. In fact, much of the code that gets minified is not that bad, and thus it is often possible to infer the original code just from its structure. It is still not a trivial task, but I think my comment never implied that.
The act of reducing the length of variable names by replacing something descriptive (like "timeFactor") with something much shorter ("i") may be mechanical and simple, but it is destructive, and reversing it is not relatively easy; in fact, it's impossible to do without a fairly sophisticated understanding of what the code does. That's what the LLM did here, which isn't exactly surprising, but it is cool; being so immediately dismissive isn't cool.
I never meant to be dismissive, in fact my current job is to build a runtime for ML accelerator! I rather wanted to show that unminification is much easier than unobfuscation, and that the SOTA model is yet to do the latter.
Also, it should be noted that the name reconstruction is not a new problem and was already partly solved multiple times before the LLM era. LLM is great in that it can do this without massive retraining, but the reconstruction depends much on the local context (which was how earlier solutions approached the problem), so it doesn't really show its reasoning capability.
That's much better in that most of the original code remains present and the comments are not that far off, but its understanding of the global variables is utterly wrong (to be expected, though, as many of them serve multiple purposes).
Yep, I've tried to use LLMs to disassemble and decompile binaries (giving them the hex bytes as plaintext), they do OK on trivial/artificial cases but quickly fail after that.
That only means it’s not a legally recognised brand, but it is a brand nonetheless if people associate the two (and they do). A bit like the way people associate tissue paper with Kleenex, or photocopies with Xerox, or git with GitHub.
> The name Wi-Fi, commercially used at least as early as August 1999, was coined by the brand-consulting firm Interbrand. The Wi-Fi Alliance had hired Interbrand to create a name that was "a little catchier than 'IEEE 802.11b Direct Sequence'." According to Phil Belanger, a founding member of the Wi-Fi Alliance, the term Wi-Fi was chosen from a list of ten names that Interbrand proposed. (…)
> The name Wi-Fi is not short-form for 'Wireless Fidelity' (…) The name Wi-Fi was partly chosen because it sounds similar to Hi-Fi, which consumers take to mean high fidelity or high quality. Interbrand hoped consumers would find the name catchy, and that they would assume this wireless protocol has high fidelity because of its name.
The generative pretrained transformer was invented by OpenAI, and it seems reasonable for a company to use the name it gave to its invention in its branding.
Of course, they didn't invent generative pretraining (GP) or transformers (T), but AFAIK they were the first to publicly combine them.
I have, but only as an idiom, never literally. E.g. "Microsoft just keeps hoovering up companies", but the literal act of vacuuming is only called vacuuming.
Growing up in India over the past 4 decades, 'Xerox' was/is the default and most common word used for photocopying ... only recently have I started using/hearing the term 'photocopy'.
every town and every street had "XEROX shops" where people went to get various documents photocopied for INR 1 per page for example
It’s not only their core strength — it’s what transformers were designed to do and, arguably, it’s all they can do. Any other supposed ability to reason or even retain knowledge (rather than simply regurgitate text without ‘understanding’ its intended meaning) is just a side effect of this superhuman ability.
I see your point, but I think there's more to it. It's kind of like saying "all humans can do is perceive and produce sound, any other ability is just a side-effect". We might be focusing too much on their mechanism for "perception" and overlooking other capabilities they've developed.
Sure, but that claim wouldn't be true for humans, right? So it's a non sequitur.
The relevant claim would be: all humans can do is move around in their environments, adapt the world around them through action, observe using adaptive sensory motor systems, grow and adapt their brains and bodies in response to novel and changing environments, abstract sensory motor techniques into symbolic concepts, vocalize this using inherited systems of meaning acquired as very young children in adaption within their environments, etc.
In the case of transformers all they can do is, in fact, sample from a compression of historical texts using a weighted probability metric.
If you project both of these into "problems an office worker has"-space, then they can appear similar -- but this projection is an incredibly dumb one, and is offered as a sales pitch by charlatans looking to pretend that a system which can generate office emails can communicate.
Abstract functions are fully representable by function approximations in the limit n->inf; ie., sampling from a circle becomes a circle as samples -> infinity.
This makes all "studies" whose aim is to approximate a fully representable abstract mathematical domain irrelevant to the question.
This is just more evidence of the naivety, mendacity, and pseudoscientific basis of ML and its research.
As you sample all pixels from all photos on a mountain, the pixels don't become the mountain.
The structure of a mountain is not a pattern of pixels. So there is no function for a statistical alg to approximate, no n->infinity which makes the approximation exact.
By sampling from historical pixel patterns in previous images you can generate images in a pixel order that makes sense to a person already acquainted with what they represent. Eg., having seen a mountain (, having perspective, colour vision, depth, counterfactual simulation, imagination, ...).
In all these disagreeably dumb research papers that come out showing "world models" and the like you have the bad mathematicians and bad programmers called "AI researchers" giving a function approximation alg an abstract mathematical domain to approximate.
ie., if the goal is to "learn a circle" and you sample points from a circle, your approximation becomes exact in n->inf, because the target is *ABSTRACT*.
It's so dumb it's kinda incomprehensible. It shows what a profound lack of understanding of science is rampant across the discipline.
MNIST, Games, Chess, Circles, Rulesets, etc. are all mathematical objects (shapes, rules). It is trivial to find a mathematical approximation to a mathematical object.
The world is not made out of pixels. Models of pixel patterns are not their targets.
> all they can do is, in fact, sample from a compression of historical texts using a weighted probability metric.
I don't think that's all they can do.
I think they know more than what is explicitly stated in their training sets.
They can generalize knowledge and generalize relationships between the concepts that are in the training sets.
They're currently mediocre at it, but the results we observe from SOTA generative models are not explainable without accepting that they can create an internal model of the world that's more than just a decompression algorithm.
I'm going to step away from LLMs for a moment, but: How are video generator models capable of creating videos with accurate shadows and lighting that is consistent in the entire frame and consistent between frames?
You can't do that simply by taking a weighted average of the sections of videos you've seen in your training set.
You need to create an internal 3D model of the objects in the scene, and their relative positions in space across the length of the video. And no one told the model explicitly how to do that, it learned to do it "on its own".
>You need to create an internal 3D model of the objects in the scene, and their relative positions in space across the length of the video. And no one told the model explicitly how to do that, it learned to do it "on its own".
Compression is understanding. If you have a model which explains shadows you can compress your video data much better. Since you "understand" how shadows work.
> In the case of transformers all they can do is, in fact, sample from a compression of historical texts using a weighted probability metric.
You seem to think LLMs operate independently from humans. That doesn't happen in practice. We prompt LLMs, they don't just sample at random. We teach them new skills, share media and stories with them, work, learn and play together. It's not LLMs alone. They are pulled outside their training distribution by the user. The user brings their own unique life experience into the interaction.
Well, yes — absolutely. You could say something similar about any system with complex emergent behaviour. 'All computers can do are NAND operations and any other ability is just a side effect', or something.
However, I do think that in this case it's meaningful. The claim isn't that LLMs are genuinely exhibiting reasoning ability — I think it's quite clear to anyone who probes them for long enough that they're not. I was fooled initially too, but you soon come to realise it's a clever trick (albeit not one contrived by any of the human designers themselves). The claim is usually some pseudo-philosophical claim that the very definition of reasoning is simply 'outputting (at least some of the time) correct sentences' and so there's no more to be said. But this is just silly. It's quite obvious that being able to manipulate language and effectively have access to a vast (fuzzily encoded) database of knowledge will mean you can output true and pertinent statements a lot of the time. But this doesn't require reasoning at all.
Note that I'm not claiming that LLMs exhibit reasoning and other abilities 'as a side effect' of language manipulation ability — I'm claiming there's no reason to believe they have these abilities at all based on the available evidence. Humans are just very easily convinced by beings that seem to speak our language and are overly inclined to attribute all sorts of desires, internal thought processes and whatever else for which there is no evidence.
>I think it's quite clear to anyone who probes them for long enough that they're not.
I disagree and so do a lot of people who've used them for a long while. This is just an assertion that you wish to be true rather than something that actually is. What happens is that for some bizarre reason, for machines, lots of humans have a standard of reasoning that only exists in fiction. Devise any reasoning test you like that would cleanly separate humans from LLMs. I'll wait.
> The claim is usually some pseudo-philosophical claim that the very definition of reasoning is simply 'outputting (at least some of the time) correct sentences' and so there's no more to be said.
There is nothing philosophical or pseudo-philosophical about saying reasoning is determined by output. If anything, the opposite is what's philosophical nonsense. The idea that there exists some "real" reasoning that humans perform and "fake" reasoning that LLMs perform and yet somehow no testable way to distinguish this is purely the realm of fiction and philosophy. If you're claiming a distinction that doesn't actually distinguish, you're just making stuff up.
LLMs clearly reason. They do things, novel things that no sane mind would see a human do and call anything else. They do things that are impossible to describe as anything else unless you subscribe to what i like to call statistical magic - https://news.ycombinator.com/item?id=41141118
And all things considered, LLMs are pretty horrible memorizers. Getting one to regurgitate training data is actually really hard. There's no database of knowledge. It clearly does not work that way.
> Devise any reasoning test you like that would cleanly separate humans from LLMs. I'll wait.
Well, you don’t have to wait. Just ask basic questions about undergraduate mathematics, perhaps phrased in slightly out-of-distribution ways. It fails spectacularly almost every time and it quickly becomes apparent that the ‘understanding’ present is very surface level and deeply tied to the patterns of words themselves rather than the underlying ideas. Which is hardly surprising and not intended as some sort of insult to the engineers; frankly, it’s a miracle we can do so much with such a relatively primitive system (that was originally only designed for translation anyway).
The standard response is something about how ‘you couldn’t expect the average human to be able to do that so it’s unfair!’, but for a machine that has digested the world’s entire information output and is held up as being ‘intelligent’, this really shouldn’t be a hard task. Also, it’s not ‘fiction’ — I (and many others) can answer these questions just fine and much more robustly, albeit given some time to think. LLM output in comparison just seems random and endlessly apologetic. Which, again, is not surprising!
If you mean ‘separate the average human from LLMs’, there probably are examples that will do this (although they quickly get patched when found) — take the by-now-classic 9.9 vs 9.11 fiasco. Even if there aren’t, though, you shouldn’t be at all surprised (or impressed) that the sum of pretty much all human knowledge ever + hundreds of millions of dollars worth of computation can produce something that can look more intelligent than the average bozo. And it doesn’t require reasoning to do so — a (massive) lookup table will pretty much do.
> There is nothing philosophical or pseudo-philosophical about saying reasoning is determined by output.
I don’t agree. ‘Reasoning’ in the everyday sense isn’t defined in terms of output; it usually refers to an orderly, sequential manner of thinking whose process can be described separately from the output it produces. Surely you can conceive of a person (or a machine) that can output what sounds like the output of a reasoning process without doing any reasoning at all. Reasoning is an internal process.
Honestly — and I don’t want to sound too rude or flippant — I think all this fuss about LLMs is going to look incredibly silly when in a decade or two we really do have reasoning systems. Then it’ll be clear how primitive and bone-headed the current systems are.
this overlooks how they do it. we don't really know. it might be logical reasoning, it might be a very efficient content addressable human-knowledge-in-a-blob-of-numbers lookup table... it doesn't matter if they work, which they do, sometimes scarily well. dismissing their abilities because they 'don't reason' is missing the forest for the trees in that they'd be capable of reasoning if they were able to run sat solvers on their output mid generation.
Dismissing claims that LLMs "reason" because these machines perform no actions similar to reasoning seems pretty motivated. And I don't think "blindly take input from a reasoning capable system" counts as reasoning.
Does it? I think Blindsight (the book) had a good commentary on reason being a thing we think is a conscious process but doesn't have to be.
I think most people talking past each other are really discussing whether the GPT is conscious, has a mental model of self, that kind of thing, as long as your definition of reasoning doesn't include consciousness it clearly does it (though not well.)
Hinton's opinions on LLMs are frankly bonkers. Just because you're famous — and intelligent and successful — doesn't mean you can't be completely wrong.
Also: what's his rationale? It's no use simply claiming something without evidence. And as far as I (and seemingly most others) can see, there's no such evidence other than that they can sometimes output sentences that happen to be true. But so can Wikipedia — does that mean Wikipedia is reasoning?
Also, any form of reasoning in the usual sense of the word would surely require the ability to allocate arbitrary amounts of computation (i.e. thought) to each question. LLMs don't do this — they don't sit and ponder; each token takes exactly the same amount of computation to produce. Once they hit an 'end of text' token, they're done.
Even empirically speaking, LLMs' ability to reason can be seen to be nonexistent. Just try asking basic mathematics questions. As soon as you ask anything for which the answer isn't available — practically verbatim — on the web already, it produces intelligent-sounding gibberish.
This whole idea that 'LLMs must be able to reason because in order to learn to fake reasoning you must learn to actually reason' is like some kind of inverted no true Scotsman fallacy.
Yes, Hinton can be wrong, and is wrong on many things, like his misunderstanding of Chomsky and language.
But I also think he has spent thousands of hours testing these systems scientifically.
Your last sentence puts a lot of words in people's mouths. But to continue down that line, fake reasoning and actual reasoning sound like the Chinese Room. Is that the argument you are making?
We don't understand our own mental processes well enough, so I try to not anthropomorphize reasoning and cognition.
> Your last sentence puts a lot of words in peoples mouths.
Well, it’s the most common sentiment I see on both here and (before I gave up) the AI-centred parts of reddit.
It’s not quite the Chinese Room, since LLMs can’t even simulate reasoning very well. So there’s no need to debate the distinction between ‘fake reasoning and actual reasoning’ — there may or may not be a difference, but it’s not the point I’m making.
As for Hinton: I’m sure he has. But inventors are often not experts on their own creations/discoveries, and are probably just as prone to FUD and panic in the face of surprising developments as the rest of us. No one predicted that autoregressive transformers would get us this far, least of all the experts whose decades of work led us to this point.
Particularly those that are basically linear, that don’t involve major changes in the order of things or a deep consideration of relationships between things.
They can’t sort a list but they can translate languages, for instance, given that a list sorted almost right is wrong but that we will often settle for an almost right translation.
One potential benefit should be that with the right tooling around it it should be able to translate your code base to a different language and/or framework more or less at the push of a button. So if a team is wondering if it would be worth it to switch a big chunk of the code base from python to elixir they don't have to wonder anymore.
I tried translating a python script to javascript the other day and it was flawless. I would expect it to scale with a bit of hand-railing.
It seems that this kind of application could really change how the tech industry evolves down the line. Maybe we will converge on tech stacks more quickly if everyone can test new ones out "within a week".
ChatGPT is trained well enough on all things AWS that it can do a decent job translating Python based SDK code to Node and other languages, translate between CloudFormation/Terraform/CDK (in various languages).
It does well at writing simple-to-medium-complexity automation scripts around AWS.
If it gets something wrong, I tell it to “verify your answer using the documentation available on the web”
>>ChatGPT is trained well enough on all things AWS
It was scary to me how chatting with GPT or Claude would give me information that was a lot clearer than what I could deduce after hours of reading AWS documentation.
Perhaps the true successor to Google search has arrived. One big drawback of Google was that a question couldn't be turned into a full, long conversation.
To that end, LLM chat is the ultimate Socratic learning tool to date.
ChatGPT is phenomenal for trying new techniques/libraries/etc. It's very good at many things. In the past few weeks I've used it to build me a complex 3D model with lighting/etc with Three.JS, rewrote the whole thing into React Three Fiber (also with ChatGPT), for a side project. I've never used Three.JS before and my only knowledge of computer graphics is from a class I took 20 years ago. For work I've used it to write me a CFN template from scratch and help me edit it. I've also used it to try a technique with AST - I've never used ASTs before and the first thing ChatGPT generated was flawless. Actually, most of the stuff I have it generate is flawless or nearly flawless.
It's nothing short of incredible. Each of those tasks would normally have taken me hours and I have working code in actual seconds.
And we are still at the beginning of this. Somewhat like where Google search was in the early 2000s.
As IDE integration grows and more and better models arrive that can do this better than ever, we will unlock all sorts of productivity benefits.
There is still skepticism about making these work at scale, with regard to both the electricity and compute requirements for a larger audience. But if they can get this to work, we might see a new tech boom way bigger than anything we have seen before.
I see your point but that specific analogy makes me wince. Google search was way better in the 2000s. It has become consistently dumber since then. Usefulness doesn't necessarily increase in a straight line over time.
The problem is that the use case has to be one where you don't care about the risk of hallucinations, or where you can validate the output without already having the data in a useful format. Plus you need to lack the knowledge/skill to do it more quickly using awk/python/perl/whatever.
I think text transformation is a sufficiently predictable task that one could make a transformer that completely avoids hallucinations. Most LLMs are run at high temperatures, which introduces randomness, and therefore hallucinations, into the result.
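As a sketch, turning the sampling temperature down for this kind of mechanical transformation looks something like the following (OpenAI's chat completions endpoint used as the example; `minifiedSource` is assumed to hold the code, and a low temperature makes the output more repeatable rather than guaranteeing correctness):

    // Request the transformation with sampling randomness mostly turned off.
    async function unminify(minifiedSource) {
      const res = await fetch("https://api.openai.com/v1/chat/completions", {
        method: "POST",
        headers: {
          "Content-Type": "application/json",
          Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
        },
        body: JSON.stringify({
          model: "gpt-4o",
          temperature: 0, // pick the most likely token at each step
          messages: [{ role: "user", content: "Unminify this JavaScript:\n\n" + minifiedSource }],
        }),
      });
      const data = await res.json();
      return data.choices[0].message.content;
    }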
I'm sure there's some number greater than zero of developers who are upset because they use minification as a means of obfuscation.
Reminds me of the tool that was provided in older versions of ColdFusion that would "encrypt" your code. It was a very weak algorithm, and didn't take long for someone to write a decrypter. Nevertheless some people didn't like this, because they were using this tool, thinking it was safe for selling their code without giving access to source. (In the late 90s/early 2000s before open source was the overwhelming default)
This is an example of superior intellectual performance to humans.
There’s no denying it. This task is intellectual. Does not involve rote memorization. There are not tons and tons of data pairs on the web of minified code and unminified code for LLMs to learn from.
The LLM understands what it is unminifying, and it is in general superior to humans in this regard. But only in this specific subject.
> There are not tons and tons of data pairs on the web of minified code and unminified code for LLMs to learn from.
Are you sure about this? These can be easily generated from existing JS to use as a training set, not to mention the enormous amount of non-minified JS which is already used to train it.
I'm bullish on AI, but I'm not convinced this is an example of what you're describing.
The challenge of understanding minified code for a human comes from opaque variable names, awkward loops, minimal whitespacing, etc. These aren't things that a computer has trouble with: it's why we minify in the first place. Attention, as a scheme, should do great with it.
I'd also say there is tons of minified/non-minified code out there. That's the goal of a map file. Given that OpenAI has specifically invested in web browsing and software development, I wouldn't be surprised if part of their training involved minified/unminified data.
> These aren't things that a computer has trouble with
They are irrelevant for executing the code, but they're probably pretty relevant for an LLM that is ingesting the code and text and inferring its function based on other examples it has seen. It's definitely more impressive that an LLM can succeed at this without the context of (correct) variable names than with them.
Minification and unminification are heuristic processes, not algorithmic ones. They are akin to decompiling code or reverse engineering. It's a step beyond the typical AI you see in a calculator.
I don’t claim expertise in AI or understanding intelligence, but could we also say that a pocket calculator really understands arithmetic and has superior intellectual performance compared to humans?
Why not count the fact that humans created a tool to help themselves at unminifying towards human score?
Having a computer multiplying 1000-digit numbers instantly is an example of humans succeeding at multiplying: by creating a tool for that first. Because what else is intellectually succeeding there? It’s not like the computer has created itself.
If one draws a boundary of human intelligence at the skull bone and does not count the tools that this very intelligence is creating and using as mere steps of problem solving process, then one will also have to accept that humans are not intelligent enough to fly into space or do surgery or even cook most of the meals.
> Does not involve rote memorization. There are not tons and tons of data pairs on the web of minified code and unminified code for LLMs to learn from.
GPT-4 has consumed more code than your entire lineage ever will and understands the inherent patterns between code and minified versions. Recognizing the abstract shape of code sans variable names and mapping in some human readable variable names from a similar pattern you've consumed from the vast internet doesn't seem farfetched.
I think I’d agree with your statement, in the same sense that a chess simulator or AlphaGo are superior to human intellect for their specific problem spaces.
LLMs are very good at a surprisingly broad array of semi-structured-text-to-semi-structured-text transformations, particularly within the manifold of text that is widely available on the internet.
It just so happens that lots of code is widely available on the internet, so LLMs tend to outperform on coding tasks. There’s also lots of marketing copy, general “encyclopedic” knowledge, news, human commentary, and entertainment artifacts (scripts, lyrics, etc). LLMs traverse those spaces handily as well. The capabilities of AI ultimately boil down to their underlying dataset and its quality.
You're in denial. Nobody is worshipping a god here.
I'm simply saying the AI has superior performance to humans on this specific subject. That's all.
Why did you suddenly make this comment of "bowing before your god" when I didn't even mention anything remotely close to that?
I'll tell you why. Because this didn't come from me. It came from YOU. This is what YOU fear most. This is what YOU think about. And your fear of this is what blinds you to the truth.
That's interesting. It's gotten a lot better I guess. A little over a year ago, I tried to use GPT to assist me in deobfuscating malicious code (someone emailed me asking for help with their hacked WP site via custom plugin). I got much further just stepping through the code myself.
After reading through this article, I tried again [0]. It gave me something to understand, though the code is obfuscated enough to essentially eval unreadable strings (via the Window object), so it's not enough on its own.
Here was an excerpt of the report I sent to the person:
> For what it’s worth, I dug through the heavily obfuscated JavaScript code and was able to decipher logic that it:
> - Listens for a page load
> - Invokes a facade of calculations which are in theory constant
> - Redirects the page to a malicious site (unk or something)
Anyone working on decompiler LLMs? Seems like we could render all code open source.
Training data would be easy to make in this case. Build tons of free GitHub code with various compilers and train on inverting compilation. This is a case where synthetic training data is appropriate and quite easy to generate.
You could train the decompiler to just invert compilation, and then use existing larger code LLMs to do things like add comments.
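A sketch of what that data generation could look like (paths, flags and tools are only examples; any compiler/disassembler pair would do):

    const { execSync } = require("child_process");
    const fs = require("fs");

    // Compile a permissively licensed C file and pair its disassembly with the source.
    function makeDecompilePair(cSourcePath) {
      execSync(`gcc -O2 -o /tmp/sample.bin ${cSourcePath}`);      // compile
      const disassembly = execSync("objdump -d /tmp/sample.bin"); // textual disassembly
      return {
        input: disassembly.toString("utf8"),
        target: fs.readFileSync(cSourcePath, "utf8"),
      };
    }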
The potential implications of this are huge. Not just open sourcing, but imagine easily decompiling and modifying proprietary apps to fix bugs or add features. This could be a huge unlock, especially for long dead programs.
For legal reasons I bet this will become blocked behavior in major models.
I've never seen a law forbidding decompiling programs.
But some license agreements forbid decompiling the application. Further, you still don't have any rights to that source code; it depends on the license...
A mere decompilation or general reverse engineering should be fine in many if not most jurisdictions [1]. But it is a whole different matter to make use of any results from doing so.
Using an LLM (or any technique) to decompile proprietary code is not clean room design. Declaring the results "open source" is deception and theft, which undermines the free open source software movement.
Only if you use the decompiled code. But if one team uses decompiled code to write up a spec, then another team writes an implementation based on that spec, then that could be considered clean room design. In this case, the decompiler would merely be a tool for reverse engineering.
It is true that at least some jurisdictions do also explicitly allow for reverse engineering to achieve interoperability, but I don't know if such provision is widespread.
Unfortunately not really. Having the source is a first step, but you also need the rights to use it (read, modify, execute, redistribute the modifications), and only the authors of the code can grant these rights.
Doesn't it count as 'clean room' reverse engineering? Or alternatively, we could develop an LLM that's trained on the outputs and side effects of any given function, and learns to reproduce the source code from that.
Or, going back to the original idea, while the source code produced in such a way might be illegal, it's very likely 'clean' enough to train an LLM on it to be able to help in reproducing such an application.
IANAL, but if the only source for your LLM is that code, I would assume the code it produces would be at high risk of being considered counterfeit.
I would guess clean room would still require having someone reading the LLM-decompiled code, write a spec, and have someone else write the code.
But this is definitely a good question, especially given the recent court verdicts. If you can launder open source licensed code, why not proprietary binaries? Although I don't think the situation is the same: I wouldn't expect how you decompile the code to matter.
> Seems like we could render all code open source.
I agree. I think "AI generating/understanding source code" is a huge red herring. If AI was any good at understanding code, it would just build (or fix) the binary.
And I believe that is how it will turn out: when we really have AI programmers, they will not bother with human-readable code, but will code everything in machine code (and if they are tasked with maintaining an existing system, they will understand it in its entirety, across the SW and HW stack). It's kind of like how diffusion models that generate images don't actually bother with learning drawing techniques.
Why wouldn't AIs benefit from using abstractions? At the very least it saves tokens. Fewer tokens means less time spent solving a problem, which means more problem solving throughput. That is true for machines and people alike.
If anything I expect AI-written programs in the not so distant future to be incomprehensible because they're too short. Something like reading an APL program.
It is true that compilation and minification are both code transformations (it's a correct reduction [1]), but this doesn't seem a very useful observation in this discussion. In the end, everything you do to something is an operation. But that's not very workable.
In practice, compilation is often (not always, agreed!) from a language A to a lower level language B such that the runtime for language A can't run language B or vice-versa, if language A has a runtime at all. Minification is always from language A to the same language A.
The implication is that in practice, deminification is not the same exercise as decompilation. You can even want to run a deminification phase after a decompilation phase, using two separate tools, because one tool will be good at translating back, and the other will be good at pretty printing.
There was a paper about this at CGO earlier this year [1]. Correctness is a problem that is hard to solve, though; 50% accuracy might not be enough for serious use cases, especially given that the relation to the original input for manual intervention is hard to preserve.
You could already break the law and open yourself up to lawsuits and prosecution by stealing intellectual property and violating its owners rights before there were LLMs. They just make it more convenient, not less illegal.
I think there's actually some potential here, considering LLMs are already very good at translating text between human languages. I don't think LLMs on their own would be very good, but a specially trained AI model perhaps, such as those trained for protein folding. I think what an LLM could do best is generate better decompiled code, giving better names to symbols, and generating code in a style a human is more likely to write.
I usually crap on things like chatgpt for being unreliable and hallucinating a lot. But in this particular case, decompilers already usually generate inaccurate code, and it takes a lot of work to fix the decompiled code to make it correct (I speak from experience). So introducing AI here may not be such a huge stretch. Just don't expect an AI/LLM to generate perfectly correct decompiled code and we're good (wishful thinking).
This is very close to how I often use LLMs [0]. A first step in deciphering code where I otherwise would need to, to use the author's words, power through reading the code myself.
It has been incredibly liberating to just feed it a spaghetti mess, ask to detangle it in a more readable way and go from there.
As the author also discovered, LLMs will sometimes miss some details, but that is alright as I will be catching those myself.
Another use case is when I understand what the code does, but can't quite wrap my head around why it is done in that specific way. Specifically, where the author of the code is no longer with the company. I will then simply put the method in the LLM chat, explain what it does, and just ask it why some things might be done in a specific way.
Again, it isn't always perfect, but more often than not it comes with explanations that actually make sense, hold up under scrutiny and give me new insights. It has actually prevented me once or twice from refactoring something in a way that would have caused me headaches down the line.
[0] chatGPT and more recently openwebUI as a front end to various other models (Claude variants mostly) to see the differences. Also allows for some fun concepts of having different models review each others answers.
Okay, but if the unminified code doesn't match the minified code (as noted at the end "it looks like LLM response overlooked a few implementation details"), that massively diminishes its usefulness — especially since in a lot of cases you can't trivially run the code and look for differences like the article does.
[ed.: looks like this was an encoding problem, cf. thread below. I'm still a little concerned about correctness though.]
It does seem that the unminified code is very close to the original. In some cases ChatGPT even did its own refactoring in addition to the unminification:
Note that the original code doesn't call `handleResize` immediately, but has its contents inlined instead. (Probably the minifier did the actual inlining.) The only real difference here is a missing `if (typeof window < "u")` condition.
This refers to the fact that the ChatGPT-generated version is missing some characters that are used in the original example. Namely, [looks like HN does not allow me to paste unicode characters, but I am referring to the block characters] can be seen in their version, but cannot be seen in the ChatGPT-generated version. However, it very well might be that it is simply because I didn't include all the necessary context.
Discrediting the entire output because of a few missing characters would be very pedantic.
Otherwise, the output is identical as far as I can tell by looking at it.
It's because the author mis-copy-pasted the original code: those "â–‘â–’â–“â–ˆ" at the end of the O5 string are supposed to be the block characters. E.g. "â–’" in Windows-1252 [0] is the byte sequence 0xE2 0x96 0x92, which in UTF-8 is exactly the encoding for U+2592 MEDIUM SHADE [1].
If you look for `oahkbdpqwmZO0QLCJUYXzcvunxrjft` in the output, you should see that those characters appear exactly like that. Maybe an issue with encoding of the script file?
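For reference, the two interpretations of those bytes are easy to see in a few lines (works in browsers and in Node builds with full ICU):

    const bytes = Uint8Array.of(0xe2, 0x96, 0x92);
    console.log(new TextDecoder("utf-8").decode(bytes));        // "▒" (U+2592 MEDIUM SHADE)
    console.log(new TextDecoder("windows-1252").decode(bytes)); // "â–’", the mojibake seen in the article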
Most definitely; if I use "View >> Repair Text Encoding" in Firefox, it shows the block characters. But I have to admit, it's strange that Firefox does not choose UTF-8 by default in this case.
Yes, turns out I was the one who made the mistake.
I updated the article to reflect the mistake.
> Update (2024-08-29): Initially, I thought that the LLM didn’t replicate the logic accurately because the output was missing a few characters visible in the original component (e.g., ). However, a user on HN forum pointed out that it was likely a copy-paste error.
>
> Upon further investigation, I discovered that the original code contains different characters than what I pasted into ChatGPT. This appears to be an encoding issue, as I was able to get the correct characters after downloading the script. After updating the code to use the correct characters, the output is now identical to the original component.
>
> I apologize, GPT-4, for mistakenly accusing you of making mistakes.
If no character set is specified, plain text content is assumed to be 1252. This probably extends to application/javascript as well but I'd have to check to be sure.
The web pre-dates utf-8, although not by much. Ken Thompson introduced utf-8 at winter Usenix in 1993 and CERN released the web in April, but it would be several more years before utf-8 became common. The early web was ISO 8859-1 by default. But people were pretty lazy about specifying character sets back then (still are actually) and Microsoft started sending or assuming their 1252 character set where 8859-1 was required by the spec. Eventually the spec was changed to match de facto behavior. I guess the assumption was that if you're too stupid or lazy to say what character set you're using, then it's probably 1252. (Today the assumption would be that it's probably utf-8). I'm not sure what the specs say today, but I think html is assumed to be in utf-8, and everything else is assumed to be 1252 (if the character set is not explicitly declared).
I recognized this a few months back when I wanted to see the algorithm that a website used to do a calculation. I just put the minified JS into ChatGPT and figured it out pretty easily. Let's take this a few steps out. What happens when an LLM can clone a whole SAAS app? Let's say I wanted to clone HubSpot. If an LLM can interact with a browser, figure out how a UI works, and take code hints from un-minified code, I think we could see all SAAS apps be commoditized. The backend would be proprietary, but it could figure out API formats and suggest a backend architecture.
All this makes me think AI's are going to be a strong deflationary force in the future.
>If an LLM can interact with a browser, figure out how a UI works, and take code hints from un-minified code, I think we could see all SAAS apps be commoditized. The backend would be proprietary, but it could figure out API formats and suggest a backend architecture.
whoooha! that's a lot of probing and testing of the SAAS that would be required in order to see how it behaved. SAAS aren't algorithms, they operate over data that's unseen on the front end as well...
>All this makes me think AI's are going to be a strong deflationary force in the future.
I don't get this. I've literally never worked anywhere that had enough software engineers; we've been going on about the software crisis for about 50 years, and things are arguably worse than ever. The gap between the demand for good software (in the sense that allocating capital to producing it would be sensible) and the fulfillment of that demand is bigger than ever. We just don't have the mechanisms to make this work, and to make it work at an economically viable level.
Then we get AI to help us and everyone thinks that the economy will shrink?
You wouldn't necessarily need to do much probing - consider that the documentation would provide numerous hints to the agent as to what each endpoint was actually doing.
Honestly, the value in most business software isn't the actual technology. It's the customer base and data held by the platforms.
Someone could already easily clone HubSpot relatively cheaply even if they hired developers, but that doesn't mean it will be anywhere near successful.
I had tweeted about this some time back. I found a component that was open source earlier and then removed, with only minified JS provided. Give the JS to Claude and get the original component back. It even gave the component good class names and function names.
Actually this opens up a bigger question. What if I like an open source project but don't like its license? I can just prompt an AI by giving it the open source code and asking it to rewrite it, or to write it in some other language. I'd have to look up the rules on whether this is allowed or would be considered copying, and how a judge would prove it.
Most likely you would be found guilty, because intent matters. It is easy to show that the generated code is very similar to the original code, and you clearly had a reason to bypass the original license. The exact legal reasoning would vary, but any reasonable lawyer would recommend against it.
In the historic Google v. Oracle suit, the only actual code that was claimed to be copied was a trivial `rangeCheck` function, but Google's intent and other circumstances like the identical code structure and documentation made it much more complicated, and the final decision completely bypassed the copyrightability of APIs possibly for this reason.
>I apologize, GPT-4, for mistakenly accusing you of making mistakes.
I am testing large language models against a ground truth data set we created internally. Quite often when there is a mismatch, I realize the ground truth dataset is wrong, and I feel exactly like the author did.
Apologizing to a program seems rather silly though. Do you apologize to your compiler when you have a typo in your code, and have to make it do all that work again?
LLMs are trained to predict the next token. But examples like these look like they have also 'learned patterns'. If rot13 is applied to this minified code, will an LLM still find meaning in it? If it could, it's doing more than just predicting next tokens. Need to try it.
edit: ChatGPT figured out that it's rot13, but couldn't explain the code directly without deobfuscating it first.
Claude 3.5 Sonnet can natively speak double base64 encoded English. And I do mean it - you can double b64 encode something, send to it, and it'll respond as if it was normal English. Obviously base64 is a simpler transformation than rot13, but no GPT models can deal with double b64.
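If anyone wants to reproduce these experiments, here's a rough Node.js sketch of both transformations (the sample strings are just placeholders):
const rot13 = s => s.replace(/[a-zA-Z]/g, c => {
  const base = c <= 'Z' ? 65 : 97;
  return String.fromCharCode((c.charCodeAt(0) - base + 13) % 26 + base);
});
const b64 = s => Buffer.from(s, 'utf8').toString('base64');
console.log(rot13('function add(a,b){return a+b}')); // paste the rot13 output into the model
console.log(b64(b64('that is quite interesting')));  // double base64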
It appears that OpenAI's GPT-4 model can speak base64 as well. I jumped to your comment to see if anyone else had tried it after reading the OP. I didn't try double b64, but that is interesting.
> $ ask4 ' what does dGhhdCBpcyBxdWl0ZSBpbnRlcmVzdGluZw== decode to? '
> A "dGhhdCBpcyBxdWl0ZSBpbnRlcmVzdGluZw==" is a Base64 encoded string. When decoded, It translates to "that is quite interesting" in English.
> Obviously base64 is a simpler transformation than rot13
Is it? It’s probably more obscuring from an LLM’s perspective, assuming the LLM has seen enough rot13 text during training. Spaces and punctuation are untouched by rot13, unlike base64, which means that word and sentence boundaries will still be denoted by tokens that denote those boundaries in plaintext.
> The provided code is quite complex, but I'll break it down into a more understandable format, explaining its different parts and their functionalities.
Reading the above statement generated by ChatGPT, I asked myself: will we live to see the day when these LLMs can take a large binary executable as input, read it, analyze it, understand it, and then reply with the above statement?
> I followed up asking to "implement equivalent code in TypeScript and make it human readable" and got the following response.. To my surprise, the response is not only good enough, but it is also very readable.
What if that day comes and we can ask these LLMs to rewrite the binary code in [almost] any programming language we want? This is exciting, yet scary, just to think about!
You should give it a try and report back! One easy way would be to take an open-source Android app, compile the APK, then decompile it, feed the bytecode to an LLM, ask it to write the Java/Kotlin equivalent, and compare the original source with the LLM-decompiled version.
Unminification (obfuscation removal) can also be applied to text. Most specialties develop a jargon that allows insiders to communicate complex ideas quickly; that shorthand excludes outsiders. Large language models can make specialist jargon transparent and thereby expand the circle of people who can understand specialized fields. Essentially, they solve the problem of mapping specialized, jargonized concepts to things the outside reader already knows. Anyone who wants to learn needs this, and I hope it will become part of students' learning paths.
The garbled text is included in the tree as relevant, pronounceable, and constantly changing text. Here's Chrome's accessibility tree: https://imgur.com/a/V1589Jr
(I'd love if a screen reader user could upload some audio of how awful this sounds, by the by)
Please use `aria-hidden="true"` for stuff like this, it just removes the element from the accessibility tree. I've also emailed Reactive a link to this thread.
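If the attribute can't be added in the markup itself, the same fix can be applied from JavaScript; a one-line sketch, where the selector is hypothetical:
document.querySelectorAll('.text-scramble').forEach(el => el.setAttribute('aria-hidden', 'true'));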
Big props for ARIA attributes - they’re so crucial for differently abled and impaired users. I’ve been combing through our project’s components lately to bring them up to design spec and have been taking a look at our accessibility - it’s so important and so easily missable for most engineers.
It is good at unminifying and "minifying" as well.
I have been doing the Leetcode thing recently, and even became a subscriber to Leetcode.
What I have been doing is I go through the Grind 75 list (Blind 75 successor list), look for the best big O time and space editorial answer, which often has a Java example, and then go to ChatGPT (I subscribe) or Perplexity (don't subscribe to Pro - yet) and say "convert this to Kotlin", which is the language I know best. Jetbrains IDE or Android Studio is capable of doing this, but Perplexity and ChatGPT are usually capable of doing this as well.
Then I say "make this code more compact". Usually I give it some constraints too - keep the big O space and time complexity the same or lower it, keep the function signature of the assigned function the same, and keep the return explicit, make sure no Kotlin non-null assertions crop up. Sometimes I continually have it run these instructions on each version of the iterated code.
I usually test that the code compiles and returns the correct answers for examples after each iteration of compacting. I also copy answers from one to the other - Perplexity to ChatGPT and then back to Perplexity. The code does not always compile, or give the right answers for the examples. Sometimes I overcompact it - what is clear in four lines becomes too confusing in three compacted lines. I'm not looking for the most compact answer, but a clear answer that is as compact as possible.
One question asked about Strings and then later said, what if this was Unicode? So now for String manipulation questions I say assume the String is Unicode, and then at the end say show the answer for ASCII or Unicode. Sometimes the big O time is tricky - it is time O(m+n), say, but since m is always equal to or less than n in the program, it is actually O(n), and both Perplexity and ChatGPT can miss that until it is explained.
People bemoan Leetcode as a waste of time, but I am wasting even less time with it, as ChatGPT and Perplexity are helping give me the code I will be demonstrating in interviews. The common advice I have heard from everywhere is don't waste time trying to figure out the answers myself - just look at the given answers, learn them, and then look for patterns (like binary search problems, which are usually similar), so that is what I am doing.
Initially I was a ChatGPT and Perplexity skeptic for early versions of those sites, in terms of programming, as they stumbled more, but these self-contained examples and procedures they seem well-suited for. Not that they don't hallucinate or give programs that don't compile, or give the wrong answers sometimes, but it saves me time ultimately.
I have been told by people working in $200k+/$300k+ SWE jobs to look up at the answers and just be able to regurgitate something along the lines of the Grind 75 answers as a first step.
As a next step - even within these 75 questions, Grind 75's eighth answer and fourteenth answer are answered essentially the same way, as are other questions in there. So the next step would be to see these patterns (binary search, priority queues, sliding window, backtracking) and how to answer them, and then be able to solve them in slightly novel problems (in the more complex questions I understand one might run into more than one of these patterns).
This is a good way to do it IMO. Though I would say you don't want to just memorize answers; you want to fully understand them. Also, paying for LeetCode premium is very helpful since their official solutions are easy to understand and explain how you might arrive at these solutions yourself.
Train on Java compiled to class files. Then go from class files back to Java.
Or even:
Train Java compiled to class files, and have separate models that train from Clojure to class files and Scala to class files. Then see if you can find some crufty (but important) old Java project and go: crufty Java -> class -> Clojure (or Scala).
If you could do the same with source -> machine instructions, maybe COBOL to C++! Or whatever.
LLM source recovery from binaries is a thing. The amazing part is that they are pretty good at adding back meaningful variable names to the generated source code.
This is something you don't need AI for, there are many decompilers out there already as well.
AI cannot even lint properly right now and you want it to decompile? Good luck. There's so much hype going on that people really think this is possible this year?
In the end, always remember it's just autocomplete; it's pretty terrible at translations that are not natural language to natural language. I worked on a natural-language-to-SQL product and it was impossible to make it consistently generate valid SQL for Postgres, and that's natural language to SQL, not virtual machine instructions...
LLMs are very good at reading text. LLMs read tokenized text, while humans use their eyes to view words. Another scenario: ChatGPT is good at analyzing C++ template error messages, which are usually long and hard for humans to understand.
You can do this on minified code with beautifiers like js-beautify, for example. It's not clear why we need to make this an LLM task when we have existing simple scripts to do it?
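For example, a minimal sketch with the js-beautify npm package (the file name is just illustrative):
const { js: beautify } = require('js-beautify');
const fs = require('fs');
const minified = fs.readFileSync('yourscript.min.js', 'utf8');
// Re-indents and reflows the code, but keeps the mangled identifiers
console.log(beautify(minified, { indent_size: 2 }));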
While this doesn't restore minified identifiers, like the LLM version claims to do, it tends to help a lot with understanding the code. Usually minified code still has the original identifiers in global function names, object attributes, DOM classes and a few other places where it is hard to guarantee no side effects of name mangling. This makes guessing the purpose of the remaining identifiers substantially easier for a human, and it is probably also the main reason why an LLM is capable of making good guesses at what they could reasonably be called.
I can see some ways to use this while easily checking that the LLM is not hallucinating parts of it: ask the LLM to unminify (or deobfuscate) some component, then have the LLM write unit tests, then manually check that the unit tests are meaningful and don't miss anything in the unminified code, then run the tests on the original minified version to confirm the LLM's work, and maybe set up some mutation testing if it is relevant.
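A rough sketch of that workflow in Node.js; the module paths, the `chunk` function, and the test cases are all hypothetical:
const assert = require('assert');
const minified = require('./widget.min.js');
const readable = require('./widget.unminified.js');
// Human-reviewed test cases, run against both versions
const cases = [[['a', 'b', 'c', 'd'], 2], [[], 3], [[1], 5]];
for (const [input, size] of cases) {
  assert.deepStrictEqual(readable.chunk(input, size), minified.chunk(input, size));
}
console.log('LLM output matches the original minified behaviour for these cases');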
Unfortunately, the comments that could be generated are exactly the ones that should never be written. You want the comment to explain why: the information missing from the code.
This is something I have always disagreed with. In my experience, I'd rather read a short comment explaining the purpose of a block of code than try to decipher it. Yes, code "should speak for itself", but reading a comment is almost always faster than reading blocks of code.
And then there is also documentation (if you include it in what you define as comments). I'd much rather go through a website with a search function, examples, and descriptions, made with some docgen tool, than go through a library's or programming language's source code every time I need to remember how to do X, or whether object B implements function Y...
It's just a rule of thumb, like anything else. In most code, "why" is the hard part; I see that you are incrementing that account by a penny from out of the blue, but why? When you are in code where "what" is the hard part, like an implementation of a book algorithm or some tricky performance optimization, then by all means comment that.
Really all this rule amounts to is
// Increment by a penny
accountValue += 1
is a pointless comment, please don't do that. Schools had a way of accidentally teaching that by too-rigidly requiring "commented code", in situations where there wasn't much else to say, or situations where the students themselves didn't necessarily have a strong sense of "why". Any comment that isn't just literally "this is what the next line does" is probably useful to somebody at some point.
I do agree that documenting the why is way more important than the how/what. But having a short comment to summarize a block of code like:
// Parse the filename and remove the extension
let fext_re = Regex::new(r"(.\*)\.(.+)$").unwrap();
let page_cap = fext_re.captures(fname).unwrap();
let page_base_filename = page_cap.get(1).unwrap().as_str();
is still useful. Instead of having to read the next few lines of code, I already know what they are supposed to do and what to expect.
It makes discovery, later down the line, easier.
This would be entirely self-documenting if you replaced it with a function named after what it does; then the comment isn't necessary.
To boot, a unit test could be written that would reveal the bug in the regular expression that makes it only work with filenames that have an asterisk before the extension. Unless you intended that (unlikely), in which case the comment is wrong/not comprehensive and misdirects the reader.
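For illustration, a JavaScript sketch of that suggestion, using the regex presumably intended (without the stray asterisk); the function name replaces the comment:
// Hypothetical helper: the name documents the block
function baseFilenameWithoutExtension(fname) {
  const match = /(.*)\.(.+)$/.exec(fname);
  return match ? match[1] : fname;
}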
You can put these comments into the name of a function, getting rid of the redundancy, so whoever is just reading the code isn't distracted by the comments.
> reading a comment is almost always faster than reading blocks of code
Not to a competent programmer when reading well-written code.
This also means that you read what the code does, rather than what a comment says the code does. Otherwise you will be blind to bugs. Any experienced developer will tell you that code very often doesn't do what the original programmer thought it did.
> Not to a competent programmer when reading well-written code.
No, literally reading one line about what the next 4 lines do is mechanically faster. It does not matter whether you are good or bad; it is about simple reading speed.
> This also means that you read what the code does, rather than what a comment says the code does. Otherwise you will be blind to bugs. Any experienced developer will tell you that code very often doesn't do what the original programmer thought it did.
I am an experienced developer. I have worked on several "legacy" projects, and started many from 0.
1. It does not make you blind to anything; it is just a way to learn/direct yourself around the code base faster.
2. Knowing what the original developer wanted is often as useful as knowing what the code actually does. More info is better than no info.
Even outdated comments can be useful.
For me, this type of thinking - that comments are unnecessary, that competent people can just read the code, etc. - is actually a sign of a younger dev who has never had to work on a long-lived codebase.
> For me, this type of thinking - that comments are unnecessary, that competent people can just read the code, etc. - is actually a sign of a younger dev who has never had to work on a long-lived codebase.
It sounds like you're conflating "helpful comments that explain why" with "no comments are needed ever because read the code", and we're talking past each other.
Like with all LLMs, you greatly benefit from prior experience, or you risk just falling for hallucinations, which is a limitation of a non-deterministic black box and degrades performance relative to the task. I've commented in other threads that LLMs are great at amplifying my output in an area where I already have domain knowledge. I think this is why people fail to realize any gains or give up: they think it will unlock areas they don't fully understand themselves. A blind-leading-the-blind problem.
An interesting use-case of this capability is refactoring, which, for me, ChatGPT has been unmistakably good at. It's amazing how I can throw garbage code I wrote at ChatGPT, ask it to refactor, and get clean code that I can use without worrying if it's going to work or not, because in 99% of cases it works without breaking anything.
I've had pretty good results dumping entire files in to Sonnet3.5.
For example, "Here's my app.js file, please add an endpoint for one user to block another. Feel free to suggest schema changes. Please show me the full app.js with these changes implemented"
The model seems to be great at figuring out frameworks and databases just by seeing the contents of a full app.js file.
I do find this type of prompt works much better with Sonnet3.5 than GPT4o.
No, most other languages absolutely don’t work as well as JS, simply because there’s been less training material available. It’s useless with Rust, for example (hell, I’d be totally impressed if it has any idea how to appease the borrow checker!)
LLMs are great for self-contained boring tasks; recently I have started to refactor Ruby tests with a simple prompt (getting rid of various rspec syntax in favor of more explicit notation at the cost of code duplication - so kind of like unminifying things, I guess) - and it works _ridiculously_ well as well.
Slightly off-topic, but I remain perplexed at how "minified" JavaScript is acceptable to software developers commenting online, while terse code in any other language, e.g., one-letter variable names, is unacceptable to a majority of those same online commenters.
There is also Topiary. From their website: "The universal code formatter". I think it doesn't work with JavaScript source at the moment, but it surely will in the future.
Would have been cool if this had been used in that air con reverse engineering story yesterday.
I noticed while reading the blog entry that the author described using a search engine multiple times and thought, "I would have asked ChatGPT first for that."
Are there any serious security implications for this? Of course obfuscation through minification won't work anymore, but I'm not sure if that's really all that serious of an issue at the end of the day.
I’ve tried using LLMs to deobfuscate libraries like fingerprintjs-pro to understand what specific heuristics and implementation details they use to detect bots.
They mostly fail. A human reverse engineer will still do better.
I’m hoping LLMs get better at decompiling/RE’ing assembly because it’s a very laborious process. Currently I don’t think they have enough in their training sets to be very good at it.
Not exactly, because you still have to pay for every distinct identifier present in your code. Also, many minifiers do constant folding and inlining and remove comments, all of which strip redundant or unused information that gzip alone would still have to carry.
I don’t think they’re saying that minifying provides no additional space savings, but rather that those additional savings are small and not worth the tradeoffs.
Not even that is true, to my knowledge. For example, a particular benchmark [1] demonstrates that many popular libraries still benefit significantly from minification even after gzip compression, with savings ranging from 35% to 75%. Sure, a small library would be fine without any minification or even compression, but otherwise minification is clearly beneficial.
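This is easy to check for any library with a few lines of Node.js (the file names here are placeholders):
const fs = require('fs');
const zlib = require('zlib');
for (const file of ['library.js', 'library.min.js']) {
  const raw = fs.readFileSync(file);
  // Compare the raw size with the size after gzip for each variant
  console.log(file, raw.length, 'bytes raw,', zlib.gzipSync(raw).length, 'bytes gzipped');
}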
I think you have to look at this in the context of an entire bundle or project, and then you have to weigh it against the download speeds you’re generally expecting for the users of your site or app.
I agree that as a blanket statement “gzip is enough” is not technically correct, but I think it’s largely correct in spirit, in that people tend to reach for minification by default, without really thinking about what they’re gaining.
If minifying saves you 200 KB overall, for example, and you expect your average user to have a 200 Mbps connection, you’re saving a grand total of 8 ms on page load, which is an imperceptible difference on its own. In exchange, you’re getting worse debugging, and worse error reporting.
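For reference, the arithmetic behind that 8 ms figure, using the same assumed numbers, so you can plug in your own:
const savedBytes = 200 * 1024;      // 200 KB saved by minification
const bytesPerSecond = 200e6 / 8;   // 200 Mbps connection
console.log((savedBytes / bytesPerSecond) * 1000, 'ms'); // ~8 ms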
Minification would indeed be useless under that set of assumptions, but the real world is much more variable and you need a comfortable margin. For example, mobile devices rarely sustain that much bandwidth all the time.
Broadly speaking, minification is only a small step in building a performant website or web application. There are many more things to do; choosing the right image compression format and method, for example, would generally have much more impact. But not everyone can be expected to understand all of them in depth, so we have best practices. Minification therefore qualifies as a good best practice, even though it is just one among many.
I think probably not, if the assets are coming from the same place, since the connection will be reused in most modern situations. Maybe if you’re loading the JS from a CDN though, and there are no other large resources, or those resources come from a different server.
I find LLMs good at these kinds of tasks, for example converting CSV to JSON (although you have to remind them not to be lazy and to do the whole file).
It is also shockingly good at converting/extracting data to CSV or JSON, but not JSONL. Even the less capable model, `gpt-4o-mini`, can "reliably" parse database schemas in various formats into a consistent CSV structure.
I have been running it in production for months[1] as a way to import and optimize database schemas for AI consumption. This performs much better than including the `schema.sql` file in the prompt.
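For the curious, a rough sketch of that kind of call with the official openai Node package; the prompt wording and CSV columns here are my own guesses, not the exact ones used in production:
import OpenAI from 'openai';
const client = new OpenAI(); // reads OPENAI_API_KEY from the environment
const schema = '...contents of schema.sql...'; // placeholder
const completion = await client.chat.completions.create({
  model: 'gpt-4o-mini',
  messages: [
    { role: 'system', content: 'Convert the database schema to CSV rows: table,column,type,nullable' },
    { role: 'user', content: schema },
  ],
});
console.log(completion.choices[0].message.content);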
I have to ask the obvious question: how do you know the unminified code is semantically equivalent to the minified code? If someone knows how to verify LLM code transformations for semantic fidelity then I'd like to know because I think that would qualify as a major breakthrough for programming languages and semantics.
LLMs are good at modeling and transforming text, news at 11. AI proponent hypes AI. I could go on, but I shouldn't have been this sarcastic to start with
Is it though? The browser developer tools have a pretty-print ("unminify") button which yields similar results. Undoing JavaScript minification is not hard in any way, and guessing variable names is not that hard given such a simple code example.
Don't know if this will apply directly here, but --
As someone who is "not a developer", I use the following process to help myself:
1. I set up StyleGuide rules for the AI, telling it how to write out my files/scripts:
- Always provide full path, description of function, invocation examples, and version number.
- Frequently have it summarize and explain the project, project logic, and a particular file's functions.
- Have it create a README.MD for the file/project
- Tell it to give me mermaid diagrams and swim diagrams for the logic/code/project/process
- Close prompts with "Review, Explain, Propose, Confirm, Execute" <-- This has it review the code/problem/prompt, explain what it understands, propose what it's been asked to provide, confirm that it's correct (or I add more detail here) - then execute and go ahead with creating the artifacts.
I do this because Claude and ChatGPT are FN malevolent in their ignoring of project files/context - and they hallucinate as soon as their context window/memory fills up.
Further, they very frequently "forget" to refer to the uploaded project context files/artifacts they themselves have proposed and written, etc.
But - asking for a README with code, mermaid diagrams, and the logic is helpful to keep me on track.
Agents like Aider or Plandex wrap that up nicely. They do the automatic review and have a very verbose description of the edit format. If you do that often manually, it may be worth testing their prepackaged approach.
https://github.com/jehna/humanify