The syscalls involved are reachable from within a lot of sandboxes, so the worst (or best, depending on your point of view) case scenario is a pretty universal privesc. There are a lot of steps to get there, though. I'm not super familiar with the mbuf subsystem specifically, but I'm going to guess mbufs live in their own allocator zone. That means you're practically guaranteed to overwrite an adjacent m_hdr structure. Those contain pointers that form a linked list, and at first glance I don't see linked-list hardening or zone checks in the MBUF macros. One could envision turning this bug into a kASLR leak as well as a kernel r/w primitive, and while that isn't the silver bullet it used to be on XNU (because of a whole host of hardening Apple put in), it's still pretty powerful.
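To make that concrete, here's a rough sketch of why clobbering the neighboring mbuf header is so attractive. The layout below is a from-memory paraphrase of the classic BSD m_hdr, not a verbatim copy of XNU's bsd/sys/mbuf.h, so treat the exact fields and types as illustrative:

```c
#include <sys/types.h>      /* caddr_t */

struct mbuf;                /* forward declaration is enough for the sketch */

/* Illustrative only: a simplified BSD-style mbuf header, paraphrased from
 * memory rather than copied from XNU source. */
struct m_hdr {
    struct mbuf *mh_next;     /* next mbuf in this chain  -> linked-list pointer  */
    struct mbuf *mh_nextpkt;  /* next chain in the queue  -> another list pointer */
    caddr_t      mh_data;     /* where the payload lives  -> corrupt this and the
                               * kernel's copy routines touch an address you chose */
    int32_t      mh_len;      /* payload length           -> corrupt this and they
                               * touch as many bytes as you like */
    int16_t      mh_type;
    int16_t      mh_flags;
};

/*
 * If mbufs really do sit in their own zone, an overflow off the end of one
 * mbuf's data lands in the next mbuf's m_hdr. Corrupt mh_data/mh_len and
 * ordinary mbuf copies become a read/write primitive; corrupt
 * mh_next/mh_nextpkt and every chain walk dereferences a pointer you
 * supplied, which is also where a kASLR leak could come from.
 */
```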
Posting the hash to twitter as a proof that "something" exists reveals no actual information, so it's not considered making the exploit "public" in any meaningful way.
From the blog's timeline, the bug has been visible in code diffs since ~April, but it was only called out as a CVE 10 days ago, so I'd consider this one hot off the presses.
There is a bigger chance of a toddler smashing a keyboard finding a bug than of GPT-5 finding one. LLMs can't understand intent, so they effectively work like `grep` with little to no understanding of context, and most of the time they will false-flag good code.
There are already a lot of tools for finding bugs, like fuzzers, but I am sure that LLMs won't become one of them.
they don't need to understand intent, they just need to find exploits. they don't even need to do it by reading code alone - give them a vm running the code and let them throw excrement at it until something sticks!
which links to a Register Article[0], which links to a paper[1]:
"In this work, we show that LLM agents can autonomously exploit one-day
vulnerabilities in real-world systems. To show this, we collected a dataset of
15 one-day vulnerabilities that include ones categorized as critical severity
in the CVE description. When given the CVE description, GPT-4 is capable
of exploiting 87% of these vulnerabilities compared to 0% for every other
model we test (GPT-3.5, open-source LLMs) and open-source vulnerability
scanners (ZAP and Metasploit). Fortunately, our GPT-4 agent requires the
CVE description for high performance: without the description, GPT-4 can
exploit only 7% of the vulnerabilities."[1]
Yeah, that works for web vulns where the vuln description is practically the exploit anyway. I could write a Perl script that parses out variable names and writes SQL injections for it.
For comparison, in the native world a program is considered vulnerable when someone finds an arbitrary write primitive (even without a leak), a use-after-free, or even a double free. There is a huge gap between those and actually having a working RCE exploit. Most CVEs in this space are assigned without a working exploit ever being written.
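For readers outside that world, the kind of bug that clears the "vulnerable" bar can be as small as this (entirely hypothetical code, just to illustrate use-after-free and double free; nothing here is from the CVE under discussion):

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical example of the bug classes mentioned above, not real code
 * from any project. */
void handle_packet(const char *pkt, size_t len) {
    char *buf = malloc(64);
    if (buf == NULL)
        return;

    if (len > 64)
        free(buf);            /* bug: buf is freed but not NULLed... */

    memcpy(buf, pkt, len < 64 ? len : 64);  /* ...so this is a use-after-free */
    free(buf);                              /* ...and this is a double free */
}
```

Getting from something like that to a reliable exploit on a mitigated target is the gap being described here.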
GPT-5, maybe not. But somebody somewhere is building something that can do that. And if they can't do it _now_ they have a plan that tells them what's missing. TLDR; it's coming, soon.
and lots of people are spending lots of time and money on AI coding assistants... which is more or less the knowledge base you need.
If they could use that structural training to answer queries like "Is there any code path where some_dangerous_func() is called without its return value being checked"...
You can do this today by querying the AST output by a compiler. Regardless, the parent comment was talking about exploits, not vulnerabilities/bugs. Vulns are a dime a dozen compared to even PoC exploits, let alone shippable exploits.
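As a sketch of what "querying the AST" can look like (assumptions: libclang is installed, the file to scan is passed as argv[1], and some_dangerous_func is the hypothetical function from the comment above), something like this flags calls whose result is discarded:

```c
#include <stdio.h>
#include <string.h>
#include <clang-c/Index.h>   /* build with: cc scan.c -lclang */

/* Hypothetical target name, taken from the comment above. */
static const char *TARGET = "some_dangerous_func";

/* Flag CALL_EXPRs sitting directly inside a compound statement, i.e. the
 * return value is discarded. (A (void) cast or similar would need extra
 * handling; this is only a sketch.) */
static enum CXChildVisitResult visit(CXCursor cursor, CXCursor parent,
                                     CXClientData data) {
    (void)data;
    if (clang_getCursorKind(cursor) == CXCursor_CallExpr &&
        clang_getCursorKind(parent) == CXCursor_CompoundStmt) {
        CXString name = clang_getCursorSpelling(cursor);
        if (strcmp(clang_getCString(name), TARGET) == 0) {
            CXFile file;
            unsigned line;
            clang_getSpellingLocation(clang_getCursorLocation(cursor),
                                      &file, &line, NULL, NULL);
            CXString fname = clang_getFileName(file);
            printf("%s:%u: result of %s ignored\n",
                   clang_getCString(fname), line, TARGET);
            clang_disposeString(fname);
        }
        clang_disposeString(name);
    }
    return CXChildVisit_Recurse;
}

int main(int argc, char **argv) {
    if (argc < 2)
        return 1;
    CXIndex idx = clang_createIndex(0, 0);
    CXTranslationUnit tu = clang_parseTranslationUnit(
        idx, argv[1], NULL, 0, NULL, 0, CXTranslationUnit_None);
    if (tu == NULL)
        return 1;
    clang_visitChildren(clang_getTranslationUnitCursor(tu), visit, NULL);
    clang_disposeTranslationUnit(tu);
    clang_disposeIndex(idx);
    return 0;
}
```

In practice you'd probably reach for clang-query or CodeQL rather than hand-rolling a visitor, but the point stands: finding the pattern is the easy part.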
You're either being sarcastic or wildly underestimating how hard it is to write an exploit. I haven't written about exploit dev publicly for a _long_ time, but I invite you to read https://fail0verflow.com/blog/2014/hubcap-chromecast-root-pt... for what I consider to be a pretty trivial exploit of a very "squishy" (industry term) target.
XNU isn't the hardest target to pop but it is far from the easiest.
There's nobody more confident in the world than an HN poster writing about a topic they have no experience with.
There is a huge gap (in the binary exploitation world) between identifying a problematic code pattern and having a workable bug (a reproduction), and an even larger one between a reproducible crash and a working exploit (because we're not in the 90s anymore and compiler/hardware mitigations are effectively always enabled). Current LLMs can cross neither gap, and are not even close to bridging the second one.