Despite all the “Apple is evil” or “Apple is behind” (because they don’t do evil...

bigyabai · 2025-07-18T00:54:32 1752800072

Apple is behind. People forget that Google was shipping mobile-scale transformer-based LLMs in 2019: https://github.com/google-research/bert

By the time Apple has an AI-native product ready, people will already associate it with dehumanization and fascism.

OldfieldFund · 2025-07-18T14:46:11 1752849971

I think it's a great move that Apple is cautious with including a hardcore LLM in everything. They are not that useful to the regular user.

bigyabai · 2025-07-18T17:34:48 1752860088

Nobody is forcing "hardcore LLM" features on anyone, besides maybe Microsoft. This is that same cope as "I'm glad Apple Car can't crash and threaten people's lives" despite the fact that... yunno, Apple really wanted to bring it to market.

Siri, sideloading and AI features all all the same way; give people options and nobody will complain.

OldfieldFund · 2025-07-19T18:08:08 1752948488

If they give Siri LLMs, there will be headlines that it drove kids to suicide. People really don't need LLMs.

Sideloading is bad for business. Most users don't care. Remember, we, the devs, are not the core target/biggest spenders. They are targeting a large audience of young people who are not tech-savvy.

0x457 · 2025-07-17T22:54:50 1752792890

Guided generation is called "Structured Output" by other providers?

Well partially generated content streaming thing is great and I haven't seen it anywhere else.

ApolloVonZ · 2025-07-17T23:21:37 1752794497

Sorry if I didn’t use the correct terms. Didn’t catch up on all the terminology coming from my native language. ;) But yes, I agree, the fact that parts, different parameters, of the model can be completed asynchronous by streaming the output of the model, is quite unique. Apple/swift was late with async/await, but putting it all together, it probably plays well with the ‘never’ (I know ) asynchronous and reactive coding.

astrange · 2025-07-17T23:27:45 1752794865

An issue with this is that model quality can get a lot lower when you force it into a structured form, because it's out of distribution for the model.

(I'm pretty sure this is actually what drove Microsoft Sydney insane.)

Reasoning models can do better at this, because they can write out a good freeform output and then do another pass to transform it.

0x457 · 2025-07-18T00:58:30 1752800310

I have this toy agent I'm writing, I always laugh that I, human, write a code that generates human-readable markdown, that I feed to llm where I ask it to produce a json, so I can parse (by code I, or it wrote) and output in a consistent human-readable form.

I'm thinking about let it output freeform and then use another model to use to force that into structured.

zos_kia · 2025-07-18T12:26:22 1752841582

I've found this approach brings slightly better result indeed. Let the model "think" in natural language, then translate it's conclusions to Json. (Vibe checked, not benchmarked)

astrange · 2025-07-18T01:35:32 1752802532

IIRC yaml is easier for models than json because you don't need as much recursive syntax.

t1amat · 2025-07-18T01:53:23 1752803603

I doubt this is true anymore, if ever. Both require string escaping, which is the real hurdle. And they are heavily trained on JSON for tool calling.

0x457 · 2025-07-22T05:03:15 1753160595

I believe it could be true because I think training dataset contained a lot more yaml than json. I mean...you know how much yaml get churned out every second?

lxgr · 2025-07-18T02:09:12 1752804552

How do you think their implementation works under the hood? I'm almost certain it's also just a variant of "structured outputs", which many inference providers or LLM libraries have long supported.

a_wild_dandan · 2025-07-18T00:24:24 1752798264

Huh? Grammar-based sampling has been commonplace for years. It's a basic feature with guaranteed adherence. There is no "carefully crafting" anything, including safeguards.