Yeah, let's pretend it works. So far structured output from an LLM is an exercise in programmers' ability to code defensively against responses that may or may not be valid JSON, may not conform to the schema, or may just be null. There's a new cottage industry of modules that automate dealing with this crap.
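The defensive coding the comment describes can be sketched in a few lines; this is a toy helper (`safe_parse` is a hypothetical name, not from any library) that rejects the three failure modes mentioned: non-JSON, schema violations, and null:

```python
import json

def safe_parse(raw, required_keys):
    """Defensively parse an LLM response: reject non-JSON text,
    null, non-objects, and objects missing required keys."""
    try:
        data = json.loads(raw)
    except (json.JSONDecodeError, TypeError):
        return None  # model returned prose, markdown, or garbage
    if not isinstance(data, dict):
        return None  # catches null, bare strings, arrays, numbers
    if not required_keys.issubset(data):
        return None  # "schema" violation: missing expected fields
    return data

# Typical failure modes:
print(safe_parse('{"name": "x", "age": 3}', {"name", "age"}))  # parses
print(safe_parse("Sure! Here's your JSON: {...}", {"name"}))   # None
print(safe_parse("null", {"name"}))                            # None
```

Real-world versions layer full JSON Schema validation on top, but the shape of the problem is the same.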
No? With structured outputs you get valid JSON 100% of the time. This is a non-problem now. (If you understand how it works, it really can't be otherwise.)
The guarantee promised in link 1 is not supported by the documentation in link 2. Structured Output does a _very good_ job, but still sometimes messes up. When you’re trying to parse hundreds of thousands of documents per day, you need a lot of 9s of reliability before you can earnestly say “100% guarantee” of accuracy.
Whether it's a non-problem or not very much depends on how much the LLM API providers actually bother to add enforcement server-side.
Anecdotally, I've seen Azure OpenAI services hallucinate tools just last week, when I provided an empty array of tools rather than not providing the tools key at all (silly me!). Up until that point I would have assumed that there are server-side safeguards against that, but now I have to consider spending time on adding client-side checks for all kinds of bugs in that area.
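The client-side check amounts to never sending the key at all when there are no tools. A minimal sketch (`build_payload` and the model name are illustrative, not any SDK's API):

```python
def build_payload(messages, tools=None):
    """Build a chat-completion request body, omitting the `tools`
    key entirely when there are no tools, instead of sending an
    empty array that the server might mishandle."""
    payload = {"model": "gpt-4o", "messages": messages}
    if tools:  # both None and [] skip the key
        payload["tools"] = tools
    return payload
```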
You are confusing API response payloads with structured JSON that we expect to conform to a given schema. It's carnage that requires defensive coding. Neither OpenAI nor Google is interested in fixing this, because some developers decide to retry until they get valid structured output, which means they spend 3x-5x more on calls to the API.
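The retry-until-valid pattern being criticized looks something like this sketch (`call_llm` is a hypothetical zero-argument function standing in for a billed API call):

```python
import json

def call_with_retries(call_llm, max_attempts=3):
    """Retry an LLM call until the response is parseable JSON,
    up to a cap. Every retry is another billed API call -- the
    3x-5x cost multiplier mentioned above."""
    for _ in range(max_attempts):
        raw = call_llm()
        try:
            return json.loads(raw)
        except json.JSONDecodeError:
            continue  # pay again and hope
    return None

# Simulated flaky model: fails twice, then returns valid JSON.
responses = iter(['oops', '{broken', '{"ok": true}'])
print(call_with_retries(lambda: next(responses)))  # {'ok': True}
```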
Ansible, puppet, chef, salt, cfengine... There are tons of tools that are more precise and succinct for describing sensitive tasks, such as managing a single or a fleet of remote servers. Using MCP/LLMs for this is... :O
There are security / reliability concerns, true, but finally getting technically close to Star Trek computers and then still doing things 'the way they've always been done' doesn't seem efficient.
I don't know if you understand the role the LLM is playing here. The mechanism used to execute the command is not the relevant thing. The LLM autonomously executing commands has intelligence; it's not just a shell script. If I ask it to do a task and it runs into an issue, LLMs like Claude can recognize the problem and find a way to resolve it. Script failed because of a missing dependency? It'll install it. Need a config change? It'll do it. The SSH MCP is just the interface for the LLM to do the work.
You can give an LLM a github repo link, a fresh VPC, and say "deploy my app using nginx" and any other details you need... and it'll get it done.
> just how loud the expectations around AI have become, especially among non-technical folks.
This. It's bordering on mass madness. I am taking 2-4 calls a week from "two guys from ..." with mad ideas and unrealistic expectations of what it takes to build and maintain an AI product. I've seen it with early internet rush, Web 2.0, and crypto before.
I realized long ago that it is not just the big corps: after a while, every employee of a company begins to think of everything from the inside out. Their focus, their work, their playbook always start from themselves. There are only a very few who think from the outside in.
During my consultation, the team I was helping kept talking about "Our App", "Our Process", "Our Use", "How do we get this data into our System?" I had to ask them multiple times: "How do your users or customers outside of your company use them?" "Have you thought about how people usually do these kinds of steps?"
That’s a meme created and spread by pseudo-journalist Declan McCullagh specifically to tar Al Gore in the lead-up to the 2000 election.
Specifically, Gore said in an interview that he “took the initiative in creating the Internet” by introducing the bill to allow commercial traffic on ARPAnet, which McCullagh twisted in an article to “Al Gore claimed he invented the Internet” in order to smear him.
Given how close that election turned out to be, this smear campaign likely changed the presidency, and given George WMD Bush's actions, changed the course of the world for the worse in many ways. (For those who were too young or not yet born at the time: these jokes were MASSIVE, to the extent that they became largely what Al Gore was known for, for years afterward. So it's not much of an exaggeration to say they had a material impact on his perception and hence the votes.)
Al Gore understood technology and the internet, was a champion for the environment, and it's unbelievable today that he came that close to the presidency (and then lost). When people say "we live in the bad timeline", one of the closest good timelines is probably one where this election went differently.
> Al Gore, a strong and knowledgeable proponent of the Internet, promoted legislation that resulted in President George H.W Bush signing the High Performance Computing and Communication Act of 1991. This Act allocated $600 million
> In the early 1990s the Internet was big news ... In the fall of 1990, there were just 313,000 computers on the Internet; by 1996, there were close to 10 million. The networking idea became politicized during the 1992 Clinton–Gore election campaign, where the rhetoric of the information highway captured the public imagination.
Your parent comment is either joining in on the ridicule or at least misquoting:
> Gore became the subject of controversy and ridicule when his statement, "I took the initiative in creating the Internet", was widely quoted out of context. It was often misquoted by comedians and figures in American popular media who framed this statement as a claim that Gore believed he had personally invented the Internet.[54] Gore's actual words were widely reaffirmed by notable Internet pioneers, such as Vint Cerf and Bob Kahn, who stated, "No one in public life has been more intellectually engaged in helping to create the climate for a thriving Internet than the Vice President."
AI is a tool. Just like with a drum machine or a DAW, you need to be a musician to be able to use it to create something worthwhile. And just like sampling, drum machines and DJing didn’t kill acoustic music, AI won’t either. It will merely create a new type of music that will coexist along with all of the other types of music just fine.
AI just raises the level of abstraction and therefore the capabilities of an individual.
Since you brought up sampling in music and I feel compelled to point this out in any AI thread when that gets mentioned:
sampling machines don't give you a free pass to sample music all willy-nilly; if you're going to publish the result for commercial gain, you have to clear the sample, and the original artist gets royalties from it. This is something that was fought for and won by musicians: https://en.wikipedia.org/wiki/Grand_Upright_Music,_Ltd._v._W....
(Not that you implied otherwise, I just want to point that out)
Thanks for pointing that out, I actually didn’t know that! It is very relevant, although it doesn’t affect my core message. Let’s see what the future holds.
Bell Labs had infinite money. Their owners made money every time someone picked up a phone. Not all businesses are that embedded in the society and those that have boards that might like the idea of funding their own labs have to answer to the higher power--the Wall Street crowd, who will force you to optimise for maximum profit in the shortest amount of time. You get there fastest by cutting costs, especially the costs of long-term research that may not bear fruit.
What annoys me is every programmer who wishes their favourite language / feature were as popular as Python and so chooses to implement it in Python to make Python "better". Python was created as a dynamically typed language. If you want a language with type checking, there are plenty of others available.
Rust devs in particular seem hell-bent on replacing all other languages by stealth, which is both obviously visible and annoying, because they ignore what they don't know about the ecosystem they choose to target. As cool as some of the tools written for Python in Rust are (ruff, uv), they are not a replacement for Python. They don't even solve some annoying problems that we have workarounds for; sometimes they create new ones. Case in point is uv, which offers custom Docker images. Hello? A package manager is not supposed to determine the base Docker image or Python version for the project. It's a tool, not even an essential one since we have others, so know your place. As much as I appreciate some of the performance gains, I do not appreciate the false narratives spread by some Rust devs about the end of Python/JavaScript/Golang based on the fact that Rust allowed them to introduce faster build tools into other languages' build chains. The Rust community is quickly evolving into the friends you are embarrassed to have, a bit like a JVM-based language that suddenly has a bunch of Enterprise Java guys showing up to a Kotlin party and telling everyone "we can be like Python too...".
This argument doesn't make a whole lot of sense because nothing about type annotations constrains Python code at all. In fact, because they're designed to be introspectable, they make Python even more dynamic and you can do even crazier stuff than you could before. Type checkers are working very hard to handle the weird code.
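To illustrate the "annotations make Python more dynamic" point: annotations are plain data you can read back at runtime and drive behavior with. A toy sketch (not Pydantic itself, which does this far more thoroughly):

```python
from typing import get_type_hints

class Point:
    x: int
    y: int

def from_strings(cls, raw):
    """Use the class's own annotations, introspected at runtime,
    to coerce string fields into the annotated types."""
    hints = get_type_hints(cls)
    obj = cls()
    for name, typ in hints.items():
        setattr(obj, name, typ(raw[name]))
    return obj

p = from_strings(Point, {"x": "3", "y": "4"})
print(p.x + p.y)  # 7
```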
Pydantic being so fast because it's written in Rust is a good thing, you can do crazy dynamic (de-)serializations everywhere with very little performance penalty.
> nothing about type annotations constrains Python code at all
Sorry, but this is just not true. Don't get me wrong, I write typed Python 99% of the time (pyright in strict mode, to be precise), but you can't type check every possible construct in the language. By choosing to write typed Python, you're limiting how much of the language you can use. I don't think that's a bad thing, but it can be a problem for untyped codebases trying to adopt typing.
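As one concrete instance of the kind of construct meant here: attributes invented at runtime from data run fine, but a strict checker (e.g. pyright in strict mode) flags every access to them, so typed codebases end up avoiding the pattern. An illustrative sketch:

```python
class Config:
    """Idiomatic in dynamic Python, but rejected by strict type
    checkers: attributes are created at runtime from a dict, so
    they appear nowhere in the class definition."""
    def __init__(self, entries):
        for key, value in entries.items():
            setattr(self, key, value)

cfg = Config({"host": "localhost", "port": 8080})
# Works at runtime; a strict checker reports unknown attributes here:
print(cfg.host, cfg.port)
```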