
I don't get this sentiment: "AI is wrong! OMG WE'VE BEEN LIED TO!"

Is anyone REALLY expecting AI to respond with 100% correct facts? Because in my mind it's the same as Google/Microsoft/etc. search results: the response is just a summary of the "knowledge" it has, based on the content it was trained on.

Also, is it realistic to expect AI to be correct about everything it spits out?




This reads the same as "what did you expect of full self-driving? Of course there are technical limitations." A lot of people expect AI to be "smart" based on popular culture; relatively few realize that what we have currently is word prediction, not some form of intelligence.


> relatively few realize that what we have currently is word prediction, not some form of intelligence

Honestly this is what impresses me most: the fact that it's word prediction and it can still be very useful (in my experience). You're right that people don't understand it; hell, I don't either, fully. Nevertheless it is shockingly useful for being nothing more than an autocomplete over existing information, in my view.

Even if all we could add to the service is the ability to cite sources of information - i.e. to let us validate statements for ourselves more easily - I think it would be a huge leap in interfaces.

Being able to put a chat interface over content seems wholly unique to me. I'd love to explore the depths of complex inputs like the entire LOTR series as a chat: to discuss, learn, and grow on existing content. I don't care about intelligence; I just want a way to mitigate the confidently-incorrect answers, if that's even possible.
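Something like this toy sketch is roughly what I mean (the retrieval here is a crude word-overlap score; a real system would use embeddings plus an LLM prompted to answer only from the retrieved passages, and nothing here is any vendor's actual implementation):

    # Hedged sketch of "answer with citations": retrieve the passages most
    # related to the question and hand them back as sources so the reader
    # can check the claims themselves.
    def score(question, passage):
        # Crude relevance: how many question words appear in the passage.
        q = set(question.lower().split())
        return len(q & set(passage.lower().split()))

    def answer_with_citations(question, passages, top_k=2):
        ranked = sorted(passages, key=lambda p: score(question, p), reverse=True)
        sources = ranked[:top_k]
        # A real system would feed `sources` to a language model here and
        # ask it to answer only from them; this sketch just returns the evidence.
        return {"question": question, "sources": sources}

    corpus = [
        "Bilbo Baggins found the One Ring in Gollum's cave.",
        "Frodo Baggins inherited the Ring from Bilbo.",
        "Aragorn was crowned King of Gondor.",
    ]
    print(answer_with_citations("Who found the One Ring?", corpus))

Even that much would let me click through to the passages behind an answer instead of trusting the confident tone.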


That's because these things are way more than word prediction.

Their job is 100 percent to predict the next word, but saying that hides what you have to do in order to predict the next word.

In a dataset of facts? You have to repeat facts.

In a dataset of logical arguments? You have to learn logic.

In a dataset containing only good chess moves? You have to learn to make good chess moves.

Train any AI model in a strong enough way and it will pick up domain knowledge along the way. Yes, it's predicting words, but it's using that knowledge to do it.

It's by no means perfect, but do not underestimate the potential here, and remember that reductive definitions (it's just x) can make most great things seem bad or mundane.
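As a toy illustration (a hedged sketch using word counts instead of a neural net, so nothing like a real transformer): even a bare next-word predictor has to end up storing the facts in its training text in order to predict them.

    # Toy bigram "language model" built from counts. Its only job is
    # next-word prediction, yet doing that job forces it to store the
    # facts in its training text. Real LLMs replace the counts with a
    # neural net trained by gradient descent on the same next-token
    # objective.
    from collections import Counter, defaultdict

    corpus = [
        "the capital of france is paris",
        "water boils at 100 degrees celsius",
    ]

    nexts = defaultdict(Counter)
    for line in corpus:
        words = line.split()
        for prev, nxt in zip(words, words[1:]):
            nexts[prev][nxt] += 1

    def complete(prompt):
        # Greedily append the most likely next word until nothing follows.
        words = prompt.split()
        while words[-1] in nexts:
            words.append(nexts[words[-1]].most_common(1)[0][0])
        return " ".join(words)

    print(complete("the capital of france is"))  # -> ... paris
    print(complete("water boils at"))            # -> ... 100 degrees celsius

Scale the same objective up to billions of parameters and the whole web, and "predict the next word" quietly turns into "learn whatever the corpus required you to know".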


Right, it’s (very roughly speaking) an approximation to an algorithm which maximises the probability of the entire response by factorising the distribution as a sequence of conditional distributions.

If the approximation works, and the training data is coherent, it will produce coherent responses.

The surprising thing to me is that this does work so much of the time.
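Spelled out, the factorisation is just the chain rule of probability, with each next-word distribution conditioned on everything generated so far:

    P(w_1, ..., w_n) = P(w_1) * P(w_2 | w_1) * ... * P(w_n | w_1, ..., w_{n-1})

The model only ever learns (approximations of) the conditional factors on the right; generating a response is, roughly, sampling from them one word at a time.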


I think one of the clear lessons of ChatGPT is that the line between word prediction and intelligence (for some definition) is somewhat blurry.


Self-driving cars are supposed to drive themselves. That's their entire goal.

LLMs are not supposed to answer knowledge questions. They are not knowledge bases. The fact that everybody, including their developers, insists on using them that way is just absurd. Are we in a Kafka novel?

That's the equivalent of complaining that the self-driving cars you rented didn't deploy your marketing message effectively, so their AI must be bonkers.


What are they supposed to do then?


> LLMs are not supposed to answer knowledge questions

I googled for a knowledge question and wasn't able to find an answer.

Then I used ChatGPT and it produced a surprisingly satisfying (opinionated and correct) response.

I was able to successfully extract value by treating the LLM as a knowledge base; what exactly is absurd about that?

(to be fair, I asked ChatGPT the same question today and this time it returned a bland "it depends" non-answer).


And "intelligence" is part of the acronym; what it actually is and what the masses understand it to be are two completely different things.

Barely anyone is calling them LLMs; it's always "AI".


To predict sentences correctly, the model had to learn human knowledge and its relations.

To predict correctly, you need intelligence. This model encodes human knowledge in its huge neural net.


I'm not saying it's not impressive or doesn't contain logic and facts and knowledge; I'm saying people generally understand the word "intelligence" to mean one thing, and this doesn't meet that understanding.


AI doesn't even pretend to get anything 'right'. As far as I understand, it produces statistically common phrases related to the subject words.

I asked for a chemical bond description of caffeine. It began with the formula C8H10N4O2, then proceeded to blather about the number of bonds of this type and that, getting absolutely everything wrong, egregiously wrong.

I believe it was just spouting chemistry-sounding verbiage from other descriptions of chemical bonds of other substances.

AI doesn't 'look things up' or 'disgorge something it's heard'. It makes shit up wholesale from fragments. Always spouting it out with complete confidence.


The most dangerous communicators are those who (or that) make assertions with a confident tone that is not in proportion to their actual knowledge.

A good start would be to get rid of ChatGPT's "Sure!" opening word. It's giving me real garbage at times with an implied certainty.


Microsoft appears to believe they can market with a message extremely close to "Our AI technology will provide you with 100% accurate facts and summaries", and then apologists (like your post) come along and say "well, come on, nobody really thinks that".

No, it's not realistic to expect AI to be correct about anything, especially when that "AI" isn't built to give a fuck about correctness. But tell that to the proponents of "chatgpt everywhere".


Which part of this prominent message at the top of every BingGPT chat makes the claim you ascribe to MS? https://imgur.com/a/vkPG6ZL


Which part of "suprises and mistakes" is covered in this announcement text?

https://blogs.microsoft.com/blog/2023/02/07/reinventing-sear...

Also, user feedback is not going to improve the LM behavior.


Q: Which part of "surprises and mistakes" is covered in this announcement text?

A: "Our teams are working to address issues such as misinformation"

Q: Also, user feedback is not going to improve the LM behavior.

A: finalModel = prospectiveModels.sort((a, b) => b.averageUserFeedback - a.averageUserFeedback)[0]

(Not the exact implementation, obviously. But if you think data can't improve AI models... I don't know what to say)


Fair enough, I missed the line about work on misinformation.

However, AFAIK, there is no AI-field wide consensus about sensible ways to address this, and no conviction from anyone that there are any reliable techniques. So it's nice that they have "teams working" on it, but IMO that doesn't justify deployment of clearly flawed technology for this purpose.

Data from users ("It was wrong") is hard to incorporate. A scheme like the one you propose basically implies using users as literal testers, which wouldn't matter so much if this was a UI/UX question. Instead, users will be given garbage, a few of them will find out, a few of those will leave feedback. This is not a sane model for improving the behavior of a language model.


> there is no AI-field wide consensus about sensible ways to address this, and no conviction from anyone that there are any reliable techniques

Oh no, an open problem! Better just give up. Certainly don't allow anyone in the public to be involved in finding a solution. Much better to have an internal team stumbling around in the dark for years, then rush their product out when a different company comes along that was willing to develop in the open and has moved much faster accordingly.

> A scheme like the one you propose basically implies using users as literal testers

Users are literal testers; that's why the product is out now. Some users are being given garbage, some of them are finding out, and some of them are leaving feedback. That user flow is occurring at a measurable rate.

In the future, different models will be made. They will be given access to different "oracles" (in the computability theory sense), and these oracles will change their behavior. They will be able to do things like query {the web, wolfram alpha, python, prolog, etc.} and provide cited sources in responses. However, it's not enough to add the oracle. You must also verify that the oracle improves the user's experience. This is done by comparing the measured feedback rate with/without the oracle(s).
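A minimal sketch of that comparison, with made-up names and toy data (not anyone's actual pipeline): serve two variants, one with oracle access and one without, and compare how often users flag the responses.

    # Hedged sketch: keep whichever variant has the lower negative-feedback rate.
    def negative_feedback_rate(flags):
        # flags: one boolean per served response; True = user flagged it as wrong.
        return sum(flags) / len(flags)

    # Toy logs standing in for real telemetry.
    without_oracle = [True, False, True, False]
    with_oracle    = [False, False, True, False]

    if negative_feedback_rate(with_oracle) < negative_feedback_rate(without_oracle):
        print("oracle variant wins; keep it")
    else:
        print("oracle didn't help users; drop it")

In practice you'd also want enough traffic for the difference to be statistically meaningful, but that's the shape of the loop.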


> Better just give up.

Certainly not. But don't release a modification to a major public-facing service based on "we'll probably figure it out one day".

> Users are literal testers, that's why the product is out now

I regard this as immoral (not for the UI/UX case, as I mentioned above). I don't expect or require you or anyone else to agree with me.



A lot of ChatGPT responses are presented as factual, with confidence. And it's ChatGPT, not "search random results GPT". And it's AI, which has the word "intelligence" in it.

When you deploy a tool like that, you're asking for it.


The search results show you websites. They don't claim to be true; they just show you things that are related to what you searched for.

How can you not see the difference?



