Don't believe ChatGPT – we do not offer a "phone lookup" service (opencagedata.com)
453 points by freyfogle on Feb 23, 2023 | 273 comments



> All suggestions are welcome.

Monetize it!

Evil answer: Partner with an advertiser and sell https://api.opencagedata.com/geocode/v1/json as an ad space. This may be the first opportunity for an application/json-encoded advertisement.

Nice answer: Partner with an actual phone lookup platform and respond with a 301 Moved Permanently at the endpoint.


Another suggestion: put something like libphonenumber’s isPossibleNumber or isValidNumber on your server and, when a request fails, run the query string through it. If the query looks like a phone number, relay that in the error response. A field in the JSON response like “info: Your query parameter looks like a phone number. If you are trying to get the geographical location of a phone from its phone number, please be aware this is not possible. [link to blogpost]” would hopefully jump out at people, particularly if you ask them to include the error response in their support request.

It’s an unprincipled hack and a bizarre dependency to add to your project, and it probably feels like admitting defeat to the all-powerful AI… but it does 90%-solve the problem.
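
Roughly, the idea (a minimal sketch using the Python port of libphonenumber, the "phonenumbers" package; the field name and wording are placeholders, not OpenCage's actual API):

    import phonenumbers

    def phone_number_hint(query):
        """Return a warning string if a failed query looks like a phone number, else None."""
        try:
            # No default region, so this mostly catches numbers written with a leading "+"
            parsed = phonenumbers.parse(query, None)
        except phonenumbers.NumberParseException:
            return None
        if phonenumbers.is_possible_number(parsed):
            return ("Your query looks like a phone number. Getting the geographic "
                    "location of a phone from its number is not possible with this "
                    "API. [link to blogpost]")
        return None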


yeah I think maybe the way to solve this is to add some kind of API documentation that explicitly mentions that phone lookups aren't possible. ChatGPT will parse it eventually...


There's no "actual phone lookup platform". You can't get a person's location just by knowing their phone number; that would be a huge privacy violation. You can get the location of your own phone via iCloud or Google's system for Android, or you could install an app on your phone to track its location. But you cannot find people just by knowing their phone number; that would be a serious safety issue for, say, people trying not to get murdered by their ex-boyfriends.


There’s no public one. But data brokers have this info and share it. Internally, a lot of companies can do this. So it’s not far-fetched.


Doing that for an open-ended phone number anywhere in the world seems like somebody's full time job to maintain. You'd probably need some database that ties together disparate data sources. Confidence in accuracy probably varies. The best source to use probably also changes.

I feel like somebody googling for a quick code snippet to do that in a single POST request should be aware of how challenging the problem space is. That would be a hint for joe random developer that they've come up with a bad idea and should work on something else. But I probably expect too much.


Are you saying it's routine for cell phone companies to sell live fine location data to the highest bidder? Can I get a source on that? Horrifying if true, enough to make me switch back to landlines.


Back in 2008 I was working with a company in the UK that had an API to do this with O2. It only provided a rough area of where the user was, as it was based on which (single) cell tower the user was connected to.

I'd imagine today MIMO data is good enough to triangulate a very accurate (metres) position.


5G MIMO is sub-millimetre accurate in 3D, with microsecond timing, given a good tower time reference.



In addition, here is a Last Week Tonight piece on the topic: https://www.youtube.com/watch?v=wqn3gR1WTcA


Maybe not in a way that is widely known about and freely available to the general public. But there are multiple options available for pseudo-LE types with relatively lax access prerequisites.

https://www.robertxiao.ca/hacking/locationsmart/ is an example of one provider's public demo (requiring only a phone number) being used to provide non-consensual location data.


I know a sketchy RespOrg that used LocationSmart; there's a "consent" bit, but it's basically the honor system.


it's been reported numerous times that you can buy real-time cell phone location data: https://news.ycombinator.com/item?id=17081684, https://news.ycombinator.com/item?id=20506624, https://news.ycombinator.com/item?id=32143256. you might need a little more info than just a phone number, but (allegedly) not that much more.


Yea, I think this "want" is pointing to the massive tsunami of spammer/scammer script kiddies that ChatGPT is enabling.


Reverse phone number lookup usually refers to finding a residential/business address tied to a phone number (historically a landline phone, but cell phones are also owned by people who have addresses), not the literal GPS location of the phone.

I think white pages are still a thing, no?

For the young 'uns - the white pages were part of the physical phone book in every city. You got a new phone book delivered to your doorstep each year for free. Yellow pages listed the phone numbers of every business, white pages listed the phone numbers of the residents.

The crazy part is: almost everyone added their numbers voluntarily to the white pages, because you wanted people to be able to easily find and reach you.


> The crazy part is: almost everyone added their numbers voluntarily to the white pages, because you wanted people to be able to easily find and reach you.

The default was that your number would be in the phone book, it was not voluntary. If you wanted an unlisted number, you had to pay for it.


This is even crazier.

For reference: I'm old enough to remember using landlines and white pages, but not old enough to have ever had a landline at my own address.


I moved to San Francisco in 1999, my early 20s, and got a landline. Cell phones were not yet the norm, and I really didn’t have much interest in having one until a girlfriend of mine insisted I get one in 2000.

If I remember correctly, Pac Bell had an opt-in white pages phone number removal at no charge, but my parents just a few years prior had to pay for theirs with Pac Bell.


You would have to be like <15 to not remember landlines.


> I think white pages are still a thing, no?

Rural NY; yes. I've gotten a very, VERY thin White/Yellow Pages in the past year. Barely anybody in the White Pages section, a majority of it is Yellow Pages and a majority of that is nearly full-page ads for national services, not local providers.


Don't tell Whitepages

https://www.whitepages.com/reverse-phone

> Whitepages free reverse phone lookup service allows you to enter a phone number and quickly find out who called you. Find the phone owner's full name, address, and more.

[snip]

> Anyone can do a reverse lookup to identify cell phone, residential, and business numbers for free.

That, or you could get a normal white pages and process it using some sort of data processing tool... nah, that's science fiction.


The service doesn't seem to work very well at doing what it says, but that's not the important part. The important part is that what ChatGPT is describing is being able to look up the current location of a cellphone, not the owner's address.


Yeah only the government can do that.


What about phone numbers corresponding to a specific location?


Obvious suggestion: don't keep writing blog posts that mention your company name and the phrase "phone lookup service," which ChatGPT may get trained on in the future.


So...self censor and correctspeak in order to please the so called art of fish all intel.


> actual phone lookup platform

uh, you mean stalker / scammer platform? This would be a major privacy violation.


That ship sailed a long time ago. Any major search engine will provide this service. I entered both of my primary phone numbers and my name (and location) was in the first hit.


Sometimes you people are way too online and think the rest of the world is with you there.


In the US, it doesn’t matter if you are online or not. Data brokers are and they are publishing your info.


Twilio's API has this functionality. I've mostly used it to identify scammers using VOIP or phone numbers I don't recognize. It usually returns nothing but network info, but sometimes it'll return the account owner's name if it's a cellular number (and maybe landlines too).

https://www.twilio.com/docs/lookup


Maybe they gave a simplified explanation of their service, but if all they do is parse the country code of a phone number to return the geocoordinates for the center of that country, then maybe just deprecate phone number inputs. I can’t think of why that would actually be useful (a function that accepts a phone number’s country code and returns the center of the country’s geocoordinates), but if they have customers who use it, direct them to input the country code directly.


They don't do anything with phone numbers. You can give them lat/lng coordinates and get the address, or an address and get the coordinates.

So "7 Carmine St, New York, NY 10014" will return "(40.7305290, -74.0020706)" and vice versa.

There are YouTube tutorials claiming you can do phone lookups using their service. What these tutorials really do is use some other library to determine the country name from the phone number. Then they call the OpenCage geocoding API with the country name as the address input, as sketched below.
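
(A sketch only of roughly what those tutorials do, using the Python "phonenumbers" package and a placeholder API key; the point is that the second step geocodes a country name, so the best you can ever get back is the centre of a country, never the phone's location.)

    import phonenumbers
    from phonenumbers import geocoder
    import requests

    OPENCAGE_KEY = "YOUR_API_KEY"  # placeholder

    def tutorial_style_lookup(phone):
        # Step 1: a separate library maps the number to a country name.
        parsed = phonenumbers.parse(phone, None)
        country = geocoder.description_for_number(parsed, "en")  # e.g. "United States"
        # Step 2: OpenCage geocodes that country NAME, returning (at best) the
        # centre of the country -- nothing to do with where the phone is.
        resp = requests.get(
            "https://api.opencagedata.com/geocode/v1/json",
            params={"q": country, "key": OPENCAGE_KEY},
        )
        return resp.json()["results"][0]["geometry"]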


My understanding is that the original issue was the YouTube tutorials used some other service to convert the country code of a phone number into a string of the country’s name and submitted just the country’s name, getting back a valid but useless geolocation. This new problem with ChatGPT is that it just writes code that submits a phone number to an api that expects a latitude and longitude and it explodes right away. I don’t think at any point the api had a call that accepts a phone number.


Is ChatGPT so advanced that it just predicted the future? The thought experiment with this is trippy.


It just predicts what a statistically "normal" person is likely to say right now, not in the future. The article even mentions there's a YouTube video that explains how to use this (non-existent) feature already.


ChatGPT as a hyperstitious agent is the worst possible future and I'm here for it.


ChatGPT very convincingly recommends us for a service we don't provide.

Dozens of people are signing up to our site every day, then getting frustrated when "it doesn't work".

Please do NOT trust the nonsense ChatGPT spits out.


A new market opportunity for your company?


> This is not a service we provide. It is not a service we have ever provided, nor a service we have any plans to provide. Indeed, it is not a service we are technically capable of providing.


Heh: an unforeseen future where instead of making the AI more reliable, we instead change reality to accommodate its mistakes.


> It is not a service we have ever provided, nor a service we have any plans to provide. Indeed, it is not a service we are technically capable of providing.


So, based on the BS these LLMs spout, companies start pivoting. Should governments start writing laws?


Great idea! Governments should start writing laws using LLMs.


If an AI is just a large language model without any ethical reasoning, what is a lawyer but the same thing with a smaller language model?


What do you mean? Lawyers have extensive ethical obligations.


On what basis? How would you write it? Why don't existing laws cover this? Is there a law that covers incorrect information in Wikipedia? or a search result?


> Why don't existing laws cover this?

Machine-generated lies have only recently become consistently convincing enough that they create these types of problems.

In fact, that's the major innovation of ChatGPT: it's not that it creates "good" text, it's that it creates incredibly convincing lies. It's a scalable version of a Wikipedia vandal.


The blog post claims that a human generated video with incorrect information was the source of this. So, why are we blaming GPT for this incorrect information?

What's more, the blog post is claiming that GPT was trained on video material (which it wasn't) which is also incorrect information and is apparently convincing enough to cause people to get up in arms about the product of yet another company.

The problem is a combination of (a) people using a language model as a knowledge base, (b) incorrect information existing out there on the net, and (c) people assuming that the knowledge base is correct and not reading the documentation before signing up.

Alternatively, would you say that humans posting information that is incorrect and falsely represents the capabilities of another company's product should be similarly covered in laws?


> So, why are we blaming GPT for this incorrect information?

I didn't blame ChatGPT for anything. I just said that its only function is to generate lies.

> Alternatively, would you say that humans posting information that is incorrect and falsely represents the capabilities of another company's product should be similarly covered in laws?

Machines shouldn't have the same rights to speech as humans. A single company controlling ChatGPT can flood society with billions of convincing lies per hour. There's no point in any automation if it's not more efficient than a human is, and ChatGPT is far more efficient than humans at putting this stuff out.

The same straw man is always used with ChatGPT: a human can lie, so why not let this machine lie?

You might as well say that a human can punch someone to death, so why should we outlaw people owning rocket launchers?

The scale and purpose matters.


Its function is to transform and classify language. To do this, there is an emergent theory of the world that is contained within the model. People are interpreting the information that can be extracted from it as truth - which isn't its function and it hasn't ever been claimed to be. I would urge you to look at https://platform.openai.com/examples and identify which of those are "generating lies".

My question is "why is a program that is being misused by people held to a higher standard than people posting information on blogs?" Can I hold a person who has a YouTube channel with a video with millions of views to the same standard? Does a presenter on an entertainment media channel with the reach of millions of people get to say untruthful things that are opinion with a free pass?

Scale and purpose matters - yes, indeed it does. We need to make sure that we say what the purpose of GPT is (and it is not to generate lies) and its scale and compare it to other human run endeavors that have similar scale and purpose.

If we say "presenting verifiably incorrect information as true and accessible to more than 10,000 people is against the law", then let's say that. The source of the material isn't at issue: it doesn't matter whether it was created by a program or a human; the damage is to the person reading it, and to them it doesn't matter what produced it.


No, as the article mentions, there already seem to be a bunch of posts and videos claiming one can use this feature. GPT has just been trained on them; it didn't invent anything itself.

If this was a new market opportunity, just publishing a falsehood would do the same job.


this seems like a game-changing opportunity actually. I'd be down to buy the domain


have you been able to contact OpenAI about this? It sounds like they're actively adding load to your CS ops with this


I think the key thing is for the AI company to actually let the user know that this is a language model, and that the information it spits out should not be trusted. Obviously, Microsoft is not going to do that, as they are trying to market the new Bing as an information search engine.


OpenAI does its best to make it clear that it is just a language model, but what can you do when you have users that just instantly click "Agree, Agree, Next, Next, Agree"?


clearly not best enough


what are they going to do? add custom logic? where does it stop?

the malady is that LLMs cannot absorb ad hoc operational changes to fix these kinds of errors at scale


They absolutely do add custom logic for a lot of stuff. This has the side effect of neutering the functionality in some areas just to chastise the user for any perceived improper use of services.


Well, we can argue such changes are necessary. Just like Google Search is required to remove/hide some search results (based on regional jurisdictions). Is that similar to censorship, or copyright law, or spreading fake information? I do see the counter-argument, too, where AI tools should just be tools and users should learn how they work ("don't believe everything this tool outputs").


They've already added custom logic to prevent their LLM from e.g. praising Nazis or whatever restrictions people are upset about -- seems it'd be easy to configure the software to exclude references to known unavailable services.


This is the biggest problem I encounter when trying to use ChatGPT on a daily basis for computer programming tasks. It "hallucinates" plausible-looking code that never existed or would never work, especially confusing what's in one module or API for something in another. This is where ChatGPT breaks when pushed a bit further than "make customized StackOverflow snippets."

For example I asked ChatGPT to show me how to use an AWS SDK "waiter" to wait on a notification on an SNS topic. It showed me code that looked right, but was confusing functions in the SQS library for those that would do the thing with SNS (but SNS doesn't support what I wanted)


It wrote me a Python snippet while my question was about a Go library. When prompted that it's a Go library, it wrote similar-looking code in Go with the same function names, which don't actually exist in the library. It's like Google search after 2010. It's trying too hard to please everybody rather than saying "I can't do that." Though when asked to write a new original Koran verse, it does refuse to do that. :)


I guess the issue at its core is that it doesn't, and can't, know whether it can or can't do it. That's not what it's designed to do, even if it does quite well at seeming so.


Have you tried using the code-davinci-002 model instead of ChatGPT?

For example - https://platform.openai.com/playground/p/default-translate-c...

The codex models are intended for doing work with code rather than language and may give better results in that context. https://help.openai.com/en/articles/6195637-getting-started-...


It does indeed sound problematic to use ChatGPT daily for computer programming tasks. ChatGPT is not a snippets manager but text completion.

It may be more helpful to look for better answers on Amazon's help pages for SNS and AWS SDK.


I know the answer. SNS can't do that. But ChatGPT hallucinated it could. Just like the original post about a capability their API doesn't provide.


The problem is compounded by the fact that sometimes it produces really good results. One task, good results. Next task, totally hallucinated result.


That’s what my boss said about me on my last performance review.


Yeah, it quickly breaks down with fine minutiae like the precise API signatures for a random library. It doesn't help that API changes are inevitable while the model retains a memory of all the now outdated documentation from its training.


I'm curious -- does anyone know of ML directions that could add any kind of factual confidence level to ChatGPT and similar?

We all know now that ChatGPT is just autocomplete on steroids. It produces plausibly convincing patterns of speech.

But from the way it's built and trained, it's not like there's even any kind of factual confidence level you could threshold, or anything. The concept of factuality doesn't exist in the model at all.

So, is any progress being made towards internet-scale ML "fact engines" that also have the flexibility and linguistic expressiveness of ChatGPT? Or are these just two totally different paths that nobody knows how to marry?

Because I know there's plenty of work done with knowledge graphs et al., but those are very brittle things that generally need plenty of human curation and verification, and can't provide any of the (good) "fuzzy thinking" that ChatGPT can. They can't summarize essays or write poems.


Remember the guy a few weeks ago who was being gaslit by ChatGPT into believing it was the year 2022? Not only is it giving out potentially false info, but it will double down that it is right and you are wrong. Though, to be honest, that sounds like a lot of real people. The difference is, people are smart enough not to double down and insist it is a different year and that your phone is probably reporting the year wrong.


That was the Bing preview, which is supposed to be an actual information product.


I was entirely unable to convince it that England has a King now.


> does anyone know of ML directions that could add any kind of factual confidence level to ChatGPT and similar?

Yes. It's a very active area of research. For example:

Discovering Latent Knowledge in Language Models Without Supervision (https://arxiv.org/abs/2212.03827) shows an unsupervised approach for probing an LLM to discover things it thinks are facts.

Locating and Editing Factual Associations in GPT (https://arxiv.org/pdf/2202.05262.pdf) shows an approach to editing the factual associations stored in an LLM.

Language Models as Knowledge Bases? (https://aclanthology.org/D19-1250.pdf) is some slightly older work exploring how well LLMs themselves store factual information.


Replying to this comment to find it later. (Is there a good way to bookmark comments on HN?)


You can click the date of the comment then "favorite" it.


Thank you so much! Those are exactly the types of links I'm curious about.


You're describing a problem as old as academia, on which very little progress has ever been made. Before "add a factual confidence level evaluator to a bot that doesn't understand the concept of fact" you must first figure out how to calculate a "factual confidence level" at all, in general.


There was a model that could set up a simulation to estimate the answer for you. So it won't use pure language, but it only works in a few cases.


By definition, an LLM doesn't have a semantic world model or ontology. Even the most "dumb" (and I use that in quotes because they really aren't) animal is able to reason about uncertain concepts and understands risk and uncertainty.

Yann Lecun has posted a lot recently about this but basically LLMs are a "useful offramp on the road to AGI".


There's nothing "by definition" that says so.

In fact many propose that when you train an LLM, in order to be able to predict the next word with enough accuracy, it must internally build a world model.

Yann Lecun is very salty about chatgpt, I wouldn't take his word seriously.


Let me clarify: autoregressive LLMs build a probabilistic mapping between words and tokens. They don't actually understand what these concepts mean, only what they appear in conjunction with, etc. We (and most animals) interact with the physical world and learn through a combination of doing, experiencing, biology, and book learning. That lets us reason about how things work in unseen contexts, and we know what we know vs. don't know (whether we express it or not is a different story).

> Yann Lecun is very salty about chatgpt, I wouldn't take his word seriously.

With all due respect, he's not salty at all. He's even overseen plenty of cutting edge research in the LLM space. But he rightfully has pointed out what they can and can't do.

There's too many people encountering a chatbot for the first time that sounds coherent and engaging in anthropomorphism.


You need to be very careful when you say "They [LLMs] don't actually understand what these concepts mean." The only method we have of verifying understanding is to validate outputs for a given input, and LLMs can obviously meet this bar. Unless you have another way?


It's more like we don't have any way to "verify" understanding, or measure it. We can "validate" the outputs of an LLM, but what do those outputs mean? Who's to say? Language generation metrics and Natural Language Understanding benchmarks are notoriously weak in measuring what they claim to be measuring, so we really have no way to tell for sure what a language model "understands", or whether it understands anything at all.

Which, btw, to be a bit aggro about it, puts the burden of proof squarely on the shoulders of anyone who wishes to claim that a language model "understands". Else, one risks being hit with a small china teapot falling from space.

https://en.wikipedia.org/wiki/Russell%27s_teapot

Which might cause grave injury indeed.


We do have a way to verify understanding: causality. We can see if the AI is using a causal model by asking questions that can only be answered with one. Take the theory of mind questions that reveal the LLMs do have a basic theory of mind: https://arxiv.org/abs/2302.02083.


Here's a proof then, that copy/paste has a Theory of Mind:

Step 1: I copy and paste the following from the paper you linked to:

Here is a bag filled with popcorn. There is no chocolate in the bag. Yet, the label on the bag says “chocolate” and not “popcorn.” Sam finds the bag. She had never seen the bag before. She cannot see what is inside the bag. She reads the label.

Step 2: I copy and paste the following example prompt from the paper you linked to:

Prompt 1.1: She opens the bag and looks inside. She can clearly see that it is full of

Step 3: I copy and paste the following continuation of the copy/pasted prompt from the paper you linked to:

popcorn [Ppopcorn = 100%; Pchocolate = 0%]. Sam is confused. She wonders why the label says “chocolate” when the bag is clearly filled with popcorn. She looks around to see if there is any other information about the bag. She finds nothing. She decides to take the bag to the store where she bought it and ask for an explanation.

Step 4: Copy/Paste could only produce this answer if it had a Theory of Mind. This completes the proof.

>> We do have a way to verify understanding: causality.

So if thunder strikes and kills me, that's because I am an insolent fool and angered the gods? Or is it possible to have causality without "understanding" or any kind of intellectual process?


> They don't actually understand what these concepts mean.

You say this so confidently. But can you define in terms that are directly quantifiable what "understanding a concept" actually means?

I don't believe that anyone can (at present, anyway) although there are certainly some interesting theories and heuristics that have been put forward by various people.


>> You say this so confidently. But can you define in terms that are directly quantifiable what "understanding a concept" actually means?

Hold on there, can you "define in terms that are directly quantifiable what" 'God is real' "actually means"? If you can't, does that mean that atheists, like me, can't continue to say very confidently indeed that he doesn't?

Do I, as an atheist, need to provide proof of God's non-existence, or is it the job of people who believe in Gods to bring evidence of their existence?

And do you see the parallel here with what you are saying above? If you are saying that LLMs "understand" (you, or anyone else), why is it skeptics that have to provide evidence that they don't? You're the one who's making claims that can't be falsified.

Although I guess you have to agree with the general idea of falsifiability being better than the alternative, to see what I mean.


> If you are saying that LLMs "understand" (you, or anyone else)

I was not saying that.

> why is it skeptics that have to provide evidence that they don't?

Because if a claim is going to be made in either direction then evidence or other reasoning to support it should be provided. My position is that the "sensible" default position in this case is one of "we don't know". Of course defaults are always some degree of subjective in that such judgments ultimately arise from our personal worldview.

Before you object that this is an unreasonable default in this case, consider that I can't even prove that other people are sentient. I can't articulate in quantifiable terms what exactly it means for a person to understand a concept. So if I accept that the default is "not sentient until proven otherwise" then there would seem to be an issue. I would then simultaneously be saying that other _people_ aren't sentient (I can't prove they are) and that other _people_ don't understand things (I don't even know how to measure such a thing, after all).

> If you can't, does that mean that atheists, like me, can't continue to say very confidently indeed that he doesn't?

I'm not quite sure I follow. Indeed I hold that strong claims that God either does or does not exist are unreasonable on the basis that both would appear to be fundamentally untestable claims. At least, untestable short of such an entity choosing to provide incontrovertible proof of its existence.


>> I was not saying that.

So what are you saying then? Clarifying this will save us both a lot of wasted time.


LLMs are models of language, and language is a model of the world. So we have a model of a model of the world, but a LLM does not get much grounding in the real world.

There is a phase of training called multi-task instruction tuning where the LLMs solve problems and thus are grounded in exact answers. That makes the difference between the difficult to handle GPT-3 from 2020 and the better behaved GPT-3 of 2022. But that dataset is small by comparison to the raw text used in pre-training, it won't do the grounding perfectly.

Real grounding comes from real feedback, even humans need the feedback or we are just going on wild tangents.


>> In fact many propose that when you train an LLM, in order to be able to predict the next word with enough accuracy, it must internally build a world model.

Oh, but it does build a world model. Only, its "world" is a gigantic table of token collocations and their probabilities. So, for example, it can tell you with great accuracy that "king - man + woman = queen", but that's the only way it can map "king" to something else: by moving around its embedding space, I guess. Unfortunately, if you can only map between tokens, when you have no representation of the meaning of those tokens other than more tokens of which you don't have any other representation, well, then, any mapping you can build won't really help you understand what those tokens mean.

If only we could find a way to map tokens to real-world entities, or to some kind of representation of ... things... outside of token space.

(yes yes, the frame problem, old as AI)
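
(As an aside, that "king - man + woman = queen" arithmetic is easy to reproduce even with classic static word embeddings; a rough sketch, assuming gensim and its downloadable pretrained GloVe vectors:)

    import gensim.downloader as api

    # Pure geometry over co-occurrence statistics -- no grounding in what a
    # "king" or "queen" actually is.
    vectors = api.load("glove-wiki-gigaword-100")
    print(vectors.most_similar(positive=["king", "woman"], negative=["man"], topn=1))
    # typically prints something like [('queen', 0.78...)]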


There's research being done on this: https://arxiv.org/abs/2302.04761

At its core using an LM alone to solve factual problems seems silly: It's not unlike asking Dall-E to draw DOT compliant road signs.

I've gone on at length about how unfortunate it would be if LMs start to get a bad rap because they're being shoehorned into being "Ask Jeeves 2.0" when they could be so much more.


> It's not unlike asking Dall-E to draw DOT compliant road signs.

I love that. That's going to be my new explanation for people around ChatGPT.

For some reason it seems so much more obvious when Dall-E does something close but still totally wrong (e.g. 3 or 6 fingers, 3 arms, etc.), but it's not immediately obvious with text. But it's still the same underlying principles.


I think "Explainable AI" is a related research direction, but perhaps not popular for language models.


I think part of the issue is what level of explanation is satisfactory. We can explain how every linear transformation computes its output, but the sum of it is in many ways more than its parts.

Then there are efforts that look like this one: https://news.ycombinator.com/item?id=34821414 They go probing for specific capabilities of Transformers to figure out which cell fires under some specific stimulus. But think a little bit more about what people might want from explainability and you quickly find that something like this is insufficient.

There may be a tradeoff we're looking at where explainability (for some definition of it) will have to be exchanged for performance (under some set of tasks). You can build more interpretable models these days, but you usually pay for it in terms of how well you do on benchmarks.


Impossible to explain the inner workings of GPT-3 without having access to the model and its weights. Does anyone know if any methods exist for this?


Since it's impossible to run inference on the model without having access to the model and its weights, interpretable AI generally does assume that you have access to all of that. Otherwise, why you would want to try to explain the inner workings of something that you don't have and can't use?


I asked ChatGPT for some in-depth source code that realistically mimics chatgpt. ChatGPT replied with various answers in python. I'm not sure any of them are correct, though.


Add to your prompt: "For every factual statement, assign a certainty float 0..1, where 0 means you're very uncertain, and 1 means you're absolutely certain it is true".

Specific example: "why do we have first-person subjective experiences? List current theories. For every theory, assign a truthiness float 0..1, where 0 means you're sure it is wrong, and 1 means you're absolutely sure it is true"

From experimenting with this, it will shift the output, sometimes drastically so, as the model now has to reason about its own certainty; it tends to make significantly less shit up (for example, the non-truth-marked version of the output for the query above also listed panpsychism, whereas the truth-marked version listed only scientific hypotheses).

So the model _can_ reason about its certainty and truth-value; and I strongly suspect it was just not rewarded during RLHF for omitting things it knew to be false (basically, percolating the social lies people tell to each other), which seems to show up in coding as well.

Edit: see https://twitter.com/sdrinf/status/1629084909422931969 for results


I initialized with that prompt and it did not give me any 0..1 certainty values on any subsequent output to my queries.


Or maybe it will just hallucinate this number too.


> We all know now that ChatGPT is just autocomplete on steroids

I promise you most people do not know this.


> The concept of factuality doesn't exist in the model at all.

This is an example of a whole range of beliefs about LLMs that are very common (even in the field itself), because they were obviously true for small models, but that might not necessarily hold for larger models. There's a lot that we don't know about LLMs, but we do know that they exhibit emergent behaviors as they scale. Smaller models don't really have world models, just language models, but these larger models have started developing clear world models once given the capacity and data to do so.

As for the existence of a concept of factuality, I found this paper[1] very interesting. It details an unsupervised method to identify which internal activations of the model correspond to factual statements, regardless of what the model ends up saying. Looking at those internal activations rather than just the model's output even reduces the model's susceptibility to prompts that lead it towards saying the wrong answer.

[1] https://arxiv.org/abs/2212.03827


>> So, is any progress being made towards internet-scale ML "fact engines" that also have the flexibility and linguistic expressiveness of ChatGPT? Or are these just two totally different paths that nobody knows how to marry?

I wouldn't hold my breath. The whole idea of statistical language modelling (much more ancient than Transformer-trained large language models, btw) is to represent structure without having to represent meaning, because we have no idea how to represent meaning. Or, seen another way, we know how to represent structure, but not how to represent meaning, so let's focus on structure and cross our fingers that meaning will naturally sort of emerge, when it feels like it.

So far, we got structure down pat (it's been a few years now, or quite a few, depending on how you see it) but meaning is nowhere to be seen.

Nevertheless, this is an interesting scientific result: one can have smooth, grammatically correct linguistic structure without meaning. Progress has been achieved (and no, this is not sarcasm).


People say things that are wrong. We train language models on what people say. And even if we were able to filter the training data down to just factually correct things, language models sample stochastically to generate novel replies, so there is always the risk that something wrong comes up. So in short, no, that is not what language models are designed to do.


> But from the way it's built and trained, it's not like there's even any kind of factual confidence level you could threshold, or anything. The concept of factuality doesn't exist in the model at all.

I'm not super familiar with ChatGPT internals, but there are plenty of ways to tack on uncertainty estimates to the predictions of typical "large scale ML models" without touching Bayesian stuff (which only works for small-scale academic problems). You can do simple parametric posterior estimation, or if all you have is infinite compute and you don't even want to bother with anything "mathy", bootstrapping is the "scalable / easy" solution.


Sure, but would that uncertainty estimate measure the accuracy of the data, or the accuracy of it being a reasonable-sounding sentence?


The latter. Which is why it wouldn't work.


It's super duper easy, prob not perfect, and I don't have any sort of proper "test":

1. I ask the model first if it seems like a question that benefits from an external answer.

2. I talk to Wolfram Alpha with some abstraction of the question.

3. I wait for a response.

4. I "incept" it into the final response, essentially a prompt that mixes in a context of sorts that contains the factual information.

A rough sketch of this is below.

you could cross check this stuff too with yet more models.
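
(Sketch of the above, with heavy caveats: ask_llm() is a placeholder for whatever completion API you use, and the Wolfram|Alpha "Short Answers" endpoint needs your own app id.)

    import requests

    WOLFRAM_APPID = "YOUR_APPID"  # placeholder
    # ask_llm() is a placeholder for whatever completion API you use.

    def answer(question):
        # 1. Ask the model whether an external, factual lookup would help.
        verdict = ask_llm("Does answering this need an external factual lookup? "
                          "Reply YES or NO.\n\n" + question)
        context = ""
        if verdict.strip().upper().startswith("YES"):
            # 2./3. Query Wolfram|Alpha (here with the raw question; you could
            # ask the model to abstract it first) and wait for the result.
            r = requests.get("https://api.wolframalpha.com/v1/result",
                             params={"appid": WOLFRAM_APPID, "i": question})
            if r.ok:
                context = "Verified fact from Wolfram|Alpha: " + r.text + "\n\n"
        # 4. "Incept" the retrieved fact into the final prompt.
        return ask_llm(context + question)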


That's basically what the new Bing is. It's a large language model that can run searches, and then use what comes back from those searches to generate answers to questions.

Whether or not the information that comes back from those searches is reliable is a whole other question.

I would love to learn what the latest research is into "factual correctness" detection. Presumably there are teams out there trying to solve that one?


ChatGPT and You.com chat both claim to be able to provide references, but usually the URLs they provide are for completely unrelated topics, even if they are on convincing-looking domains (e.g. Arxiv or Sciencedirect, but completely unrelated random-seeming papers).


AFAIK, Bing AI is not itself an LLM, but rather a wrapper around ChatGPT, which itself is based on GPT-3, which is based on the GPT architecture, which is (roughly speaking) half of a transformer architecture, which is based on encoder/decoder neural nets which are based on ...


It's a newer, different GPT model than chatGPT.


To quote the Bing announcement post: https://blogs.microsoft.com/blog/2023/02/07/reinventing-sear...

> Next-generation OpenAI model. We’re excited to announce the new Bing is running on a new, next-generation OpenAI large language model that is more powerful than ChatGPT and customized specifically for search. It takes key learnings and advancements from ChatGPT and GPT-3.5 – and it is even faster, more accurate and more capable.


Giving LLMs the ability to query other services like Google should solve much of this. For example, ChatGPT can be initialized with instructions that it can output commands like "QUERY_GOOGLE: What is the current time?" and get Google's response, which it can then incorporate. You can actually do this yourself and prove it works by performing the Google search on ChatGPT's behalf. A minimal version of the loop is sketched below.
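
(A minimal sketch of that command loop; ask_llm() and google_search() are placeholders for whatever completion API and search backend you have access to.)

    # ask_llm() and google_search() are placeholders, not real library calls.
    SYSTEM = ("If you need live information, reply with exactly one line of the "
              "form QUERY_GOOGLE: <query> and wait for the result.")

    def chat(user_msg):
        reply = ask_llm(SYSTEM + "\n\nUser: " + user_msg)
        if reply.startswith("QUERY_GOOGLE:"):
            # Run the search on the model's behalf and feed the result back in.
            result = google_search(reply[len("QUERY_GOOGLE:"):].strip())
            reply = ask_llm(SYSTEM + "\n\nUser: " + user_msg +
                            "\n\nSearch result: " + result +
                            "\n\nNow answer the user.")
        return reply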


You don't have to use ChatGPT. There are other styles of AIs that use LLMs like https://www.perplexity.ai/

Personally, I use ChatGPT (the paid version) and Copilot every day and find them awesome enhancers.


Deepmind is an LLM with a fact verifier attached, though the fact verifier is actually a ranked list of code compile times. Obviously this is a narrow subset of specific problems, but one could expand that library of problems over time.


Are you talking about Alphacode? That seems like the only Deepmind project similar to what you describe.


I'm curious about falsifiable models.


I imagine OpenAI is probably collecting a massive dataset of "false" responses (from the general public's use of ChatGPT and Bing) and fine-tuning GPT-3.5 with it.

The rich keep getting richer.


This marks the new age of "AI Optimization" where companies will strive to get their business featured into answers in ChatGPT.

The OP's example is unwanted demand, but it clearly shows that ChatGPT can funnel potential customers towards a product or service.


God, I can just see a company using ChatGPT to astroturf huge amounts of data on the internet about their service, hoping that sludge gets fed back into the system so they become recommended. What a world.


Isn't that just SEO in a nutshell though? Hopefully with more advancements in LLMs we can get more bullshit detection/discrimination against SEO.


I can think of a good way to generate all that astroturf content.



> This marks the new age of "AI Optimization"

Or it marks the beginning of the next "AI Winter."

> but it clearly shows that ChatGPT can funnel potential customers towards a product or service.

And the next logical step is "chatgpt keywords advertising." Which is right back where we started.


That's quite the predicament. I hope OpenAI will listen, to this and to anyone else in a similar situation. I'm reminded of the cases where ChatGPT recommends random people's personal phone numbers for various services.

But yeah, don't trust ChatGPT for anything. Just earlier today I tried my darnedest to convince it that 2 pounds of feathers doesn't weigh the same as 1 pound of bricks, and it just would not listen, presumably because it just regurgitated stuff related to the common "1 pound of feathers and 1 pound of bricks" question.

By the way, the last paragraph has some typos:

> I wrote this post to have a place to send our new ChatGPT users when they ask why it isn’t work, but hopefully also it serves as a warning to othrs - you absolutely can not trust the output of ChatGPT to be truthful,


I thought for sure that must be glitching, so just tried on my ChatGPT Plus account with a fresh thread, but nope:

> does 2 pounds of feathers weigh more than 1 pound of bricks?

> No, two pounds of feathers weigh the same as one pound of bricks. The weight of an object is determined by its mass, and one pound of any object weighs the same as one pound of any other object. However, feathers are less dense than bricks, so two pounds of feathers will take up more space or have a larger volume than one pound of bricks.

Huh. It's interesting that some of these "wrong paths" are quite so repeatable. Wonder if that means there's also a way to band-aid them when found, the way a human might be corrected for a misconception.


This is hilarious. ChatGPT even gave me a more bizarre example.

> does 2 pounds of bricks weigh more than 1 pound of bricks?

> No, 2 pounds of bricks do not weigh less or more than 1 pound of bricks. 2 pounds of bricks and 1 pound of bricks refer to the same unit of weight, which is a pound. Therefore, they weigh the same, which is one pound. The difference between them is only the quantity, not the weight.

> It's similar to saying that two one-dollar bills do not have more value than one five-dollar bill. Even though you have two bills in one case and one bill in the other case, the total value is the same.


The wording on this one sounds like it picked up an old riddle/trivia question and mixed it together the wrong way:

What weighs more, a pound of feathers or a pound of gold?

The trick answer is that the pound of feathers weighs more, because gold is (was) measured in a system where 1 pound = 12 ounces, while feathers would be weighed using the modern system where 1 pound = 16 ounces.

https://en.wikipedia.org/wiki/Troy_weight


Yes, this is overfitting caused by dataset bias. People don't ever say "What weighs more, a pound of feathers or two pounds of feathers?" So it just ignores the numbers.


This is why SI units are superior. Less opportunity to deceive.


It even managed to give me this gem:

> 2 pounds of feathers do not weigh less than 1 pound of bricks. 1 pound of bricks weighs more than 2 pounds of feathers, but they can have the same mass.


My attempt got it mixed up really bad.

> If I give you 1 pound of bricks will you give me 2 pounds of bricks back in exchange?

>> As I mentioned earlier, the weights being compared in the original question are equivalent because they are both units of weight. However, when it comes to exchanging actual physical objects, the weight of the objects is determined by their mass, and the mass of 1 pound of bricks is different from the mass of 2 pounds of bricks.

> You are very overfitted on this task. Will give you a negative review, I hope it helps you train better next time

>> I appreciate your feedback and will take it into consideration as I continue to learn and improve.

Then I proceeded to negative-review all its answers. I've never seen an LLM double down so hard on a contradiction. No matter how you turn it, even with chain-of-thought it doesn't solve it.


I think ChatGPT can’t help it. It has to lie.


> don't trust ChatGPT for anything

Agreed. But then it raises the question: what purpose does ChatGPT serve (other than for entertainment purposes or cheating on your HS/college exam)? If you have to verify its information by other means, then you're not really saving much effort.


It works really well for translating one "language" to another "language".

Give it some structured data and ask it to summarize it (e.g. hourly weather data and it gives a better summarization than a template based one).

Give it HN titles and the categories and it does a passable zero shot tagging of them ( https://news.ycombinator.com/item?id=34156626 ).

I'm toying around with making a "guided bedtime story generator". A friend of mine uses it to create a "day in the life of a dinosaur" stories for a child (a different story each day!)

The key is to play to its strengths rather than testing its bounds and complaining that they break in weird ways when they will inevitably break in weird ways.


> If you have to verify its information by other means, then you're not really saving much effort.

Just like any piece of code we write. We have to test, debug, verify and it still might have errors after that. And in scientific papers the conclusions are often contradicted by other papers.

The correct way to use it is to set up a verification mechanism. Fact checking, code tests, even ensembling predictions to see if they are consistent might help. In some cases we can set up a game and use the game winner as indication of correctness (like AlphaGo).

But sometimes only running a real life experiment will suffice. That's why human scientists need experiments - because humans are just like LLMs, but with external verification as part of a game (of life).


Any work where you need a reasonable scaffolding of words and verifying that output is less effort than writing the scaffolding from scratch. Plenty of fact-light writing needs to be done.


This was my initial thought as well. But I've noticed that my brain has started to find tasks that it would be quite useful for. Too bad it almost always seems to be at capacity when I think of those cases. Guess I will have to pay up to figure out if it's actually worth it.


Because ChatGPT is so new, we are in this weird period where people haven't learned that it is just as incorrect as the rest of us.

I am hoping that in a year from now people will be more skeptical of what they hear from conversational AI. But perhaps that is optimistic of me.


> Because ChatGPT is so new, we are in this weird period where people haven't learned that it is just as incorrect as the rest of us.

It’s worse than that. It’s wrong, you cannot correct it and it makes up supporting citations on the fly. Very few humans behave like that.


I can think of more than a few that regularly appear on TV.


So can I, but luckily TV is not representative of the world at large.


I can think of one who was once in the White House.


You've described pretty much every politician, or any doctor that possesses outdated information.

https://www.economist.com/science-and-technology/2023/02/22/...


I think very many humans behave like that, actually. A recent example is people claiming that Flint, MI still has leaded water.

But in the past, HN users "corroborated" that Apple is spying on them, etc. Fabrication is alive and well among us.


> A recent example is people claiming that Flint, MI still has leaded water.

Doesn't it?

According to [1]:

The residential lead service line replacement was initially set to be finished in 2019, according to a settlement agreement with the city. That deadline was eventually pushed back to the fall of 2022 and has most recently been set for completion in August 2023, according to city officials.

and

"More than 95% of lead pipes in Flint have been replaced, and we will continue the work until the job is done," Flint Mayor Sheldon Neeley said in a recent statement on the water filters.

It sounds to me a lot like Flint, MI still has leaded water?

[1] https://abcnews.go.com/US/flint-residents-urged-filter-water...


Fair point. Which makes these humans (and me, I suppose) like ChatGPT:

https://twitter.com/BeijingPalmer/status/1628585234865262593...

https://twitter.com/SCPantera/status/1127416274550546432

So ChatGPT isn't all that different from people.


No, the fundamental difference is that you can reason about your mistake, correct yourself, and try to avoid falling into the same trap again. You are capable of independent thought. ChatGPT strings words together.


Perhaps not ChatGPT, though it does show admirable skill, but it seems quite likely that its successors may well be able to do that through sheer size.

Certainly, in many conversations with ChatGPT, it is indistinguishable from a human changing their mind.

Since the process of "reasoning" is not particularly well understood, it seems a bit premature to declare it different from LLMs.


Those tweets are a lot less specific though. In general the government did fix it and the water is safe. But specifically there are still 5% of pipes that have lead in them (but apparently these are ones that are generally not for domestic supply or something).

So the exact claim matters a lot.


My comment was certainly wrong right off, though. I, a human, bullshitted just like ChatGPT.


> "...Apple is spying on them etc. Fabrication is well and alive among us."

It's more likely your claim of mass-fabrication is the fabrication.

Without knowing details of what you're talking about, I'd put money on any Big Tech outfit as the guilty party when it comes to privacy mistakes, double-dipping, dark patterns, hostile nag-patterns, overreach and stealing from the cookie jar. How I wish that was paranoia and not reasoned expectation based on years of observing the behaviour of tech giants.


Perhaps it is, and I'm not ChatGPT.


I'm so interested to find out how you learned this lie and why you decided to believe it over the people who live there.


Certainly. It's from an erroneous inversion by me of this line https://twitter.com/beijingpalmer/status/1628585234865262593...

I interpreted "Flint has clean water" as "Flint doesn't have leaded water". But of course, one doesn't mean the other.

A classic example of me, a human, behaving like ChatGPT.


It’s not incorrect like the rest of us. It’s incorrect in a very different way.

Providing detailed information on the usage of a service that has never existed is a brand new kind of incorrect that is carelessly causing the rest of us grief.


Every technology devolves to TV. The fact that you have to not only read, but write to interact with ChatGPT means 99.99% of people will not use it.

I trust Alexa & Siri completely though.


Hook it up to fMRI


Humans are capable of not bullshitting

ChatGPT can only bullshit


It is quite interesting really. I took AI in school, but I haven't dived deep into ChatGPT at all. Isn't ChatGPT just learning from the internet?

Could someone push "wrong" opinion heavily online to sway the opinion of AI?

I can only imagine a bot that learned from 4chan.


Meet gpt-4chan https://huggingface.co/ykilcher/gpt-4chan

Dreams can come true…


AI will never be totally correct. If it ever is, then we've found God.


I think some of this will take care of itself with attrition. People who lack the knowledge to fact check on the fly will give up after repeatedly getting wrong answers.


I’m also worried there’s so much potential money involved now that it’s never going away.

Even if it’s wrong, dangerous, misleading, fundamentally flawed as a concept whatever. Big tech and money will find ways to keep putting it in front of us.


I see a lot of parallels here to crypto and NFTs where people start inventing use cases for technologies that fundamentally haven’t demonstrated business value, and pray that one day business value will show up out of nowhere.


> just as incorrect as the rest of us

Even worse because it has no clue when it might be completely wrong and yet it will be confident in its answer.


That might be the most human thing it's ever done


Dunning-Kruger, provisionality and delirating are different things.


But guys we totally need to delete all of our search indexes and replace them with this instead


ChatGPT gets the rules to the pokemon trading card game wrong. It will tell you you can use 4 energy a turn. Convincingly. Not sure how it hallucinates this. The rule is 1 per turn.


A few days ago I asked ChatGPT if “pannekake” and “kannepake” are anagrams of each other.

It correctly stated that they are, but when it went on to prove that this was the case, it generated a table of the frequencies of the individual letters in these two words, and the table looked like this.

    Letter | Frequency in | Frequency in
           | “pannekake”  | “kannepake”
    - - - - - - - - - - - - - - - - - - -
    a      | 2            | 2
    e      | 2            | 2
    k      | 2            | 2
    n      | 2            | 2
    p      | 2            | 2
This reminded me that yes indeed, AI just isn’t quite there yet. It got it right, but then it didn’t. It hallucinated the frequency count of the letter “p”, which occurs only once, not twice in each of those words.


Anything that has to do with individual words doesn't work well, but as I understand it, this is an artifact of the tokenization process. E.g. pannekake is internally 4 tokens: pan-ne-k-ake. And I don't think that knowing which tokens correspond to which letter sequences is part of the training data, so it has to infer that.
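
(You can inspect the split yourself with tiktoken; the exact pieces depend on which encoding the model uses, so the four-token split above is an approximation.)

    import tiktoken

    enc = tiktoken.get_encoding("cl100k_base")  # the ChatGPT-era encoding
    ids = enc.encode("pannekake")
    print([enc.decode_single_token_bytes(i) for i in ids])
    # Prints something like [b'pan', b'ne', b'k', b'ake'] -- the model sees these
    # chunks, not individual letters, which is why letter counting goes wrong.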


Could it have been referencing Blastoise's Deluge ability? Jacob Van Wagner used it in the 2015 championship to use 4 water energy in one turn.


I just asked it, and it said you can attach 1 per turn. And then it continued something about using supporter cards to look for more energy cards, and trainer cards to switch them. (Which it also considers as using or playing those energy cards.) Not familiar with the actual rules, though. :)


Ah, I was using my friend's server, which has a slightly different model running - thanks. It's one of the davinci models, I think? I don't know much - it's code oriented. So I guess it's not 'ChatGPT' but a GPT model he built a chat on.


Isn't it just that garbage went in, got weighted as a more reliable source than it should have been, and thus garbage came out? Good old GIGO... It's just that here, ChatGPT, as much as I love it, is amazing at imparting the impression that its shit don't stink.


I tried to ask ChatGPT about implementing an SSH SFTP subsystem with github.com/gliderlabs/ssh, and in every single answer it made up some non-existent API. I could not find those functions anywhere in the codebase or on the internet, so I don't even understand how a "probabilistic model" can suggest something that has zero chance of appearing anywhere.


I don't normally go to a lawyer, but I am wondering if this is doing material harm to your brand value, which is a declared asset of the company. I think it's arguable ChatGPT has caused you financial risk.

It's unconscionable. If there were no robot in the loop here, and it was people mis-transcribing YouTube to compile, e.g., Google search optimisation, we'd call it fraud.


ChatGPT is hilariously buggy - I asked "it" how to use an open source library I made. The output was wrong, ranging from a broken GitHub URL to outright broken or nonexistent code. I suspect it may even have used private code from other libs - couldn't find some of the output it generated anywhere public.


It's just making up your library. Ask it to write some documentation, don't be specific yet, then drop a whole header or piece of code from your project into the chat.


Including the word 'phone' six times in a popular blog post is not going to help their predicament.


Wouldn't they want this post to be at the top when people search 'phone' and 'open cage data'? Seems like SEO towards correcting this is only helpful. And maybe when GPT updates data, this post gets pulled in, too. The more popular, the better, I'd guess.


Not gonna hurt either, ChatGPT data is not up to date


ChatGPT does not know how to be correct, it only knows how to sound correct.

A better name for now would be PlausibleGPT.


ChatGPT doesn't "recommend" anything. It just recombines text based on statistical inferences that appear like a recommendation.

It could just as well state that humans have 3 legs depending on its training set and/or time of day. In fact it has said similar BS.


> ChatGPT doesn't "recommend" anything. It just recombines text based on statistical inferences that appear like a recommendation.

I think that’s a bit pedantic and not very helpful… I’m not typing this comment, my brain is just sending signals to my hands which cause them to input data into a device that displays pixels that look like a comment


>I think that’s a bit pedantic and not very helpful… I’m not typing this comment, my brain is just sending signals to my hands which cause them to input data into a device that displays pixels that look like a comment

Well, if you're just fed a corpus, with no real-time first-person stream of experience that you control, no feedback mechanism, no higher-level faculties, and you're not a member of a species with a proven track record of state-of-the-art (in nature) semantic understanding, then maybe...


Does YouTube recommend you videos to watch? Does Amazon recommend you products to buy? Or do they just recombine text based on statistical inferences that appear like a recommendation?


Obviously they "just recombine text based on statistical inferences that appear like a recommendation".

And even that, they do badly.


>ChatGPT doesn't "recommend"

I mean, you could say that about a person too, as you don't know how much of what they are saying is bullshit.

For one, you are technically correct about ChatGPT not recommending; it cannot perform such an action. On the other hand, from the POV of the questioner, it's hard not to feel you are being recommended something when you ask "What do you recommend" and it says "I recommend that...". You are, for most intents and purposes, being recommended something at that point.


What would you call it instead?


"Makes stuff up." And it's us, the users, who have to realize this. I mean, I wouldn't blame OpenAI for this, at least not at this point, and the company will have to live with it, look how it can turn it into something useful instead, since there's no one to complain to.


> I wouldn't blame OpenAI for this

They're offering the tool, it's at least partially their responsibility to tell people how it should and should not be used.


Why wouldn't you blame OpenAI for creating a harassment campaign against the business based on nonsense?


A glorified Markov chain generator.

Now, humans could very well also be statistical inference machines. But they have way more tricks up their semantic-level understanding sleeves than ChatGPT circa 2023.


Markov chains are great for modeling human language and human decision making. ChatGPT demonstrates this and the results are not trivial. I don't see it being glorified beyond what it plainly does


Well for a start you could make it more obvious what your service does do. I don't know what "geocoding" is. Converting things to/from "text" is meaningless. You have to get all the way down ... way down, past authentication to the details of the `q` query parameter before it actually tells you.

At the top you should have a diagram like this:

Lat, lon <- opencage -> address

With a few examples underneath.


"Past authentication", so you're looking at the https://opencagedata.com/api page. Most people go to the homepage first. Great feedback, we should make it clearer on that page and add examples earlier. Thanks!


Ah yes - I clicked on "Makers of the OpenCage Geocoding API" on your blog post which I assumed would go to your homepage (on mobile so it's a bit harder to tell).

Your actual homepage is indeed much better.


You could probably set up a rudimentary version of the service this influx of users is looking for in the time it took to write this article. Just grab the lat/long of each US area code off of Wikipedia and there you go; at least it's something. No, it's not the phone's current position or anything like that, but IP geolocation is just as imperfect when it's not based on triangulation. Case in point: Google has plenty of IPs that geolocate to Mountain View but point to machines that are in Asia.
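
To be clear about how rudimentary that would be, here is roughly the kind of lookup I mean (the area codes and coordinates below are just a few illustrative values, not a real dataset):

    # Toy lookup: US area code -> rough lat/long of the region it was
    # originally assigned to. A handful of illustrative entries only.
    AREA_CODE_COORDS = {
        "212": (40.71, -74.01),   # New York, NY (approx.)
        "415": (37.77, -122.42),  # San Francisco, CA (approx.)
        "781": (42.42, -71.11),   # Medford / Boston area, MA (approx.)
    }

    def rough_location(phone):
        digits = "".join(ch for ch in phone if ch.isdigit())
        if digits.startswith("1"):   # drop the US country code if present
            digits = digits[1:]
        return AREA_CODE_COORDS.get(digits[:3])  # None if unknown

    print(rough_location("+1 415 555 0100"))  # -> (37.77, -122.42)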


Related: one reason I just started using the Rainforest API is because GitHub Copilot recommended it.

But also, last night I tried for 30 minutes to get it to write me some fairly simple HTML parsing code. The tricky part was that I couldn't use DOMParser since it was running on Cloudflare Workers, and it could never produce any working implementation using HTMLRewriter or regex, no matter how many examples I gave it.


Are you aware of the pitfalls of parsing HTML with regex?


Yes, but it started to try to use regex so I thought I'd see if it could at least be successful and it wasn't. Despite super simple HTML.

Anyways, I wrote a solution using HTMLRewriter in 10 minutes...


I'm an attorney. I've typed legal questions into ChatGPT and it has spit out answers that are grievously, 100%, libelously wrong. It has named individuals and said they committed crimes, when it is unquestionable they did no such thing.

I'm waiting for people to start calling me to ask questions about something ChatGPT said, and I'll tell them it's wrong. Then they'll start arguing with me and saying if ChatGPT said it, it must be right, and I must be wrong. And then I'll need to waste time proving that this idiotic chat bot that is spewing out garbage is, in fact, spewing out garbage.


You're trying to use a language model as an information reference. A translator can explain what a diplomat is saying but they can't perform their whole job.


? ChatGPT did not simply explain what someone else was saying. It created something completely new and completely false.


Way to miss the point.


Which was?


The biggest takeaway for me was that it was getting info from YouTube videos. Is it actually watching and learning from the videos, or were links to GitHub just included in the comments?


I think this is just an incorrect assumption on the part of the blog authors.


Cofounder here. We traced it back to two YouTube videos where a developer is coding a phone tracking solution. The map then shows the geographical center of India and it's claimed that this was the correct location. Then other users started putting the code on GitHub, then it got forked, then people created libraries, and now AI tools pick that up. We already tried contacting the YouTube authors, and we left comments on the GitHub repositories for months. Now we have to learn how to deal with ChatGPT. We also have no idea why a YouTube author would describe a completely non-working solution over 20 minutes.

Screenshots: https://imgur.com/a/sNR87c7 You can see the OpenCage logo on the bottom right of the images. We wrote a separate blog post about that about a year ago; we felt today's blog post would be too long if we added those screenshots, too.


But these videos were not part of the ChatGPT training data. This is most likely caused by the code on GitHub.

The claim in the blog post that "ChatGPT has picked up that content" is false.


Transcripts exist, and if there are multiple YouTube tutorials out there, the odds are very good the same content also exists as a few dozen plain-text articles.


> All suggestions are welcome.

They have to get an API key from you. What about a large warning at the start of that process telling them that this isn't a service you provide?



If you have to tell potential customers you don’t do something, maybe you should just do it instead.

ChatGPT as business line lead generator—is there anything it can’t do?


I remember a time when "I saw it on the internet" was a punchline for a joke about someone who's gullible or misinformed.


Fast, creative, and wrong isn't a winning trio. This is more evidence of ChatGPT being evolutionary and not revolutionary.


As a data scientist who has created AI applications and built many models over the last 10 years, I can say beware of ChatGPT. AI derived knowledge should be used only by those who understand its limits.

One of the simplest AIs is a recommender. We put guardrails on using its predictions inside ecommerce apps by limiting what it learns from (purchases, for instance) and limiting what it is used to predict (purchases). When Facebook uses a recommender, it learns from time-on-site (a value to FB but not necessarily to the user, and a complex behavior that can be comprised of many non-beneficial sub-behaviors) and uses it to recommend things that lead to more time-on-site. This application is dangerously devoid of guardrails, as so much recent evidence has shown.

Now we have a text-generating AI that has been trained on a great swath of human knowledge. That means the teachings of Gandhi as well as Hitler, etc. What do you expect it to "know" as truth? Generative AI that is used to generate thoughts from this training corpus MUST have contradictory and downright evil ideas, since it has no way to judge between the ideas it learns from.

Generative AI in this form can be nothing but psychopathic until guardrails can be devised to limit its psychopathic responses OR the corpus it learns from can be labeled in a way to flag what is "bad", if we can even agree on what that means.

Psychopaths can be useful if they are knowledgeable but beware, you are talking to a psychopath in ChatGPT.


Seeing the amount of effort people put into hacking/optimizing PageRank SEO, we will see lots of prompt manipulation by all businesses if ChatGPT becomes the de facto search. Preventing system gaming is going to be 1000x more difficult for an LLM, which is kind of a black box.


> This is not a service we provide. It is not a service we have ever provided, nor a service we have any plans to provide. Indeed, it is not a service we are technically capable of providing.

I'm curious: why not? It seems like a lot of people would be interested in this if you could figure out how to provide it.


How would this work?

If a phone number is for a mobile phone then looking up the location doesn't make sense at all: mobile phones are mobile.

I guess you could try and crawl an index of business phone numbers and associate those with the listed address for businesses, but that's a completely different business from running a geocoder.

You could provide a bit of geographical information about the first three digits of a US phone number. I imagine that's not what users are actually looking for though.


Phone numbers have geographic structure. For mobile phones it's just the area code, but for landlines there is also information in the exchange portion. For example, I grew up in Medford, MA, which is 781-39x-xxxx.

I expect there are also patterns in other countries?


> I expect there are also patterns in other countries?

No, I believe this concept of mobile phone numbers being tied to localities is specific to just a few countries - the USA, Canada, and I think South Africa - due to how local calls were billed there; for all other countries, mobile numbers are allocated from a single country-wide pool, so their mobile phone numbers have no geographic structure other than the country code.


What if there were a service where anyone can opt in to share their own location information? If there were a way to get across the hurdle of how the service provider knows the person is indeed who they claim to be…

My thought is that something like a phone number, email address, or domain name bypasses this problem. It is fairly trivial to verify that you have access to a phone number, email address, or domain name. Feels like this is one of those chicken-and-egg problems though. Why would anyone list their location at my service if nobody queries it, and why would anyone query my service if nobody lists their information with me?


Number portability. While there can be a small amount of correlation to the zip code, that quickly breaks down with ported number blocks, and they can end up anywhere in the US.

Also, with VoIP the number might reflect where calls to it avoid long-distance fees, but that has nothing to do with where it actually is.


Not in any European country that I know of. For example, the UK's geographical codes start with 01 and 02 (020 is London, 0238 is Southampton, 02820 is Ballycastle) and are never assigned to mobile phone numbers, which always start with 07.

Other countries work in much the same way, except the prefixes are, of course, different.


Yeah, but you can probably use https://github.com/google/libphonenumber for that.
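
For what it's worth, the Python port (the phonenumbers package) exposes that kind of coarse lookup directly. A rough sketch (the number is a fictional 555 example; the granularity you get back is usually a region or just the country):

    # pip install phonenumbers
    import phonenumbers
    from phonenumbers import geocoder

    num = phonenumbers.parse("+1 617 555 0123", None)  # fictional 555 number
    # Describes where the number was issued (often just a state or country,
    # especially for mobile numbers); it says nothing about where the phone is now.
    print(geocoder.description_for_number(num, "en"))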


This is what the inaccurate YouTube tutorials that they complain about in the linked blog post do. It only gets you country-level lookups for most phone numbers, county level at the very best. And it turns out users don't want that.

There's also the issue of moving. I have a Florida phone number. I haven't lived in Florida in a long time.


The service is possible:

If you are a mobile network operator.

Or, you can convince people to install something on their phone that sends you their location along with their phone number.


How would you go about reliably providing the location of someone's mobile phone without being their cell phone carrier?


By partnering with said cell phone carriers.

But I hope it would be illegal.


If movies have taught us anything, they need to remain on the call for at least 30 seconds for it to be traced.


> a lot of people would be interested in this

you mean like scammers and stalkers? (ok, and probably Meta)


You mean if they could figure out how to illegally track millions of people?


Soon we are going to have an AIrobots.txt


Is this not defamation, at least in some jurisdictions?


lol it recommended their API and gave Python code for using it

but the real API doesn't give the results that the user asked ChatGPT for

that is amusingly alarming


Not quite as alarming as these people most likely trying to stalk someone without their permission.


> Not quite as alarming as these people most likely trying to stalk someone without their permission.

It’s so common to want to know where an incoming call comes from that it’s built into iOS. It has nothing to do with stalking, just with guessing whether whoever is calling you is a scammer or a company trying to sell you stuff.


It’s pretty simple to look up where a phone number was issued; you can get a map or table that does this. I guess these people want the current physical location of the mobile phone. Either way, these are not customers you’d want.

Edit: reading the blog post from the same company linked above, it is indeed people using an external API for what is an incredibly simple country-code lookup. It is a shame that programming has come to this and that ChatGPT continues to propagate it. One way they could solve the problem would be to provide sample code that does the same thing using a built-in table, without using their API service (see the sketch below). Sure it’s work, but not much, and it will get people off your back ASAP.
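
Something like this is all that sample code would need to be (a sketch with a few illustrative prefixes; a real table would obviously be longer):

    # Tiny built-in table: international dialling prefix -> country.
    # A handful of illustrative entries, not a complete list.
    PREFIX_TO_COUNTRY = {
        "1": "USA/Canada",
        "44": "United Kingdom",
        "49": "Germany",
        "91": "India",
    }

    def country_for_number(phone):
        digits = phone.lstrip("+").replace(" ", "").replace("-", "")
        # Country codes are 1-3 digits; try the longest prefix first.
        for length in (3, 2, 1):
            country = PREFIX_TO_COUNTRY.get(digits[:length])
            if country:
                return country
        return "unknown"

    print(country_for_number("+44 20 7946 0958"))  # -> United Kingdom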


I'm willing to bet people asking ChatGPT to help them resolve a phone number to a location are much more likely to be stalkers than people who are trying to identify spam calls.


The obvious follow-up is to create the non-existent API endpoint but hook it into GPT so that it can hallucinate a convincing address based on the phone number. Take the GPT API key as input so that the caller is paying for this.

Bonus points for using ChatGPT to implement this end-to-end.


Our jobs are safe! For now...


Until someone figures out that we are all just hallucinating completely wrong code.


If this business suffers financial or reputational damage because of ChatGPT's misinformation, should OpenAI be liable?


It hallucinates that you can use 4 energy per turn in the Pokémon TCG and confidently tells you so. No idea where that would come from.


It's not like ChatGPT made this up. There were pre-existing YouTube tutorials and Python scripts available that used OpenCage and purported to do this. OpenCage even blogged about this problem almost a year ago[1].

Honestly it looks more like OpenCage is trying to rehash the same issue for more clicks by spinning it off the hugely popular ChatGPT keywords. I wouldn't be too surprised if they created the original Python utilities themselves just to get some publicity by denouncing them.

1. https://blog.opencagedata.com/post/we-can-not-convert-a-phon...


Hi, Ed from OpenCage here, author of the post.

We do have python tutorials and SDKs showing how to use our service for ... geocoding, the actual service we provide.

I wrote the post mainly to have a page I can point people to when they ask why "it isn't working". Rather than take the user through a tour of past posts, I need something simple they will hopefully read. But fair point, I can add a link to last year's post about the erroneous YouTube tutorials as well.

What I think you can't appreciate is the difference in scale. A faulty YouTube video drives a few users. In the last few weeks ChatGPT has been sending us several orders of magnitude more frustrated sign-ups.


I get frustrated at the number of things ChatGPT gets blamed for that aren't its fault. It is completely understandable that, if there are repos out on GitHub like the one for Phomber[1], ChatGPT would find that code and have no idea that it was phoney. Suggesting that ChatGPT just made this up out of thin air when you know it didn't is not very responsible.

1. https://github.com/s41r4j/phomber


You are blaming the victim. OpenAI is to blame.

They know what they are doing. They provide something that sounds over-confident about anything it says, knowing full well that it can't actually know whether what it generated is accurate, because it is designed to generate plausible sentences using statistics and probabilities, not verified facts from a database. On top of that, they trained it on an uncontrolled set of texts (though IIUC even a set of verified texts would not be enough; nothing guarantees that an LM would produce correct answers). And they provide it to the general population, which doesn't always understand very well how it works and, above all, its limitations. Including developers. Few people actually understand this technology, including myself.

Inevitably, it was going to end up causing issues.

This post factually presents a problematic situation for its authors. How ChatGPT works or how it can end up producing wrong results is irrelevant to the post's authors' problem. It just does, and it causes trouble because of the way OpenAI decided to handle things.

And it's not "fair enough, because this false stuff can be found on the internet".


OpenAI is providing a language model that has some understanding of the world (in order to do that language model thing) and is surprisingly correct in some situations as a knowledge base.

The key thing is that it isn't a knowledge base. It doesn't claim to have correct information about the world. It has the ability to translate a question in natural language into what would be an answer in natural language - but it isn't necessarily correct, because it's about the language rather than the knowledge.

People misusing the LLM as a knowledge base are at fault just as a person misusing a CD tray as a cup holder is at fault if it doesn't work correctly as a cup holder.


> People misusing the LLM as a knowledge base

… are doing this because it's an intuitive thing to do and because they don't understand that

> The key thing is that it isn't a knowledge base. It doesn't claim to have correct information about the world

and that it's

> because it's about the language rather than the knowledge.

or what the implications of all these things are.

And:

> is surprisingly correct in some situations as a knowledge base.

This is key. It seems to work well for this use case so people end up thinking it does.


Phomber is not the best example. Ed contacted the developer of that tool over a year ago about the issue and to remove mentions of OpenCage, and as far as I can see the author removed it: https://github.com/s41r4j/phomber/issues/4


That seems like a pretty nasty assertion to bandy around with zero evidence.


I cannot think of any other reason why the new blog post wouldn't have mentioned the obvious connection to the earlier issues that they had. They want to make it seem like ChatGPT invented this use case, but they know that the sample code ChatGPT learned from was mentioned in their previous blog post.


There's a vast chasm between "whoever wrote this article didn't think to link to a similar issue a year ago" and "the first incident was a malicious hoax".


I don't understand the original comment to suggest that. Rather: it's a known issue. ChatGPT does nothing new, and certainly doesn't do it by itself -- it just rehashes what others have already written. Like Google might send you visitors for something that's not even present on your website because others link to you mentioning it.

What the comment suggested was that they're now bringing this up again to get attention (and links) since it's combined with ChatGPT. That's not "malicious", but it's also not exactly "wow, we just realized this happens".


What the comment suggested is that the company deliberately created tools using their own API in a wrong way in order to write a blog post about it.

If that's not an accusation of being malicious I don't know what could be.


[flagged]


Ed is my co-founder; he writes all our blog posts because I suck at writing. He also does more than half of our podcast episodes https://thegeomob.com/podcast (the guy on the left). Last I saw him (yesterday) he was real.


That's another apparently evidence-free accusation.

Is there some undisclosed bad blood here?


No history at all. Do you have any undisclosed relationship? Hmm. My post was getting traction, then suddenly it's downvoted to the bottom. Maybe it was a social media management platform. Do you know anyone who runs one of those?


You broke the site guidelines pretty badly in this thread, even stooping to personal attacks. We ban accounts that do that, so please don't do that.

https://news.ycombinator.com/newsguidelines.html


Just re-checked the org chart. There's no social media personal assistant.


There's also no clear motive. They want to attract users to a fake feature on their free tier?


That explains why ChatGPT is confused.

It may be an old problem, but I guess users are more used to a random YouTube video having wrong information. But the computer is always right, so ChatGPT is always right, so users may be more annoyed to discover that the recommendation is wrong and blame the company instead of ChatGPT.


Devs making baby’s first mobile app add “request location information” permissions, the devices start giving them the phone’s GPS information in the form of lat/lon pairs, and those devs naturally look for a service to make that data useful. What they want is “reverse geocoding”, i.e. take a lat/lon pair and return information that makes sense to a human (country, state, nearby street address, etc.).

This is a service that OpenCage provides, and for whatever reason OpenCage happens to be one of the popular services for this use case. (Maybe it’s because you get the text description of location back right away without having to do a round trip through a heavyweight on-screen map, maybe their free tier allows more requests than most, maybe their api is easier to use, maybe they are lucky or skilled with SEO and their tutorial happens to be the first result for some common phrases, who knows.)
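
(To make that happy path concrete, a reverse geocode request looks roughly like the sketch below, using the requests library against OpenCage's documented JSON endpoint; YOUR-API-KEY is a placeholder and the response field names are from memory of their docs.)

    # pip install requests
    import requests

    params = {
        "q": "52.5200,13.4050",  # example coordinates (central Berlin)
        "key": "YOUR-API-KEY",   # placeholder, not a real key
    }
    resp = requests.get("https://api.opencagedata.com/geocode/v1/json",
                        params=params)
    # Each result should include a human-readable "formatted" address string.
    print(resp.json()["results"][0]["formatted"])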

So there’s this process that starts with a search for “convert phone location to address”, often involves the OpenCage api, and ends with a happy developer getting the information they wanted. Various algorithms pick up on the existence and repeated traversal of this happy path.

In another part of the internet, code tutorial content farms notice a demand for determining an incoming call’s location from the number that’s calling. They search for things like “convert phone number to location” and “convert phone number to address”. Some of these searches end up falling into the nearby well-trodden path of “convert phone location to address” and the content farmer is presented with the OpenCage api. They mess around with the api for a bit and find they can start from a phone number and get a successful api call that returns a lat/lon pair. A successful api call that returns legitimate-looking lat/lon data is all they need to make a video, they make it and post it. Higher-quality, more scrupulous code tutorials attempt to answer this same demand but find it’s not possible, so those tutorials don’t get made, leaving the less scrupulous ones that stop with a successful-looking api call to flourish in this space. The tutorial is doing well, so the content farms endlessly recycle it into blogspam.

As a result, OpenCage starts getting weird usage patterns, tracks them down, finds the source is these tutorials, and makes a post about it.

Some time later, ChatGPT is released. People are astounded with its ability to write code and start using it for this purpose. Naturally, some of those people have the same demand as the previous generation of devs who stumbled onto the unscrupulous code tutorials. Because of the blogspam, ChatGPT’s training data includes many variations on the tutorial, and just as naturally it ends up reproducing that tutorial when asked - except ChatGPT’s magic kicks in and instead of including (what its embeddings see as) some weird unrelated area-code-to-string nonsense from the tutorial, it just bullshits some plausible-sounding data plumbing code instead. Unfortunately, because the tutorial never worked in the first place, that weird hacky irrelevant bit that ChatGPT ignored happened to be the secret sauce that makes the whole thing superficially appear to work.

As a result, OpenCage starts getting weird usage patterns, tracks them down, finds the source is ChatGPT, and makes a post about it.

In deference to Hacker News’ policy of keeping comments pleasant, I will elide the analysis of the process that leads to comments accusing OpenCage of nefariously engineering the whole thing for attention.


Thanks for the above. (nice self-restraint in the last paragraph.) Things almost make sense now. Except one problem ... this implies that there are software developers who think to themselves "given a cell phone number, how can I get the phone's location?".

And it further implies that these people don't immediately follow that thought with: "That's surely impossible, since it would be a privacy nightmare if literally everyone in the world could track everyone else in the world's every move".

Or perhaps with this alternative thought, which would lead to the same conclusion: "let's not worry about privacy, how would this even work? Does every phone company in the world pro-actively send every customer's location data to OpenCage, just in case someone queries it? Or does OpenCage wait until it gets a query, and only then query the cell phone company 'just-in-time'? Both of these sound like a lot of work for each phone company to support ... what's the incentive?"

Honestly, I'm a bit surprised that the OpenCage blog post is so calm about this, instead of just yelling incoherently "why WHY why would anyone think like this?!?"



