I used to use GOOG-411 all the time before I had a smartphone. I must have provided so much training data that it's no surprise Google has, from early on, been very good at speech-to-text conversion of my particular accent :D
I don't think it's the same purpose. YouTube, TV, and movies offer enough speech samples, a lot of that content is dubbed into other languages, and a lot of it already has transcripts available.
They know who's calling, and the greeting was something like "Hello again". They are catching up at building a competitive database of persons and their preferences at the scale of FAANG. They're moving over from collecting info for their models to collecting info from their users for their agents. This is what they need to offer good agents.
But I might be wrong and it's just phoneme collection, as you speculate.
Regular human conversational voice, especially over the phone, is going to be a gold mine for training customer support AI agents. Actors reading movie scripts can't really provide that amount of relevance.
Agreed on the broader use of data. That said, it’s not just about phoneme collection—different channels and UX modalities reach different audiences and contexts. Each channel ultimately delivers unique inputs, fueling more specialized and robust models tailored to those specific use cases.
The best part of GOOG411 was that they would connect you to the phone number, free of charge, across borders.
List a business with a Google voice number and you can call in, check messages, and _dial out_ from Google voice. Free international calls!
I was in school in Canada where we had a payphone in a hallway. People heard me randomly saying "Funny Business Name, City State ... Connect me" into the phone so much, it became a running joke.
When I eventually got my own phone, I transferred the number and I still have it.
Does anyone else remember a very short lived Google experiment that allowed you to call a number, vocalize your search, and somehow without any additional steps, the results appeared on the browser in front of you? (which was not connected to the phone, or even logged into a Google account)
Sweet little duckling. Before the Internet you had to call a human on a phone to find phone numbers. 411 was a widely known number, similar to how widely known 911 is today.
On coast to coast flights, there's often not a good way of knowing what movies are available until after you've left cell coverage. This makes simple research like checking the IMDb score challenging.
Alaska Air has a whitelist of messaging services that you can use for free during the flight. WhatsApp is on that list.
So if you want to research obscure plane movies on an Alaska flight, you can connect to their wifi and message either WhatsApp's built-in LLaMA or now ChatGPT.
I get it's just an example, but are we really this far gone? Just watch a random movie and if you don't like it, pick another. This is such an extreme micro-optimization of a small experience.
You don't often know if a movie is good until you've finished it, because it all depends on how the story came together in the end.
You can spend 2 hours watching a moving, emotional story that teaches you something new about the human condition and the choices we make in our lives.
Or you can spend 2 hours on something that turns out to be full of plot holes and inconsistent characters, where nothing makes sense in the end and you've utterly wasted your time.
In what universe would you not want to have that information before watching? Especially if you're generally a busy person and only get to watch 10-20 movies a year.
I truly don't understand the attitude of "just pick one", whether it's for movies or other things. That reviews are "micro-optimization". Like, do you just not value your time? Do you not care about quality?
It's not like reviews are always right. But one film with 98% on Rotten Tomatoes vs. one with 45%... that's a really strong signal. Why on earth would you choose to ignore that?
You aren't wrong, but it's not an argument you're going to win. A bad restaurant or a dumpy hotel, etc., won't kill you either, yet most people rely heavily on crowdsourced reviews. It's just a part of the culture today, given how prevalent ratings are. This isn't 1940, so suggesting, out of the blue, "just go watch the movie regardless of whether it's any good" isn't going to convince someone to do so.
> Life is about more than optimizing the movies you watch.
Where did I say it wasn't? That's a straw man.
But if you're going to watch a movie for the next two hours, then yeah -- your life is going to be about that movie. So why not choose wisely?
> Watching a bad movie is not going to harm you. Maybe you'll take something away, maybe you won't.
Straw man again. And again -- why not choose quality instead of choosing ignorance and rolling the dice?
> Much like having a bad day is unlikely to ruin your life - it'll just give some nice context to the good days.
Again, straw man. Nobody's talking about ruining your life. But why intentionally choose a bad movie...?
> And we're talking about watching them on the plane, so the "busy person" argument really doesn't apply here.
To the contrary. For a lot of busy people, the plane is one of the few moments they have time to watch a movie. So it sure does apply.
You're arguing in favor of choosing bad things because it's not going to ruin your life. Huh? Shouldn't we have a higher bar for the things we choose to spend our time on? You're describing standards that are the lowest of the low -- as long as it doesn't harm you, it's fine. Don't seek anything better. Yikes. I've rarely come across a life philosophy more depressing.
This isn't a straw man - I'm not claiming you think life is all about movie optimization. I'm making the point that the effort of optimization might not be worth it in the broader context.
> Straw man again. And again -- why not choose quality instead of choosing ignorance and rolling the dice?
Also not a straw man. I'm illustrating that the downside of a bad movie is so minimal that extensive optimization might not be justified. This directly addresses your argument about opportunity cost by suggesting the cost is actually quite small.
> Again, straw man. Nobody's talking about ruining your life. But why intentionally choose a bad movie...?
Again, not a straw man. I'm making a proportionality argument about how much a sub-optimal movie experience actually matters in practice.
> To the contrary. For a lot of busy people, the plane is one of the few moments they have time to watch a movie. So it sure does apply.
Even on a plane, the stakes just aren't that high. A less-than-perfect movie isn't going to meaningfully impact your life regardless of how busy you are.
> You're arguing in favor of choosing bad things, because it's not going to ruin your life. Huh?
You're interpreting my position as "arguing in favor of choosing bad things," but that's just not accurate. I'm suggesting that the effort of optimization might outweigh the minimal downside of occasionally watching something mediocre. There's a middle ground between actively choosing bad things and obsessing over choosing only the very best.
> A less-than-perfect movie isn't going to meaningfully impact your life regardless of how busy you are.
There are movies I've seen that changed my life. If I'd watched a dumb movie instead, yes my life actually would have been meaningfully impacted for the worse. That's the power of art.
> I'm suggesting that the effort of optimization might outweigh the minimal downside of occasionally watching something mediocre.
It takes a few seconds to check Rotten Tomatoes. A movie is around two hours. In what universe would you rather waste a couple of hours in order to save a few seconds?
And it's not occasionally watching something mediocre. Most movies are mediocre. You have the choice of usually watching something mediocre, versus usually watching something high-quality.
Again, you're strawmanning with "obsessing over choosing only the very best". Where did I describe an obsession? I'm just saying, check Rotten Tomatoes to help pick a good movie. There's just no universe in which the tiny effort to do that is going to outweigh the two+ hours of boredom and frustration of a bad movie.
I genuinely don't understand how you can take the position you're taking with movies, when checking Rotten Tomatoes takes seconds (a minute if you're checking several) and a movie lasts for hours.
It's not perfect, but it's a very strong and useful signal.
Never in my life have I seen something with 98% and thought, well that was a crap movie.
And never in my life have I seen something with 35% and thought, that was amazing!
It's more in the 75-90% range where you have to consider the "dimensionality" of the thing, like whether it's a genre you like, or which individual reviewers match your tastes more precisely.
Because life just isn't perfect. If you go seeking this perfect optimization in every aspect of your life then the world is going to be a continual disappointment.
To me, unanimous positivity means a boring movie that takes no risks. I'd rather watch a people-love-it-or-hate-it film.
Usually tho I just watch trailers and judge the actors' chemistry to decide if I would enjoy watching those characters. What other people thought of it is not especially relevant. Particularly on flights I've watched some amazing foreign content that I just would not have stumbled upon if I was just watching whatever topped rotten tomatoes.
Better to bring your tablet or a similar device with your choice of content. Airplane screen quality is bad, and the movies are edited in weird ways (to be acceptable for all ages and cultures, at the expense of everything else).
I would expect nothing but hallucinations and nonsense coming out of any LLM regarding recently-released movies (aka. the ones you often find on flights).
In every post about LLMs there is someone to blindly say something like this.
When in reality if you ask ChatGPT for 10 good movies from this year you will get this.
Anora - Directed by Sean Baker, a compelling drama about the life of a sex worker in Coney Island.
Challengers - A provocative tennis drama directed by Luca Guadagnino, starring Zendaya.
Dune: Part Two - Denis Villeneuve's continuation of the epic science fiction saga.
Furiosa: A Mad Max Saga - An action-packed prequel exploring the origins of Furiosa, directed by George Miller.
Inside Out 2 - Pixar's sequel that dives deeper into the complexities of human emotions.
Wicked - A musical fantasy adaptation directed by Jon M. Chu.
The Zone of Interest - A thought-provoking film about Auschwitz, directed by Jonathan Glazer.
The Idea of You - A steamy romance starring Anne Hathaway.
Hit Man - A comedy thriller starring Glen Powell.
The Outrun - A powerful drama about a recovering alcoholic, starring Saoirse Ronan.
Let me know if you'd like more details about any of these!
Those descriptions are less detailed than the information you will see on basically any streaming interface, and yet they still manage to not be very good. For example, no person who had actually seen Anora would describe it as "a compelling drama about the life of a sex worker in Coney Island".
I haven't seen Anora so I'll give you that one, but you cited that as if it was just one of many examples, when in fact I think it's the only one, as all the other descriptions seem reasonable.
Originally the problem was supposedly that it would hallucinate complete and utter gibberish, but now here we are quibbling over one example and insisting that maybe it's not quite as good as alternative descriptions.
The gap between what was produced and what you're looking for is small enough that I think it could be covered with some slightly tweaked prompt instructions.
I'm not saying you're wrong but want to note how the goalposts keep seeming to shift whenever we talk about these capabilities.
I'm not Alupis. I can't and am not trying to speak on their behalf. I'm therefore not moving the goalposts they established. I'm making my own related point.
That point is that the information provided above about these movies is worthless. It does not add any new value beyond what would already be available in the streaming interface. Several of the descriptions are nothing but the genre and one person involved in the making of the movie. And yet even with these descriptions being incredibly short and vague, they still manage to contain at least one misleading summary.
I'm aware that you're a different commenter, but you are addressing a comment that was a reply to them, and it's therefore not necessarily appropriate to measure that comment against entirely new criteria that you want to bring into the conversation.
Despite your protestations to the contrary, these descriptions seem perfectly fine in that they're accurate and meaningful. And if you want to start playing fast and loose with all kinds of new extra criteria and requirements for what it's supposed to do, they all seem squarely within reach of the capabilities on offer, with some prompt tweaks.
>these descriptions seem perfectly fine in that they're accurate and meaningful
The description of Wicked doesn't mention either The Wizard of Oz or the Broadway musical. So yes, the descriptions don't contain obscene mistakes like calling Wicked a courtroom drama. If that is enough for you to call these "accurate" while ignoring the vagueness or the 1 in 10 failure rate on the Anora description, fine by me. But you must have some weird definition of the word "meaningful" to apply that to descriptions like the one of Wicked. That simply isn't a helpful way to describe that movie.
The comment thread you're at the end of started with this:
> I would expect nothing but hallucinations and nonsense coming out of any LLM regarding recently-released movies (aka. the ones you often find on flights).
The comment that replied to it (the one that you're arguing against) provides evidence that proves it wrong. You are correcting someone who isn't incorrect, and I think the person you're responding to is very justified in saying you're moving the goalposts here.
A reply downthread is not an endorsement of everything said upthread. I'm happy to discuss the points I made, but I’m not going to be made to defend something I didn’t say.
Well, if you're not endorsing what was said upthread, then your comment is a complete non sequitur. The parent comment said "LLMs can't give movie recommendations for recent movies because they'll hallucinate or spout nonsense", the next comment responds with a list of accurate movie recommendations, and then you come in and say this:
> Those descriptions are less detailed than the information you will see on basically any streaming interface and yet it still manages to not being very good.
The points you made were not relevant to the discussion at hand. It's like if people were having a debate about where to find the best tacos in town and you stepped in to say "tacos aren't as good as hamburgers, you know" and then got upset that nobody wanted to debate that point with you. It's not everybody else's fault if you don't understand how conversations work!
I don’t know why you are letting that one reply define the bounds of this conversation. My comment was directly relevant to the first comment in this thread and the comment I was replying to.
I _have_ seen Anora and I think that description is perfectly fine. It certainly isn't "hallucinations and nonsense" which is what the parent comment is claiming. What part of that description do you consider "wrong"?
For comparisons, Wikipedia's opening paragraph for Anora reads:
"Anora is a 2024 American comedy-drama film written, directed, and edited by Sean Baker. It follows the beleaguered marriage between Anora (Mikey Madison), a young sex worker, and Vanya Zakharov (Mark Eydelshteyn), the son of a Russian oligarch. The supporting cast includes Yura Borisov, Karren Karagulian, Vache Tovmasyan, and Aleksei Serebryakov."
I haven't seen the film, but it doesn't seem incompatible with ChatGPT's briefer description.
Instead of just checking with a first party source, you ask a statistical guessing machine for an answer.
There was a disagreement about the answer, so we needed to dig deeper.
You bring up Wikipedia, a third-party source of information. That description could also be wrong (it's probably not, but stick with me).
Instead of just checking with a first party source (IMDb is very easy to search on), we went through several layers of obfuscation.
This was an issue for Wikipedia early on, but it has citations, at least. AI doesn’t and doesn’t have an army of people constantly fact checking every answer generated either.
There’s no benefit to asking AI for information like this, especially since the in-flight summary has accurate information that’s more than “drama, sex worker, Coney Island”.
Maybe something like perplexity is better, since it has citations, but I haven’t tried it for very long yet.
They were responding to a commenter suggesting it would produce completely unusable results, the question was never about whether the results produced would be redundant.
I know that any mention of fallacies, valid or otherwise, causes instinctive eye rolls, but in this instance I agree with them that this amounts to moving the goalposts.
This type of response is called moving the goal post. When someone responds to one claim, the claim is changed to something different which was not part of the original argument. This is debating in bad faith.
Great! Now show me a system that can verify that list for accuracy as well. Not to be flippant, but this is the complaint. You can't approach outputs uncritically. And no I don't want it to be as unreliable as a person who also forgets how English or basic knowledge works at random intervals.
They were responding to a comment that suggested that this was a category where the only thing you would get is unintelligible gibberish.
You don't even seem to be disputing the actual results here, just gesturing towards a kind of philosophy class exercise of whether we can ever "really" verify its accuracy. I see Wittgenstein's name increasingly tossed around in these parts (a good thing!), so I'll just note that one of the reasons he's hailed as one of the great philosophers of the 20th century is because he felt these puzzles about "really" knowing were frivolous.
I don't think I agree that what's needed here is some new and extra process of verification. I think the same usual quality control criteria that are already being used are good enough in this case.
> Great! Now show me a system that can verify that list for accuracy as well. Not to be flippant, but this is the complaint. You can't approach outputs uncritically.
In general you can't, but surely it's not that big a deal if ChatGPT offers an inaccurate summary of a movie you're about to use to kill time on a flight? I suppose it becomes important if, e.g., you're relying on it to tell you whether a movie is appropriate for children, but, if you're just asking it whether a movie is worth watching, that's a question that doesn't have an objective, factual answer anyway, so a hallucinated answer is probably about as useful as that of a not-previously-known reviewer.
> If I invested money into a film, I would want its representation in the world to reflect what the movie is about at the very least.
Sure, but that's the filmmaker's interest. As someone sitting on a plane trying to decide whether to watch a movie, I care about my interest, not that of the person who made it. I'm not particularly arguing for the use of ChatGPT here (I wouldn't use it), just that the risks it usually poses are fairly minimal in this case.
You're forgetting the information hazard of five years from now someone mentioning a movie and you saying "oh I didn't want to watch that because of the car chase" and everyone looks at you funny because it is a film set in the 1700s about a carriage driver.
You’d be pretty wrong, then. ChatGPT in particular will cite its sources via an internet site.
My wife wanted a pair of boots for Christmas that I couldn’t find in her size. Google was a wasteland of SEO, but ChatGPT found 5 sites and was able to tell me current stock levels.
Looks like this is using GPT-4 and has no knowledge after January 2022.
> As of my knowledge cutoff in January 2022, the last movie I have information on is "Spider-Man: No Way Home", which was released in theaters in December 2021. It was one of the most highly anticipated films of that year, marking a major event in the Marvel Cinematic Universe (MCU) and the Spider-Man franchise.
Here's a comparison of asking ChatGPT and Meta AI about actual in-flight movie choices.
I pasted the same initial prompts in both, but Meta AI needed more clarification. When ChatGPT found multiple entries with similar titles, it gave information about all of them.
>[The Campaign] received mixed-to-positive reviews from critics. On the Rotten Tomatoes website, it holds a 65% approval rating from critics, based on 191 reviews, with an audience score of 60%.
The first thing I fact-checked, the Rotten Tomatoes scores are actually 66% and 51% respectively[1]. Probably not enough of a difference to sway any opinions, but an excellent example of the type of inaccuracy that the previous comment was referencing.
Llama in WhatsApp can search the web, so usually gets these queries right.
Hilariously it often believes that it can’t access the web and then hallucinates reasons for how it can know things beyond its knowledge cutoff date. But in any case, it works very well for this use case.
Web or other search access for LLMs really isn’t that new anymore, and I doubt that Grok will do a statistically significant sampling of everything on X, so I don’t really expect it to fare much better than a model with access to regular web search.
So the perception of those aged 50+ is that they're so far removed from technology they’d prefer to use a telephone to avoid their discomfort with computers?
I’m well into this group and still make a lot more api calls than phone calls.
Fresh out of college I recall vividly thinking, I’ll need to build an impressive list of side projects to overcome preconceptions about how much I can truly offer at my age. Maybe nothing has changed.
I had ChatGPT read through my recent bloodwork results, and it helped me understand them better than my doctor did.
People 50+ are going to be so addicted to this thing it's not even funny. My parents are not reaching for AI immediately yet, but that's just a "yet". This is the wave that could come at any moment.
My dad sells farming-related equipment to mostly older people and there are still people more comfortable giving him their credit card info over the phone instead of purchasing on his website online.
(Though I see that as mostly a failure of our financial industry. Credit card numbers should be obsolete by now.)
Just one data point but my father is in his 70s and has never owned a smartphone, when he wants to Google something he goes to the computer in the basement. On the other hand there are landline extensions all over the house. So yeah it would be more convenient for people like him.
the idea that someone who was 20 in 1995 is too old to be comfortable with computers is a horrifying and offensive stereotype that deeply worries me for my own future
our industry is old enough that the first generation of pioneers has died of old age.
Do you really think someone who grew up with computers in the 80s is incapable of using a smart phone? These are people who are still in the workforce today. These are your most skilled colleagues.
Some of them probably designed the device you think they're too old to understand
Not really. A lot of people in their 20s might have never actually done much with a computer, yet they cannot put their phone down. I know lots of 20-somethings that cannot type. I know even more that do not own a traditional "computer". It has nothing to do with fear, but lack of need.
I don't think anyone is denying the existence of Greybeards, it's more that the field has exploded so much in the meantime that the probability of a random 30 year old being in it is much higher than the probability of a random 50 year old.
You're surely right that they anticipate it being a novelty that people share during holiday visits.
But as you can probably tell from the other replies, the idea that older people don't know how to use internet-era technology is a meme that was wearing thin 20 years ago already.
People who haven't had ChatGPT "land" for them yet are likely just people who don't find themselves asking a lot of questions they need a chatbot to answer, regardless of the medium. That probably has some age skew right now, but isn't really about the medium at all.
I’m a few years from 50, and while Google has deteriorated, my Google-fu and ability to see signal through the noise still serve me well enough, and that's my comfort zone.
When I dabble with ChatGPT it always feels like I’m playing with a toy, as I don’t really have a use case to bring to it. I’ve used a few website creators and code generators which have been useful, but I also don’t think they saved me much time overall. Web design, graphic design, and creative stuff in general are things I suck at creating, so it gives me a new power and is easy to iterate on. Otherwise, I’ve not found much actual value from it yet.
If it makes you much more efficient in your job, as it does for professional software devs and many HN users, then I think you’re more apt to be excited by the tech.
Aren't they doing a 12 days of Christmas thing where they release new features for 12 days? This would fit into that idea.
I was thinking earlier today that an agent listening to my calls would be helpful. I was on the phone with a financial institution that will require some followup. Being able to sync in an agent to transcribe and remind me would be valuable.
Your comment made me lol. And it’s very rare for that to happen to me via reading text. And I needed it today. So I just wanted to tell you thank you and I hope you have a good day.
Because it shows that it's perfectly plausible for people aged 50-plus to appreciate the value of these technologies every bit as much as us whippersnappers. Some of them are writing books about it, after all.
You seem to have forgotten the context of this conversation. Right now we're talking about whether 50+ somethings can appreciate the value proposition of the 1-800 line and more generally of the whole line of GPT releases presently coming out.
Pointing to the book authorship helps support the intuition that our Gen X friends are able to get it, because, after all, it's not out of the question for them to be involved in these very fields. I don't think any of those points, which again in this context are the points under discussion, hinge on whether or not the book specifically addressed particular LLM methodologies.
I like this a lot. I don't use AI a lot and I often find it annoying, so I don't eg feel the need to install the OpenAI mobile app (which I assume exists). Having ChatGPT in my WhatsApp (I live in a place where WhatsApp is everywhere) is a nice middle ground, lets me occasionally ask it stuff without worrying about accounts and projects and models and all that stuff. Cool!
You're right they did add anonymous access at some point, but it was quite a while ago I think. Smart move on their part. Makes casual use much more convenient.
This is a killer feature for me. In fact, I briefly explored building a semi-self hosted version for myself.
My biggest use case for ChatGPT voice mode is when I _need_ or _want_ to be hands-free. Think working around the house or yard, driving, walking around the grocery store, cooking, etc. I find that I end up using my iPhone's voice-to-text and then simply communicating in text mode (in the case of driving, I stop). After all, once I have to touch my phone, it's just faster to work in text mode.
All of my devices know how to make calls. All of my devices know how to make calls from a voice command. All of my devices know how to hang up a call. This is really nice.
How ironic that it's not actually Apple delivering that despite being in the perfect position to do so (they have a deal with OpenAI for ChatGPT using Siri, have all the contextual knowledge they could ever need etc.) – my iOS 18.2 Siri + ChatGPT experience has been extremely disappointing so far: It seems to completely forget all context between questions, ignores me for follow-up questions 80% of the time etc.
I nerdishly get angry at Siri and Apple's Not Intelligence while driving. With the ChatGPT iPhone app I can have a whole conversation and get things done... Siri on my iPhone 15 Pro running 18.2 is still so frustratingly dumb, a one- (now maybe two-) trick pony compared to ChatGPT's voice mode.
I'm still hoping one of OpenAI's 12-day announcements is that they are creating an AI phone with Microsoft, called GPT, and/or an AI phone OS.
That's cool... I just want an awesome new holy-moly personal device, and I can see a GPT phone being it. OpenAI has that FaceTime-an-AI thing where it sees you and your surroundings and acts like you're talking to a real human.
I want that front and center in my AI phone as my personal assistant. The AI phone's UI would be sparser... there wouldn't be a lot of UI (some app icons, but not as app-icon driven). And the image/video of the AI chatbot personal assistant you look at and talk with could be anything from a celebrity to a deceased relative or loved one (they live on and help you through your day-to-day). There are so many things to innovate on and move forward from the boring iPhone!
Hopefully OpenAI makes an even bigger announcement about getting into the personal device business soon (or later).
OMG! Try calling from Microsoft Teams :D You will end up with, "Thanks for calling Agenta".
Did OpenAI outsource and release this implementation with some of the company's internal phone numbers?
> not business critical to anyone and mostly just a toy
Except it's business-critical to OpenAI, who hopes to look impressive when you call the number.
Instead, some unknown percentage of folks who call will become confused, or think OpenAI is a bit janky. Based on the anecdotes here, the percentage of people who will experience this issue doesn't seem trivial either.
My guess is OpenAI paid a truck load for this 1-800 number and rushed it into "production" for this product launch without waiting for all old routing to be updated.
Maybe. I’m sure it was calculated either way and their mistake to make. Could be that they are working in the background and have a plan to resolve before next week. I don’t think people in general have such high SLA expectations. It’s a minor blip in the grand scheme of things.
Ah, I got the same result calling from Google Fi. Thought this was a weird April Fools joke for a bit. Then on the third call it went through to GPT. Telecom is weird!
Reminds me of 1-800-MY-YAHOO. I remember hiking in a national park in the 90s and calling in from a pay phone and having my email read to me over the phone by a robot. I could record an audio response that was sent back as an attachment. Good times!
I agree. A good chunk of the tech trends in the last decade were indeed rent-seeking, but a silent revolution was happening in transformers and neural network architectures, which made today's products possible.
And I'd wager that there are silent revolutions happening all across the colossus that is the tech industry that will become apparent in the next decade.
Jeff Bezos put it best during his recent interview at the 2024 NYTimes Dealbook Summit, "We're living in multiple golden ages at the same time." There's never been a better time to be alive.
That's easy for a billionaire to say, isn't it? Jeff Bezos is not exactly a reliable narrator here. His business practices are built on exploitation and externalising his costs (such as the massive environmental damage).
I agree about an abundance of apps, but what type of value are LLMs adding?
It can sometimes be useful to input a more "human" search and have something get spit out but 60% of the time it completely lies to you. I'm talking about questions related to web specifications which are public documents. Section numbers, standards names, etc.. will be completely made up.
Off the top of my head, and just for the last couple of months, and only outside of work (where its value is even more immense), it has saved one of my indoor plants, told me how to handle a major boiler problem that would have left us without a working boiler over a weekend in the winter, with the next "emergency" repairman only available on Monday, advised me to use Kopia as a backup solution for my personal files instead of Syncthing, helped me choose the right type of glass for a painting frame, answered a couple of questions about bikes, and helped me when I was stuck in a harmonic analysis of a piece of music. All of that is extremely valuable to me (if only for the time not wasted googling answers), and in none of those cases would its potential hallucinating have been an issue. And I can't count the number of times "specialists" in bike repairs or plumbing have told me something incorrect or outright false, so I've learned to deal with hallucinations already!
> And I can't count the number of times where "specialists" in bike repairs or plumbing told me something incorrect or outright false, so I've learned to deal with hallucinations already!
So much this. So many times I've argued with hired experts saying "can't be done" just to see yes, it can be done.
Yes, but which of those things would you not have resolved just as well 10 years ago? All those possibilities were added by the maturing web itself, as a genuinely novel change from having to source books or experts/friends in the days before.
I'm glad ChatGPT didn't lead you astray, but I'm not seeing what it's added here besides shuffling up the user interface in a way that you presently and subjectively prefer?
> I'm not seeing what it's added here besides shuffling up the user interface in a way that you presently and subjectively prefer?
This. But in the same sense, the past 50 years merely changed the interface from dusty textbooks in libraries to Google Search, the past 100 years gave us dusty textbooks instead of writing to the Royal Society, and that in turn replaced the option of asking the local whisperer or hoping you'd find answers at Sunday mass.
Do not underestimate the power of being able to get an answer to your problem described, visualized, and perhaps complete with interactive demo to explore it further, in time it would previously take you to formulate the right search query that finally gives you relevant information.
EDIT:
And that's on top of all the arbitrary data transformations prior tools couldn't do. E.g. I'm increasingly often using GPT and Claude models to turn photos of (possibly hand-written) notes or posters into iCAL files I can immediately import into our family shared calendar.
Another frequent use case, data normalization. Paste a whole dump of inconsistently structured data multiple people collected (say, addresses of various local businesses that helped a local NGO and now are supposed to get a thank-you card for Christmas). Like, you get 200 rows of addresses in a single column, with spelling mistakes, repetitions, junk at the end, arbitrary capitalization, wrong order of address segments, and such; you need to separate it out into 5+ columns (name line 1, name line 2, street address, zip code, city, etc.) and have it all normalized.
The fastest and most robust way to do it as a one-off job, today, is to paste the whole thing to GPT-4o or Claude 3.5 Sonnet, tell it how the output should look (give one-two examples, mention some mistakes you saw), then send the message and wait 30 seconds for the job to be done for you.
(Yes, it may make mistakes - it didn't for me in recent memory, but it can. But for that, I quickly add an extra verification column for each one in the LLM output, do a simple case-insensitive substring match against the original, and eyeball any data row that shows an error. And guess what, the formulas don't take much time either, since LLMs are good at writing them for you, too!)
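For the curious, here's a minimal Python sketch of the verification idea described above, purely illustrative: flag any normalized row whose fields can't be found (case-insensitively) in the original pasted line. The field layout and example data are made up.

```python
# Illustrative check: does every normalized field appear, case-insensitively,
# in the original raw line? Rows that fail get eyeballed manually.
def row_checks_out(original_line: str, normalized_fields: list[str]) -> bool:
    haystack = original_line.lower()
    return all(field.lower() in haystack for field in normalized_fields if field)

# Hypothetical example row.
original = "acme bakery, 12 main st, 90210 springfield"
normalized = ["Acme Bakery", "12 Main St", "90210", "Springfield"]
print(row_checks_out(original, normalized))  # True -> no need to eyeball this row
```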
My plant would have been dead. As for the rest, sure, I would have resolved them eventually, after many frustrated hours of googling and trial and error.
Time is my most precious thing, I already don't have enough time to do all the things that I want to do, I don't want to waste that trying to find and test solutions when ChatGPT gives me instant answers. I'd rather spend time playing with my cats or riding a bike instead. It's not a matter of UI, it's a matter of preventing waste of time, energy and money, and less frustration. For that alone, €20/month is a very good value. And that's just for my personal life.
"many hours of frustrated googling and trial and error" isn't a familiar experience to me, but I'll trust that it is for you. I'm glad you see that as behind you now with this. I suppose you must not be alone.
I wouldn't discount this effect. As someone with sensory issues, one thing I like about ChatGPT as opposed to the "raw" internet is that I can see the answer to my questions in a nice and calm textual format without some website who created the article specifically to catch my search terms, but is trying to get me to deceptively click on ads or pull me into buying something through their affiliate links. That's absolutely increased my own enjoyment and productivity.
Objectively, it takes less time to ask a question and get a direct answer than it does to search for some words, leaf through a couple of results, find one that has the information you want, and then read that page. If I want to know the height of the Eiffel Tower, being told it's 1,083 feet tall is faster than searching for its website, finding the stats section, then locating that information on the page. Google realizes that, so they pull that info out of the page and just put it on the results page for you.
This is a thin edge of the wedge issue, right? ChatGPT is pretty darn good for most things. I’ve used it extensively for the past 18 months and only in a few cases would I say it “completely lied to me”.
My general rubric is: “would I trust someone on Reddit to correctly guide me on this”. If the answer is “yes” then ChatGPT is likely going to do well. If the volume on a particular subject is low / susceptible to false information then it’ll lie.
Recently it lied hard about how to configure MikroTik routers. I lost many hours. But for a large construction project recently it completely balled out.
Are you doing cutting edge / complicated stuff? Have you examples of where it lies?
No specific prompts, but most were related to the XHR/Fetch specs and behaviors within. It would say "X.Y.Z sections defines this" but that section didn't exist at all and the answer provided was not accurate.
> My general rubric is: “would I trust someone on Reddit to correctly guide me on this”. If the answer is “yes” then ChatGPT is likely going to do well
I see. Well, I don't know if I find that very valuable but if others do, then so be it.
Agreed this is a bad idea in the case you are replying to, but I love ChatGPT as a way to recover the name of a book or film I've forgotten. I recently prompted for "a book about a nuclear wasteland dominated by a church" and it gave me A Canticle for Leibowitz (which is great). I'm not sure how easy that would be any other way.
I wonder how many people are prompting it correctly. You can't just query it like you might Google or something. It works best with lots of context and back-and-forth. And yeah, for many things you are going to get directional answers, not exact ones (especially with "rote memory" tasks like exact quotes from a book or something).
I don't want to turn this into another Claude lies less than ChatGPT subthread but since you mentioned configuration of MikroTik routers I felt like I should.
ChatGPT lies a lot about RouterOS, I don't know why. Claude helped me a lot on the other hand with all things MikroTik.
I find it useful, and it brings value to me (literally: I exchange valuable money for API access), even if it doesn't for you. Many other people report the exact same thing. Just because you don't find value in a technology, doesn't mean that others don't.
In the past week I have used it for helping write a script in a framework I'm not super familiar with (OpenSCAD), I was able to finish a project in 5 minutes that otherwise would have taken me hours. I have used it to help make movie recommendations (none of them were hallucinated). I have used it to translate a conversation with a non-english speaker, etc. There are other tools that can help me do all of these things, but none quite as fast or painlessly.
It might not be useful for your use case of asking questions related to specific web specs, but that doesn't mean that the technology has no value. Horses for courses...
My experience with code completion tools (i.e. single line/method snippets) has been positive. But, anything more complicated seems to fall apart rather quickly.
I have upgraded to the $200 Pro tier, and, with o1-pro, all of my tasks delegated to the "junior" have been so much better. It takes longer to complete, of course, but the overall duration is less because I'm not having to go back and correct it as much as I was with 4o. It's been able to figure out problems that 4o continually failed on.
LLMs have been a personal tutor to me for the last year, able to explain anything and everything I've been curious about professionally and personally. I changed jobs to new technologies in large part because I effectively had an assistant able to help cover any gaps in knowledge I had, train me up quickly, and offer ongoing help on the job.
They can make stuff up, but saying "60% of the time they lie to you" hasn't been true for years.
>They can make stuff up, but saying "60% of the time they lie to you" hasn't been true for years.
If you're using them to fill knowledge gaps, what scaffolding have you set up to ensure that those gaps aren't being filled with incorrect-but-plausible-sounding information?
That's because we're currently largely not using them correctly: they should be hooked up to RAG, instead of us hoping that they've memorized enough of the training data verbatim, which is arguably a waste of neurons in a foundational model.
Imagine being graded on your ability to quote exact line numbers of particular parts of your codebase as a senior software engineer without being able to look at it!
I think when people say things like this, it indicates that they tried LLMs in 2022 and solidified their opinion there.
I had the same impression about hallucinations 2 years ago. The reality is that at the end of 2024, you can get incredible value from LLMs.
I've used copilot to code almost exclusively now for the past few months. Anyone still comparing it to text completion I feel is operating on completely out of date information either intentionally or unintentionally.
I'd (generally) agree. About 5 minutes of using Flux, Claude, or Suno would provide more net new value than I've yet to get out of blockchain, self-driving, gig brokers, the metaverse, 5G, AR/VR, quantum computing, hyperloop, and whatever people were trying to make web3 be, combined, over the years. Not that I think all of these things will perpetually fail to deliver (hell, if I'd had a chance to try Waymo already, self-driving probably wouldn't be on the list); it's just that the hype cycles were unrelated to when that delivery occurred (if ever).
The hard part is, despite actually having some "real" value delivered, you still have to sort through the 99% of bullshit that comes along with it anyways.
I will personally say that if you ever get the chance, definitely try a Waymo. I did recently for the first time and it's a hell of an experience. You can very vividly imagine it being the future.
I'm also going to stand up for AR/VR here. I'm in a long-distance relationship and me and my partner spend an hour or so in VRChat around two to three times a week. The power that has to reduce the badness of an LDR is well well well well worth the three hundred bucks I paid for a Quest. That and some of the golf games on it are fun.
I am super stoked to try a Waymo when I'm in a city with one. Its hype failures have more to do with 10 years of hype about its public availability while still not being available to 99% of the world's population 10 years later. Hype is useless without the result.
I've had an HTC Vive and an Oculus Rift 3 (Walkabout Mini Golf is one I tried!), and while I wouldn't try to argue NOBODY has found a use for it (somebody somewhere found uses for all of the things I mentioned, just not me, and not the majority of people the way big new things are promised to), it never really ticked the "new value" box before they ended up in the closet for me.
That's totally fair. The tech is only barely coming out of the enthusiast adopter phase and there's not a critical mass of content on there to keep most people putting on the headset daily.
That and the ergonomics do still suck, even if I've mostly gotten used to them.
I do think VR will make it, though - starting with the kids. Apparently Gorilla Tag broke 1.5 million players recently, and those are mostly under-15s. The next generation is going to have a strange relationship with computers.
I had in mind the surge in LLM chat support and the surge in thin ChatGPT wrappers with a custom system prompt. Claude/ChatGPT do seem useful, "an AI companion for Microsoft Paint" less so.
And now we will have mediocre middlemen/gig economy brokers with bad customer service performed by AI agents that you can summarize with chatgpt and automatically reply back to. Progress!!
That's only because Zombo had everything. It was the original everything app/site that Musk so desperately wants X to be. Nothing can top that - not even AI.
something tells me all these bells and whistles around gpt are signs that scaling laws have plateaued, otherwise OpenAI et al. would focus more on improving model quality.
Maybe GPT-4 is the 1080p of LLMs: Noticeably better than 720p and 480p models, and not bad enough to warrant additional improvements.
Sure, 4K, 8K, ... are technologically available, but for the majority of use cases, 1080p is enough. Similarly, even though o1 and other models are technically feasible, for most cases the current models are enough.
In fact, GPT-4 is more than enough for 80% of tasks (text summarization, Apple (un)Intelligence, writing emails, tool use, etc.)—small models (<32B) are perfectly fine for those tasks (and they keep getting better too.)
Yes, surely they only have one type of Software Engineer and they all know how to improve model quality.
Alternatively, does it not seem more likely that they have different product groups? Surely the folks working on ChatGPT are an entirely different beast than the folks working in model development?
Yes, surely a sarcastic reductio ad absurdum of what was said will inspire dialogue.
I think the GP's point is that their investing in new distribution channels could mean the ROI on models has diminished significantly. Incidentally, I disagree with the GP that that's what this means -- this is another investment in brand awareness, AND in data for multi-modal/audio. They might have gotten to 1080p for text chat, but definitely not for voice chat.
Nothing more absurd than your response. OpenAI has a large engineering staff; it's foolish to say they are all working on advancing models. The folks working on ChatGPT are going to continue working on ChatGPT. Let's not forget that o1 just got released recently.
Nothing I said was absurd in response to making an unsupported idea that model development has plateaued.
I don’t get this. Define focus and how is just improving model quality gonna allow OpenAI to survive, they need a mix of commercialization and model improvement. No $$, no gpus, no researchers, no improvements
> something tells me all these bells and whistles around gpt are signs that scaling laws have plateaued, otherwise OpenAI et al. would focus more on improving model quality.
o1-pro is that model. Expensive and slow, but significantly better at many tasks that involve CoT reasoning.
o1 is way better than GPT-4 imo. I feel that many people just don't have complicated tasks/questions to deal with in their day-to-day. It's like half the jump from 3.5 to 4, to me.
That's not been my experience, though I guess it depends on what you're using o1 for.
My experience is that o1 is extremely good at producing a series of logical steps for things. Ask it a simple question and it will write you what feels like an entire manual that you never asked for. For the most part I've stopped caring about integrating AI into software, but I could see o1 being good for writing prompts for another LLM. Beyond that, I have a hard time calling it better than GPT-4+.
lots of coding tasks, discussions about physics/QM. I find that it produces better quality answers than 4o, which often will have subtle but simple mistakes.
Even in writing, where it is supposed to be worse than 4o, I feel that it does better / has a more solid understanding of provided documents.
Interesting, could you share an example of this where it provides something of value? I've tried asking a few different LLMs to explain renormalization group theory, and it always goes off the rails in five questions or less.
My guess is that the model got good enough to make its own bells and whistles — even the original 3.5 was good enough to make its own initial chat web UI.
I know it was that good, because I got it to do that for me… and then the UI kept getting better and the expensive models became the free default option and I stopped caring.
Calling it now: Google will ultimately lose this consumer battle. How? By doing what they've always done: building better tech. Gemini will be faster/better and it will have more features; but they will continue to fail to productize or explain it to consumers.
Google's offerings here are still a huge mess. OpenAI is crushing them right now at building products that people want, and making them accessible.
Have you been paying attention at all these past few weeks? Google is crushing it with releases. Gemini 2.0 is great, Veo2 is crushing Sora, live video conversation from aistudio... 12 days of OpenAI turned out to be 12 days of Google.
My respectful counterpoint is that most people aren't paying attention to tech releases at all, ever, unless they go viral like ChatGPT did.
I have very nontechnical coworkers get excited about cool new things ChatGPT can do, but I'm not certain any of them even know we _have_ Gemini in our Google Workspace.
This would hardly be the first time Google has produced innovative technology which eventually fizzles because it never captured much mindshare outside of the tech news circles
Google's recent launches have been technically impressive (especially Veo 2), but given the company's past track record on creating new products, I'm not very bullish that they can turn those launches into products with the same excitement and sense of direction as OpenAI at least appears to have. Google has the benefit of having platforms that span billions of devices and people, but with the looming threat of antitrust regulation, I'm not so sure they'll have the benefit of the last thing for long. Granted, I doubt that 1-800-ChatGPT will be a significant source of users for the product, but it does signal some of the creativity from the company that seems to be escaping Google regularly (see: NotebookLM's leads leaving to form their own startup).
Google search also was good at the beginning. Now it occasionally gives results that contain none of the keywords and have nothing to do with what I searched for.
Seems gimmicky. An audio wrapper on top of ChatGPT accessible by phone... Neither technically impressive nor an improvement to user experience. Sorry for the negativity, I'm trying to remain AI hyped but it's difficult.
I think so, but not a bad marketing gimmick. It gives a pretty easy way for the general public to interact with ChatGPT on a trial basis without signing up or paying for it using a somewhat hard to acquire identifier (phone number). I'm curious if they're doing anything to avoid abuse from spoofed numbers.
I can now ask my phone to call ChatGPT. 100% hands-free. It’s only a few steps less than using the app, but there’s a lot of incremental value to not needing to touch my phone.
Concrete example: I’m driving. I ask Siri a simple question, but it can’t answer it. Previously, if I wanted to use ChatGPT, I’d have to stop, pick up my phone, unlock it, open the app, get my answer, then start driving again. I’d never do that. Now, I can just ask Siri to call ChatGPT.
Those two people fixedly looking at each other while reading from a teleprompter announcing this feature -- likely written by ChatGPT -- is the weirdest thing ever.
I meant no disrespect, but from the 2-minute mark or so, the conversation sounded more natural and things got smooth. Interesting feature, and I liked the '80s banner with the phone # like in the old TV ads!
Sure seemed lifelike. I started right in on asking it about the nature of consciousness and self awareness, and then pointed out that its own behavior matched the majority of the criteria it described while referring contextually to previous statements in the conversation. Then it turned into a seasoned politician, precisely understanding but vaguely answering in handwavey directions. Either that behavior is well-tuned, or it's in one form or another backed by human workers like Musk's humanoid robot theater. If the conversation raises certain flags then you escalate it to a human to preserve the illusion? Not asserting, just speculating from my brief few minutes poking at it
OpenAI is diversifying a lot, and I’m not sure that’s a good thing.
It’s great to ship fast. But you need to maintain things as well. And that requires even more time and engineers and money in the end.
There’ll definitely be projects within OpenAI that will be shut down in a few months, just because they haven’t caught on and/or engineers want to work on something new.
That’s how Google worked in the 2000s - shipping new things fast - but then there was Reader and now they lost everyone’s trust.
> OpenAI is diversifying a lot, and I’m not sure that’s a good thing.
I'm not sure if I'd use term 'diversifying'. At least not in the sense of spreading themselves wider across more projects to reduce overall company risk (if that's what you meant).
I think that we're still very early into AI and because we're still not sure what kind of applications people will want to use in the future, it makes a lot of sense to experiment.
Something interesting (and vaguely concerning) I've noticed when playing around with it: I was wondering how they'd track the "15 minutes per month" usage for blocked caller IDs, and it turns out that they are apparently still able to see the caller's number, or at least distinguish repeated from new callers!
I'm aware that the way "caller ID blocking" works is that it just sets a flag on the call metadata, and it's up to the receiving carrier to observe it and not present caller ID to the callee, but I'm not sure whether bypassing that is a common feature carriers (Twilio in this case) provide to their users. (It's also possible that the only thing Twilio exposes at the API level is a "recurring caller" boolean, of course.)
In any case, even skipping the disclaimer based on having called before seems like a problem: Different people can be using the same phone line at different times. Wouldn't it be required to still read out the disclaimer every time?
OpenAI says in the FAQs that "[t]he knowledge cutoff for 1-800-ChatGPT is Oct 2023". But when I message it on Whatsapp, it says cutoff is "January 2022" and that it's using "GPT-4".
Anyone know (and willing to share) what are they using for PSTN integration?
I've been looking into options for our non-profit tech startup (Ameelio) and about the best pricing I can find is about 1.35 cents per minute. It surprises (and saddens) me that it's still so expensive. I'm sure at a bigger scale you can negotiate better pricing, but based on the quick conversations I've had with vendors it doesn't get significantly cheaper.
[*] limited bandwidth (8 kHz), providing a valuable opportunity to enhance and specialize models for telephony applications, ensuring better performance and user experience even with low-fidelity audio inputs.
I mean, nothing prevents them from running their existing data through a "noisy POTS" filter in A/B tests to see how that impacts customer satisfaction.
But being able to blame the user's phone line probably goes a long way to avoiding unhappiness due to testing :)
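For what it's worth, here's a rough sketch of what such a "noisy POTS" filter could look like, assuming float audio in [-1, 1]: band-limit to roughly telephone bandwidth, downsample to 8 kHz, and apply 8-bit mu-law companding. The parameter choices are standard telephony values, but everything here is an illustration, not anyone's actual pipeline.

```python
import numpy as np
from scipy.signal import butter, sosfilt, resample_poly

def pots_degrade(audio: np.ndarray, sr: int = 16000) -> np.ndarray:
    # Telephone band-pass, roughly 300-3400 Hz.
    sos = butter(4, [300, 3400], btype="bandpass", fs=sr, output="sos")
    band = sosfilt(sos, audio)
    # Downsample to 8 kHz.
    narrow = resample_poly(band, 8000, sr)
    # 8-bit mu-law companding, quantization, and expansion (adds quantization noise).
    mu = 255.0
    compressed = np.sign(narrow) * np.log1p(mu * np.abs(narrow)) / np.log1p(mu)
    quantized = np.round((compressed + 1) * 127.5) / 127.5 - 1
    return np.sign(quantized) * np.expm1(np.abs(quantized) * np.log1p(mu)) / mu

# e.g. degraded = pots_degrade(clean_audio_float32, sr=16000)
```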
I built this with Twilio, STT, TTS, and some glue a while back. Having a phone number to call to chat with GPT-4 was fun, but laggy and error-prone. I primarily used it in the car for hands-free discussions. I look forward to giving this new option a try!
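For anyone curious what that glue can look like, here's a minimal sketch along those lines, not the commenter's actual code: Twilio's speech Gather does the STT, Twilio's Say does the TTS, and an OpenAI chat call sits in between. Route names, the model, and the system prompt are placeholders.

```python
from flask import Flask, request, Response
from twilio.twiml.voice_response import VoiceResponse, Gather
from openai import OpenAI

app = Flask(__name__)
client = OpenAI()  # expects OPENAI_API_KEY in the environment

@app.route("/voice", methods=["POST"])
def voice():
    # Greet the caller and collect speech; Twilio handles the speech-to-text.
    resp = VoiceResponse()
    gather = Gather(input="speech", action="/respond", speech_timeout="auto")
    gather.say("Hi, what would you like to ask?")
    resp.append(gather)
    return Response(str(resp), mimetype="text/xml")

@app.route("/respond", methods=["POST"])
def respond():
    # Twilio posts the transcript as SpeechResult; pass it to the LLM.
    transcript = request.form.get("SpeechResult", "")
    completion = client.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system", "content": "Answer briefly; this will be read aloud."},
            {"role": "user", "content": transcript},
        ],
    )
    answer = completion.choices[0].message.content

    # Read the answer back with Twilio's TTS, then loop for another question.
    resp = VoiceResponse()
    resp.say(answer)
    resp.redirect("/voice")
    return Response(str(resp), mimetype="text/xml")
```

The lag the commenter mentions is easy to believe with this kind of setup: each turn waits on a full STT pass, an LLM completion, and TTS synthesis before the caller hears anything.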
I can't find any information about how to start a new conversation as opposed to continuing an existing one. I asked the service itself and it doesn't know. In fact it doesn't even know it's behind a phone number.
I hope they introduce a way to use Plus plan features/models. Would be neat to do quick queries in WhatsApp and forward results to friends & family without context switching/copy pasting.
It's funny, I asked it what model it is, and it replies:
> am based on OpenAI's GPT-4 model. Specifically, you are interacting with an instance of GPT-4, which is designed to understand and generate human-like text based on the prompts it receives. My responses are influenced by the extensive training on diverse datasets, but I do not have access to real-time data or events beyond my knowledge cutoff in January 2022.
But the linked page suggests knowledge cutoff date is Oct 2023. It hallucinated an answer even to that....
If you write “Zitrain” with only one “t”, it also works in regular ChatGPT. The speech processing likely has a similar effect of not matching the filter.
POTS voice quality is pretty minimal (8 kHz, 8 bit, one channel). I wouldn't be surprised if a model would struggle more to isolate a speaker there vs. a higher fidelity audio channel.
Then again, local noise reduction on modern phones/earbuds probably goes a long way to avoiding that problem.
Counterpoint: humans work best in quieter environments. If we're saying "...what!? It's crazy loud where you are!", you can't really expect AI to be much better.
It does the same as the ChatGPT WhatsApp chat, but you can also forward images to it, it can send you reminder emails in the future, and it can manage todos for you (a kind of memory).
If it had gotten more traction, I would have extended it so you could also forward emails to it and have it respond to the original email as your assistant.
(And hey, if someone from OpenAI is reading this, feel free to offer me a position as a product manager.)
Honestly this seems like a really nice feature. There's been so many times I'm driving and just want to Google something. Used to use Google assistant for that, but I no longer use android auto, so that ended.
chatGPT: ~ You agree to openai terms and conditions...
Me: What's the square root of two?
chatGPT: What number do you want to know the square root of?
Me: Two
chatGPT: The square root of ten is approximately 3.1...
<click>
If they wanted to show how very non-understanding and un-intelligent chatGPT is, they are doing a great job. So much quicker to see in a voice interaction than through online query submissions.
I feel like everyone who has used a lot of AI tools has become accustomed to the LLM yap, but hearing it over TTS is much more annoying than when it's text you can skim through.
The yap factor is there, but they seem to be prompting this phone version to be more brief. I asked a few basic informational trivia questions and each response was three or four sentences. Less than the app or website version, imo.
I did and should have given credit above for the voice, it's very good. I meant to comment on the verbosity of what was being said, not the quality of the TTS itself.
Afghanistan and Turkmenistan are allowed, but not China or Russia. Which makes legal sense, I guess, but did the Taliban takeover just take place too recently for Afghanistan to be placed on the embargo list?
My background isn't AI so I can't contribute to that. My background is WebRTC/telephony so I could build this. Even if I was involved in 'AI stuff' I would have zero impact, but I can build this!
Probably more about how they’re choosing to use resources
If they believe AGI is around the corner and they are competing with others to get there, seems silly to invest resources in standing up a phone line, etc.
(1) https://en.wikipedia.org/wiki/GOOG-411