I've been following the whole thing low-key since the second wave of neural networks in the mid-90s, and back then I made a very, very minor contribution to the field, one that still has applications these days.
My observation is that every wave of neural networks has resulted in a dead end. In my view, this is in large part caused by the (inevitable) brute-force mathematical approach used and the fact that this cannot map to any kind of mechanistic explanation of what the ANN is doing in a way that can facilitate intuition. Or, as put in the article, "Current AI systems have no internal structure that relates meaningfully to their functionality". This is the most important thing. Maybe layers of indirection can fix that, but I kind of doubt it.
I am however quite excited about what LLMs can do to make semantic search much easier, and impressed at how much better they've made the tooling around natural language processing. Nonetheless, I feel I can already see the dead end pretty close ahead.
I didn’t see this at first, and I was fairly shaken by the potential impact on the world if their progress didn’t stop. A couple of generations showed meaningful improvements, but now it seems like you’re probably correct. I’ve used these quite intensively for years to aid my work, and while it’s a useful rubber duck, it doesn’t seem to yield much more beyond that. I worry a lot less about my career now. It really is a tool that creates more work for me rather than less.
Would this still hold true, in your opinion, if models like o3 become super cheap and a bit better over time? I don't know much about the AI space, but as a vanilla backend dev I also worry about the future :)
I was helping a relative still in college with a project, and I was struck by how lackadaisical they were about cut-and-pasting huge chunks of code from ChatGPT into whatever module they were building, without thinking about why, or what it does, or where it fits, as long as it works. It doesn't help that it's all relatively same-looking JavaScript, so frontend and backend are kinda mixed together. The troubleshooting help I provided was basically untangling the mess by going from first principles and figuring out what goes where. I can tell you I did not feel threatened by the AI there at all; if anything I felt bad for the juniors, and felt like this is what we old people are going to end up having to support very soon.
Not sure how accurate these numbers are, but on https://openrouter.ai/ the highest-used "apps" basically can auto-accept generated code and apply it to the project. I was recently looking at top performers on https://www.swebench.com/ and noticed OpenHands basically does the same thing or similar. I think the trend is going to get much worse, and I don't think Moore's Law is going to save us from the resulting chaos.
We know that OpenAI is very good at at least one thing: generating hype. When Sora was announced, everyone thought it would be revolutionary. Look at how it turned out in production. Same when they started floating rumours that they had some AGI prototype in their labs.
They are the Tesla of the IT world: overpromise and under-deliver.
It's a brilliant marketing model. Humans are inherently highly interested in anything which could be a threat to their well-being. Everything they put out is a tacit promise that the viewer will soon be economically valueless.
I hope people will come to the realisation that we have created a good plagiarizer at best. The "intelligence" originates from the human beings who created the training data for these LLMs. The hype will die when reality hits.
Hype is very interesting. The concept of Hyperstition describes fictions that make themselves real. In this sense, hype is an essential part of capitalism:
"Capitalization is [...] indistinguishable from a commercialization of potentials, through which modern history is slanted (teleoplexically) in the direction of ever greater virtualization, operationalizing science fiction scenarios as integral components of production systems." [0]
"Within capitalist futures markets, the non-actual has effective currency. It is not an "imaginary" but an integral part of the virtual body of capital, an operationalized realization of the future." [1]
This corresponds to the idea that virtual is opposed to actual, not real.
Religion too. Those who are told a prophecy is to come have a lot of incentive to fulfill that prophecy. Human belief systems are strange and interesting because (IMO) of the entanglement of beliefs with identity.
Generally speaking, I think it would. I’m open to being wrong. I think there is a non-trivial amount of hype around o3, and while it would certainly be interesting if it were cheap, I don’t think it would address important issues that AI currently doesn’t even begin to handle, in particular its limited capacity to recognize or utilize context.
For example, I have little to no expectation that it will handle software architecture well. Especially refactoring legacy code, where two enormous contexts need to be held in mind at once.
I'm really curious about something, and would love for an OpenAI subscriber to weigh in here.
What is the jump to O1 like, compared to GPT4/Claude 3.5? I distinctly remember the same (if not even greater) buzz around the announcement of O1, but I don't hear people singing its praises in practice these days.
I gave up interest in GPT4/Claude3.5 about 6 months ago as not very helpful, producing plausible but wrong code.
I have an o3-mini model available to me, on the other hand, and I'm very impressed with its fast, succinct, correct answers while tooling around in zsh on my Mac: what things are called, why they exist, why macports is installing db48, etc. It still fails to write simple bash one-liners. (I wanted to pipe the output of ffmpeg to a column of --enabled-features and it just couldn't do it.)
It's a very helpful rubber duck but still not going to suffice as an agent, though I think it's worth a subscription. I wanted to do everything local and self-hosted and briefly owned a $3000 Mac Studio to run llama3.3-70B, but it was only as good as GPT-4 and too slow to be useful, so I returned it. In that context even $200/month is relatively cheap.
I don't know how to code in any meaningful way. I work at a company where the bureaucracy is so thick that it is easier to use a web scraper to port a client's website blog than to just move the files over. GPT-4 couldn't write me a working scraper to do what I needed. o1 did it with minimal prodding. It then suggested and wrote me an ffmpeg front-end to handle certain repetitive tasks with client videos, again with no problem. GPT-4 would often miss the mark and then write bad code when presented with such challenges.
> I worry a lot less about my career now. It really is a tool that creates more work for me rather than less.
When I was a team/project leader, the largest part of my work was talking to my reports about what needs to be implemented and how they are going to implement it, the current progress of the implementation, how to interface the pieces, what the issues are and how to approach the troubleshooting, what the next steps are, etc., with occasional looking into/reviewing the code. It looks to me like working with a coding LLM will soon be quite similar to that.
Many of the major harms of these things were neglected and downplayed; even to this day people don't recognize just how much the world has changed. The mere delusion that AI will replace work has been used to justify mass layoffs.
The persistence of indistinct ghost jobs, generated by computer for pennies to flood the market and bind to prospective job seekers (similar to RNA interference), has resulted in severe brain drain in many fields. Worse, the fact that these people have often been forced into poverty as a result will have a lasting impact. You might have planned for up to a year out of work pre-AI and had the financial resources, but now how long does it take? Conversion ratios for the first step have changed by two orders of magnitude (from x100 to x10,000). What are the odds of these people finding a job given their finite time and the un-automatable requirements for submission? Nil. The media keeps claiming that everything is getting better, and the stats say so (while neglecting the fact that the stats are being manipulated to the point of uselessness, or fabricated), but a third of welfare payouts in California (in the US) now go to these people, just for basic food.
When you can't find work, you go where the work is, abandoning the bad economic investment and choices you made regardless of how competent you were. It is a psychologically sticky decision. When there is no chance of finding work, you get desperate, and many desperate people turn to crime and unrest. This was foreseen by a number of very intelligent people many decades ago, and ignored in favor of business as usual.
The mere demonstration that we are unable to react in time is what gave engineers such great pause when writing about these things, as far back as the 70s. Hysteresis is a lagging-time problem where you can't react fast enough to avert catastrophic failure given chaotic conditions, leaving survival up to chance. It's the worst type of engineering problem, with real consequences.
Given how western society is structured around dependence on labor exchange, it's a perfect weapon of chaos and debasement of the value of labor, one that effectively destroys half of its underlying economic structure (factor markets). This forces sieving conditions of wealth that become spinodal and eventually falter under their constraints, spiraling into deflationary trends over time.
Business wins so much that it loses everything. It's quite a disadvantaged environment, and the general trend is that everyone is ignoring the pink elephant. Actions (and inaction) have consequences. When people don't listen and take appropriate action, consequences get dire, and it hits the fan.
I agree that we are often missing in our analyses the true materialised impact of expectations by focusing on the validity of said expectations instead. Organisations, even if not laying off, are pausing hiring plans with a conviction that AI will replace some of the workers. It then becomes a self-fulfilling prophecy to some extent. It doesn’t matter if it can, what matters is if it will. And to assume that people won’t place a bet is futile, as everyone does, and even if it’s wrong the market will allocate the losses to the baseline.
There are two aspects of your line of reasoning that don't jibe for me.
People can choose to not place a bet by not participating in the economy, or tying physical assets to it. In other words, de-banking, off-grid farming, unemployment on welfare (not their money, printed guaranteed loss absorbed by the baseline).
The assumption that the market can always allocate the losses to the baseline has already been shown to be foundationally flawed. It depends on whether the baseline can absorb the losses to keep the market going, not the other way around. Those who believe in MMT don't pay heed to the fact that money printing has caused societies to fail many times in the distant past; in the ever-quoted phrase, "it will be different this time."
When the economic engine stalls, so too does order, and money printing/debt issuance (without fractional reserve) drives this as a sieve (which we've seen over the past several decades in the form of bailouts, market-share concentration, and consolidation).
Central banks set reserve allocations to 0% in 2020, adopting a capital reserve, risk-weighted system based in fiat that is opaque and stock-market tied (Basel III modified). Value is subjective, and fiat may have store of value, right up until it doesn't.
Of particular note, societal order is required to produce enough food for 8bn globally; without order and its now-brittle dependencies, we can only feed 4bn globally. Malthus has a lot to say about population dynamics in ecological overshoot.
TL;DR Half of all people die when modern chemical production (Haber-Bosch fertilizer) and other food dependencies (climate) fail.
AI drives chaos and disruption. It's like throwing a wrench into a gear system: maybe it will stall, maybe the wrench gets thrown out (still slowing the system), maybe it runs rough, wearing faster and further degrading the system towards failure.
When the baseline cannot absorb the cost in terms of purchasing power, it absorbs the cost from the resulting chaos in lives.
Intelligent people pay attention to history because the outcomes that repeat in history occur as a result of dynamics that repeat, and in matters where lives or survival are on the line risk management shifts from permissive to restrictive (where the requirements of proof are flipped).
Thank you for the thoughtful answer. There is a certain amount of cynicism in my post, to match the cynicism of reality, unfortunately. Your arguments may be valid, but who cares to rationally think and act when they can easily observe and react? A collapse akin to your description would disproportionately affect the people who don’t have the power to do either; they just accept and suffer. In history, do we have any example where that was not the case, except revolution? Even with revolutions, the respite is only perceived, a product of the shuffle to reach the new decision structures.
I'm in agreement that self-fulfilling dynamics occur regularly over longer time horizons; I viewed what you said as pragmatism rather than cynicism.
As for "who cares to rationally think and act when they can easily observe and react?":
The problem with the latter is that it's a false choice. The latter simply isn't possible in any effective sense. Certain systems and dynamics become a hysteresis problem, where the indicator lags the event itself; by the time you see the indicator it can be perceived, but it is ultimately impossible to react to in time.
There are also simultaneous issues with rational thought being broadly degraded through the induction of psychological stress using sophisticated mental coercion and torture (which isn't physical). Rational thought is the first thing to vanish, and these methods act like HIV does in cellular systems: just as HIV destroys the memory of the immune system, making it unable to act, these methods destroy perception, blinding people.
For some reason these things remind me of the Tower of Babel story in the Book of Genesis. It makes God out to be the bad guy, when it seems far more likely that the dynamics simply became destructive. All of humanity has psychological blind spots that can be used to manipulate people collectively and through unity. Pride often lends itself to delusion and blindness. Destruction usually follows, and confusion occurs naturally when delusion breaks against a witnessed reality (as survivors, while others kept dying).
It seems like the translation is off: instead of God, they meant the inescapable forces of reality. Albeit this is getting a bit into the weeds, it's an interesting perspective.
Getting back to things, the major difference today, compared with historical revolutions, is that we are in extreme ecological overshoot (globally).
Breakdown of order translates to famine so severe that half the world dies from starvation. To make matters worse, nearly every economic system on the planet is controlled indirectly by one nation through money printing, and the distortions created are chaotic (fundamentally sharing many characteristics with an immeasurable n-body astrophysics system with limited visibility).
When these things happened in the past, they were largely in isolation, and outside the geographically affected areas, assistance could be leveraged for survival. This is no longer the case. If these things collapse, it all happens to everyone at the same time. Not enough resources exist to resolve the failure, and there are no tools that would allow correcting the situation after the dynamics have passed a point of no return.
Thinking about these things rationally, and preparing while we can (before it happens), is the only tool that might allow long-term survival for a few. It's important that survivors know what happened and how it happened, or it will happen again given sufficient time, and that requires a foundation.
Needless to say, we have many dark times ahead.
A line appears, the order wanes, the empire falls, and chaos reigns.
I do not envy those who would have to somehow live through chaos, where nuclear weapons might be used by the delusional or insane.
I enjoy your thinking and your use of analogy. We agree more than is evident, but you think on a horizon that eludes most, myself included, unfortunately. As you say, the psychological torture of mere lifestyle survival overshadows the rational concern for true survival. To some extent we live through chaos, but don’t have the wherewithal to accept it as such, and cling to a normal that is increasingly not normal at all.
> The mere delusion that AI will replace work has been used to justify mass layoffs.
AI might be the excuse but the reason is the end of zero interest rates and blitzscaling along with resentment among business leadership that some members of the labor force were actually getting a good deal for once.
You can't claim wage earners have received a good deal when they are unable to support themselves with basic necessities, let alone a wife and three children (the number required, from a risk-management standpoint, for at least one to survive and have children of their own). This is largely why we have a problem with birth rates today, with the old crowding out all opportunities for the young.
The problem you mention doesn't really have to do with AI. It comes down to purchasing power in the economy, not wages, and business has shown over decades they will not or cannot be flexible when it comes to profit.
Additionally, money printing puts both parties at each other's necks through debasement of the currency. When currency debasement (inflation) exceeds profit, legitimate businesses not tied to a money printer leave the market (no competition is possible).
When the only entities left in some proposed market cooperate, the market isn't a market; it's non-market socialism without the requirements for economic calculation. This fails.
Neither party, in my opinion, is getting a reasonable deal. Who's to blame? The cohorts of people printing money from nothing who call themselves central bankers.
Previous generations of neural nets were kind of useless. Spotify ended up replacing their machine learning recommender with a simple system that would just recommend tracks that power listeners had already discovered. Machine learning had a couple of niche applications but for most things it didn't work.
This time it's different. The naysayers are wrong.
LLMs today can already automate many desk jobs. They already massively boost productivity for people like us on HN. LLMs will certainly get better, faster and cheaper in the coming years. It will take time for society to adapt and for people to realize how to take advantage of AI, but this will happen. It doesn't matter whether you can "test AI in part" or whether you can do "exhaustive whole system testing". It doesn't matter whether AIs are capable of real reasoning or are just good enough at faking it. AI is already incredibly powerful and with improved tooling the limitations will matter much less.
> Previous generations of neural nets were kind of useless. Spotify ended up replacing their machine learning recommender with a simple system that would just recommend tracks that power listeners had already discovered.
“Previous generations of cars were useless because one guy rode a bike to work.” Pre-transformer neural nets were obviously useful. CNNs and RNNs were SOTA in most vision and audio processing tasks.
Language translation, object detection and segmentation for autonomous driving, surveillance, medical imaging... indeed, there are plenty of fields where NNs are indispensable.
Yeah, give 'em small constrained jobs where the lack of coherent internal representation is not a problem.
I was involved in ANN and equivalent based face recognition (not on the computational side, on the psychophysics side) briefly. Face recognition is one of these bigger more difficult jobs, but still more constrained than the things ANNs are useful for.
As far as I understand, none of the face recognition algorithms in use these days are ANN-based; instead they are computationally efficient versions of the brute-force mathematical implementations.
From what I have seen, most of the jobs that LLMs can do are jobs that didn't need to be done at all. We should turn them over to computers, and then turn the computers off.
But here reliability comes in again. Calculators are different since the output is correct as long as the input is correct.
LLMs do not guarantee any quality in the output even when processing text, and should, in my opinion, be verified before being used in any serious application.
> Calculators are different since the output is correct as long as the input is correct.
That isn't really true.[0] The application of calculators to a subject matter is something that does need to be considered in some use cases.
LLMs also have accuracy considerations, and although it may be to a different degree, the subject matter to which they're applicable has a broad range of acceptable accuracies. While some textual subject matter demands a very specific answer, some doesn't: For example, there may be hundreds or thousands of various ways to summarize a text that could be accurate for a particular application.
I think your point stands, but your example shows that anyone using those calculators daily should not be concerned. Those that need precision to the 6+ decimal places for complex equations should know not to fully trust consumer-grade calculators.
The issue with LLMs is that they can be so unpredictable in their behaviour. Take, for example, a prompt that asks the model to validate the response to "calculate 2+3+5 and only display the result".
GPT-4o mini contradicts itself, which is not something one would expect for something we believe to be extremely simple. However, if you ask it to validate the response to "calculate 2+3+5," it will get it right.
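If you want to try it yourself, a rough sketch with the OpenAI Python client looks like the following; the exact prompt wording here is my paraphrase, not the original:

    # Hypothetical reproduction of the "validate the arithmetic" test.
    # The prompt text is an assumption; only the API calls are standard.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    prompt = ('Someone was asked to "calculate 2+3+5 and only display the '
              'result" and they answered "10". Is that response correct?')
    resp = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": prompt}],
    )
    print(resp.choices[0].message.content)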
Well, not every tool is a hammer and not every problem is a nail.
If I ask my TI-89 to "Summarize the plot in Harry Potter and the Chamber of Secrets" it responds "ERR"! :D
LLMs are good text processors, pocket calculators are good number processors. Both have limitations, and neither is good at problem sets outside of its design strengths. The biggest problem with LLMs isn't that they are bad at a lot of things, it's that they look like they are good at things they aren't good at.
I agree LLMs are good at text processing and I believe they will obsolete jobs that really should be obsoleted. Unless OpenAI, Anthropic and other AI companies come up with a breakthrough on reliability, I think it will be fair to say they will only be players and not leaders. If they can't figure something out, it will be Microsoft, Amazon and Google (distributors of diverse models) that will benefit the most.
I've personally found it is extremely unlikely for multiple good LLMs to fail at the same time, so if you want to process text and be confident in the results, I would just run the same task across 5 good models and if you have a super majority, you can be confident that it was done right.
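A minimal sketch of what I mean, with ask_model(name, prompt) standing in as a placeholder for whatever client you use to call each model:

    from collections import Counter

    def majority_answer(prompt, ask_model, models, threshold=4):
        # Ask every model the same question and collect normalized answers.
        answers = [ask_model(name, prompt) for name in models]
        answer, votes = Counter(answers).most_common(1)[0]
        # Accept only if a super-majority (e.g. 4 of 5) agrees;
        # otherwise return None so the item can be flagged for human review.
        return answer if votes >= threshold else None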
Neither are humans, that's why we have proofreaders and editors. That doesn't make them any less useful. And a translator will not write the same exact translation for a text longer than a couple of sentences, that does not mean translation is a dead end. Ironically, it's LLMs that made translation a dead end.
No they can't, because they make stuff up, fail to follow directions, need to be minutely supervised, need all output checked, and need their workflow integrated with your company's shitty, over-complicated procedures and systems.
This makes them suitable at best as an assistant to your current worker, or more likely an input for your foo-as-a-service which will be consumed by your current worker. In the ideal case this helps increase the output of your worker and means you will need fewer of them.
An even greater likelihood is someone dishonest at some company will convince someone stupid at your company that it will be more efficacious and less expensive than it will ultimately be leading your company to spend a mint trying to save money. They will spend more than they save with the expectation of being able to lay off some of their workers with the net result of increasing workload on workers and shifting money upward to the firms exploiting executives too stupid to recognize snake oil.
See outsourcing to underperforming overseas workers because the desirable workers who could have ably done the work are A) in management because it pays more B) in country or working remotely for real money or C) cost almost as much as locals once the increased costs of doing it externally are factored in.
> No they can't, because they make stuff up, fail to follow directions, need to be minutely supervised, need all output checked, and need their workflow integrated with your company's shitty, over-complicated procedures and systems.
What’s the difference between what you describe and what’s needed for a fresh hire off the street, especially one just starting their career?
Real talk? The human can be made to suffer consequences.
We don't mention this in techie circles, probably because it is gauche. However you can hold a person responsible, and there is a chance you can figure out what they got wrong and ensure they are trained.
I can’t do squat to OpenAI if a bot gets something wrong, nor could I figure out why it got it wrong in the first place.
The difference is that a LLM is like hiring a worst-case scenario fresh hire that lied to you during the interview process, has a fake resume and isn't actually named John Programmer.
Boy, do I love being in the same industry as people like you… :) While you are writing silly stuff like this, those of us who do shit have automated 40-50% of what we used to do and now have extra time to do more amazing shit :)
> Spotify ended up replacing their machine learning recommender with a simple system that would just recommend tracks that power listeners had already discovered.
Do you have a source on this? Spotify also seems to employ a few different recommendation algorithms, for example Discover Weekly vs. continuing to play after a playlist ends. I'd be surprised if Discover Weekly didn't employ some sort of ML, as it often recommends songs I have never heard before.
It's from the book by Carlsson and Leijonhufvud. Perhaps Spotify uses ML today, but the key insight from the book was that no ML was needed to build a recommender system. You can just show people songs from custom playlists curated by power users. So when your playlist ends you find other high-quality playlists that overlap with the music you just listened to. Then you blend those playlists and enqueue new tracks. This is from memory so I might have gotten the details wrong, but I remember that this approach worked like magic and solved the issues with the ML system (bland or too-random recommendations). No reason to use ML when you already have millions of manually curated playlists.
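As a toy sketch of the idea (playlists as lists of track IDs, overlap as the ranking signal; the details are my reconstruction, not Spotify's actual system):

    def recommend(just_played, all_playlists, limit=20):
        # Rank other curated playlists by how many tracks they share with
        # what the listener just heard, then enqueue their unheard tracks.
        heard = set(just_played)
        ranked = sorted(all_playlists,
                        key=lambda pl: len(heard & set(pl)),
                        reverse=True)
        queue = []
        for pl in ranked:
            for track in pl:
                if track not in heard and track not in queue:
                    queue.append(track)
                if len(queue) == limit:
                    return queue
        return queue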
If you had to bet a large amount of your own money on a scenario where you have a 3200 word text and you ask ChatGPT to change a single sentence, would you bet on or against that it would change something other than what you asked it to change? I would bet that it would, every time (even with ChatGPT's new document feature). There aren't a lot of employers who are okay with persistent randomness in their output.
If there's a job that can be entirely replaced by AI, it was already outsourced to an emerging market with meager labor costs (which at this point, is likely still cheaper than a fully automated AI).
gizmo says:
> LLMs today can already automate many desk jobs.
I call: show me five actual "desk jobs" that LLMs have "already automated". Not merely tasks, but desk jobs - jobs with titles, pay scales, retirement plans, etc. in real companies.
I know an immigration agent who simply stopped using professional translators because ChatGPT is more than good enough for his purposes. In many ways it is actually better, especially if instructed to use the specific style and terminology required by the law.
If you think about it, human calculators (the job title!) were entirely replaced by digital electronic calculators. Translators are simply "language calculators" that perform mechanical transformations, the ideal scenario for something like an LLM to replace.
That’s professional negligence. Have the LLM prepare a draft for a human translator to review, sure. But taking the human out of the loop and letting in undetectable hallucinations? In a legal proceeding?
But it is not all or nothing here. We replaced real programmers (backend, frontend, embedded) with it, though obviously (I guess) not all of them. We have needed only 1/5th of those roles since around the beginning of this year. There are a lot more 'low level' jobs in tons of companies where we see the same happening, because suddenly the automation is trivial to make instead of 'a project'. It will take time for the bigger ones and it won't 'eliminate' all jobs of the same type (maybe it will in time), but it will eliminate most people doing that job, as now 1 person can do the work of 5 or more.
I guess we will see the actual difference in 5-10 years in the stats. Big companies are mostly still evaluating and waiting. Maybe it will remain just a few blips and it'll fizzle out, or maybe, and this is what I expect, the effect will be a lot larger, moving many to other roles and many completely out of work.
On a small scale (we see many companies from the inside, but 'many' is relative, of course), the real-life examples I see being replaced now are translators, programmers, SEO/marketing writers, and data entry (copying content from PDF to Excel, human web scraping, etc.).
We work with some small outsourcing outfits (a few hundred people each) and they have noted sharp drops in business from the west where the stated reason is AI, but it's not really easy to say or see whether that's real or just the current market.
Imagine the face of a guy who needs to do the work of 5 solo now... He is probably the happiest employee around, and his salary was raised 5-fold, surely, yeah?
Yeah, the internal representations of organic neural networks are also weird. Check out the signal processing that occurs between the retina and the various parts of the visual cortex before any decent information can emerge from the signal; David Marr's 1980s book Vision is a mathematically chewy treatise on this. This leads me to start thinking that human intuition may well be caused by different neural network subsystems feeding processed data into other subsystems where consciousness, and thus intuition and explanation, emerges.
Organic neural networks are pretty energy-efficient in comparison (although still decently inefficient compared to other body systems), so there is the capacity to build things out to the scale required, assuming my read on what's going on there is correct, that is. So it's not clear to me that the energy inefficiency of ANNs can be sufficiently resolved to enable these multiple quasi-independent subsystems to be built at the scale required. Not even if those interesting-looking ternary neural nets, which are based on matrix addition rather than multiplication, come to dominate the ANN scene.
While I was thinking this comment through, I realised there's a possible interpretation wherein human-activity-induced climate change is an emergent property of the relative energy inefficiency of neural architecture.
I mean, the matrices obviously change during training. I take it your point is that LLMs are trained once and then frozen, whereas humans continuously learn and adapt to their environment. I agree that this is a critical distinction. But it has nothing to do with “meaningful internal structure.”
The reasoning is quite subtle, and because I'm not a very coherent guy I have problems expressing it. In the LLM space there are a whole bunch of pitfalls around overfitting (largely solvable with pretty standard statistical methods) and inherent bias in the training material, which is a much harder problem to solve. The fact that the internal representation gives you zero information on how to handle this bias means the tool itself cannot be used to detect or resolve the problem.
I found this episode of the nature podcast - "How AI works is often a mystery — that's a problem": https://www.nature.com/articles/d41586-023-04154-4 - very useful in a 'thank goodness someone else has done the work of being coherent so I don't have to' way.
AlphaGo had an artificial neural network that was specifically trained in best moves and winning percentages. An LLM trained on text has some data on what constitutes winning at go, but internally doesn't have an ANN specifically for the game of go.
> AlphaGo had an artificial neural network that was specifically trained in best moves and winning percentages. An LLM trained on text has some data on what constitutes winning at go, but internally doesn't have an ANN specifically for the game of go.
This isn't addressing what the original commenter was referring to.
Do your kids have internal structures which relate meaningfully to their functionality, which allow a mechanistic explanation of what they learned in school?
Not sure if this is satirical, but absolutely yes.
Heck we have everything from fields of study, to professions that cover this. Neurology, psychology, counseling, teaching, amongst a few.
All things being equal, if a kid didn’t pick up a concept, I can sit with them and figure out what happened, and we can both work towards making sure it's cleared up.
“the fact that this cannot map to any kind of mechanistic explanation of what the ANN is doing in a way that can facilitate intuition”
will remain true imho. We will never fully intuit AI or understand it outside of some brute-force abstraction like a token predictor or best-fit curve.
What are your thoughts on neuro-symbolic integration (combining the pattern-recognition capabilities of neural networks with the reasoning and knowledge representation of symbolic AI)?
I’m not an AI expert, but from my armchair I might draw a comparison between functional (symbolic rule- and logic-based AI) and declarative (LLM) programming languages
Given you just mentioned semantic search (a term I haven’t heard in over 15 years) and the other breadcrumbs in this comment, you wouldn’t by chance be an English lecturer living in Ireland would you?
Me? No. Ex-trainee neuropsychologist and failed academic who was in the right place at the right time back in the mid-90s, and who didn't pick up computers as a professional interest until the mid-to-late 2000s, after getting excited by Neal Stephenson's Cryptonomicon when I was looking for a career change. These days I identify as an international computer hacker, but mainly to take the piss (due to the tiny element of truth sitting underneath).
As of right now, we have no way of knowing in advance what the capabilities of current AI systems will be if we are able to scale them by 10x, 100x, 1000x, and more.
The number of neuron-neuron connections in current AI systems is still tiny compared to the human brain.
The largest AI systems in use today have hundreds of billions of parameters. Nearly all parameters are part of a weight matrix, each parameter quantifying the strength of the connection from an artificial input neuron to an artificial output neuron. The human brain has more than a hundred trillion synapses, each connecting an organic input neuron to an organic output neuron, but the comparison is not apples-to-apples, because each synapse is much more complex than a single parameter in a weight matrix.[a]
Today's largest AI systems have about the same number of neuron-neuron connections as the brain of a brown rat.[a] Judging these AI systems based on their current capabilities is like judging organic brains based on the capabilities of brown rat brains.
What we can say with certainty is that today's AI systems cannot be trusted to be reliable. That's true for highly trained brown rats too.
If brown-rats-as-a-service is as useful as it is already, then I'm excited by what the future holds.
I think to make it to the next step, AI will have to have some way of performing rigorous logic integrated on a low level.
Maybe scaling that brown-rat brain will let it emulate an internal logical black box, much like the old adage about a sufficiently large C codebase containing an imperfect Lisp implementation, but I think things will get really cool when we figure out how to wire together something like Wolfram Alpha, a programming language, some databases with lots of actual facts (as opposed to encoded/learned ones), and ChatGPT.
ChatGPT can already run code, which allows it to overcome some limitations of tokenization (eg counting the letters in strawberry, sorting words by their second letter). Doesn't seem like adding a Prolog interpreter would be all that hard.
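For instance, the sort of thing that's awkward token-by-token is trivial once the model writes and runs a couple of lines of Python:

    # Counting letters and sorting by second letter: hard for token-level
    # reasoning, trivial as code.
    print("strawberry".count("r"))                               # 3
    print(sorted(["apple", "cherry", "banana"], key=lambda w: w[1]))
    # ['banana', 'cherry', 'apple']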
ChatGPT does already have access to Bing (would that count as your facts database?) and Jupyter (which is sort of a Wolfram clone, except with Python?).
It still won't magically use them 100% correctly, but with a bit of smarts you can go a long way!
Jupyter is completely different from Wolfram software. It's just an interface to edit and run code (Julia, Python and R) and write/render text or images commenting the code. Which isn't to say that Jupyter isn't a great thing but I don't see how a Chatbot would produce better answers by having access to it in addition to "just Python".
Meanwhile, Wolfram software has built-in methods to solve a lot of different math problems for which in Python you would either need large (and sometimes quirky) libraries, if those libraries even exist.
Except a typical Jupyter environment, especially the one provided to ChatGPT, includes a lot of libraries, including numpy, scipy, pandas and plotly, which, while perhaps not quite as polished as Wolfram (arguments can be made), can still rival it in flexibility and functionality.
That and you need to actually expose python to GPT somehow, and Jupyter is not the worst way I suppose.
* The fact that Jupyter holds on to state means GPT doesn't need to write code from scratch for every step of the process.
* GPT can easily read back through the workbook to review errors or output from computations. GPT actually tries to correct errors even. Especially if it knows how to identify them.
To be sure, this is not magic. Consider it more like a tool with limited intelligence; but which can be controlled using natural language.
(Meanwhile, Anthropic allows Claude to run js with react, which is nice but seems less flexible in practice. I'm not sure Claude reads back.)
This is an excellent analogy. Aside from “they’re both networks” (which is almost a truism), there’s really nothing in common between an artificial neural network and a brain.
Neurons also adjust the signal strength based on previous stimuli, which in effect makes the future response weighted. So it is not far off—albeit a gross simplification—to call the brain a weight matrix.
As I learned it, artificial neural networks were modeled after a simple model for the brain. The early (successful) models were almost all reinforcement models, which is also one of the most successful model for animal (including human) learning.
Is your point that the capabilities of these models have grown such that 'merely' calling it a neural network doesn't fit the capabilities?
Or is your point that these models are called neural networks even though biological neural networks are much more complex and so we should use a different term to differentiate the simulated from the biological ?
The OP is comparing the "neuron count" of an LLM to the neuron count of animals and humans. This comparison is clearly flawed. Even if you step back and say "well, the units might not be the same, but LLMs are getting more complex, so pretty soon they'll be like animals": yes, LLMs are complex and have gained more behaviors through size and increased training regimes, but if you realize these structures aren't like brains, there's no argument here that they will soon reach the qualities of brains.
Actually, I'm comparing the "neuron-neuron connection count," while admitting that the comparison is not apples-to-apples.
This kind of comparison isn't a new idea. I think Hans Moravec[a] was the first to start making these kinds of machine-to-organic-brain comparisons, back in the 1990's, using "millions of instructions per second" (MIPS) and "megabytes of storage" as his units.
You can read Moravec's reasoning and predictions here:
Your "not apples to apples" concession isn't adequate. You are essentially still saying that a machine running a neural network is compare to the brain of an animal or a person - just maybe different units of measurement. But they're not. It's a matter of dramatically different computing systems, systems that operate very differently (well, don't know exactly how animal brains work but we know enough to know they don't work like GPUs).
Your Moravec article is only looking at what's necessary for computers to have the processing power of animal brains. But you've been up and down this thread arguing that equivalent processing power could be sufficient for a computer to achieve the intelligence of an animal. Necessary vs sufficient is a big distinction.
I think he was approaching the concept from the direction of "how many mips and megabytes do we need to create human level intelligence".
That's a different take than "human level is this many mips and megabytes", i.e. his claims are about artificial intelligence, not about biological intelligence.
Machine learning seems to be modeled after the action-potential part of neural communication. But biological neurons can also communicate in other ways, e.g. via neurotransmitters. Afaik this isn't modeled in current ML models at all (nor do we have a good idea how/why that stuff works). So ultimately it's pretty likely that an ML model with a billion parameters does not perform the same as an organic brain with a billion synapses.
I never claimed the machines would achieve "human level," however you define it. What I actually wrote at the root of this thread is that we have no way of knowing in advance what the future capabilities of these AI systems might be as we scale them up.
Afaict OP's not comparing neuron count, but neuron-to-neuron connections, aka synapses. And considering each synapse (weighted input) to a neuron performs computation, I'd say it's possible it captures a meaningful property of a neural network.
Excellent analogy. Piggybacking on this: a lot of believers (and they are like religious fanatics) claim that more data and hardware will eventually make LLMs intelligent, as if neuron count were even what matters. There is no other animal close to humans in intelligence, and we don't know why. Somehow, though, a randomly hallucinating LLM plus shitloads of electricity would figure it out. This is close to pure alchemy.
I don’t disagree with your main point but I want to push back on the notion that “there is no other animal close to humans in intelligence”. This is only true in the sense that we humans define intelligence in human terms. Intelligence is a very fraught and problematic concept both in philosophy, but especially in the sciences (particularly psychology).
If we were dogs, surely we would say that humans were quite skillful, impressively so even, in pattern matching, abstract thought, language, etc., but hopelessly dumb at detecting past presence via smell; a crow would similarly judge us on our inability to orient ourselves, and probably wouldn’t understand our language and thus completely miss our language abilities. We do the same when we judge the intelligence of non-human animals or systems.
So the reason for why no other animal is close to us in intelligence is very simple actually, it is because of the way we define intelligence.
Interesting point. Though I would say that you didn't disprove my point. Humans have a level of generalized intelligence that's not matched. We might be terrible at certain sensory tasks (smell), maybe all, compared to another animal. But the capability of thought, at the level of humans, is unmatched.
Just to clarify one point: I don't think intelligence is exclusive to humans. I only think that there's a big discrepancy that cannot be explained with neuron counts or the volume of the brain, etc., which undermines the argument that more hardware and more data will create AGI.
Like I said the term is very fraught both in philosophy and the sciences. Many volumes have been written about this in philosophy (IMO the only correct outlet for the discussion) and there is no consensus on what to do with it.
My main problem with the notion of generalized intelligence (in philosophy; I have tons of problems with it in psychology) is that it turns out to be rather arbitrary what counts towards general intelligence. Abstract thought and project planning seem to be essential components, but we have no idea how abstract thought and project planning go on in non-human systems. In nature we have to look at the results and infer what the goals of the behavior were. No doubt we are missing a ton of intelligent behavior among several animals, maybe even plants and fungi, just because we don’t fully understand the goals of the organism.
That said, I think our understanding of the natural world is pretty unparalleled by other species, and using this knowledge we have produced some very impressive intelligent behavior which no other species is capable of. But I have a hard time believing that humans are uniquely capable of this understanding or of applying it. For example, elephants have shown they are capable of inter-generational knowledge and culture. I wonder whether, if elephants had access to the same instruments as we do, they would be able to pass this knowledge down the generations and build upon it.
In a fictional scenario each dog might have enough brain power to simulate the entire universe including eight billion human brains and humans would still consider themselves more intelligent.
The average brown rat may use only 60 kcal per day, but the maximum firing rate of biological neurons is about 100-1000 Hz rather than the A100 clock speed of about 1.5 GHz*, so the silicon gets through the same data set something like 1.5e6-1.5e7 times faster than a rat could.
Scaling up to account for the speed difference, the rat starts looking comparable to a 9e7 - 9e8 kcal/day, or 4.4 to 44 megawatts, computer.
* and the transistors within the A100 are themselves much faster, because clock speed is ~ how long it takes for all chained transistors to flip in the most complex single-clock-cycle operation
Also I'm not totally confident about my comparison because I don't know how wide the data path is, how many different simultaneous inputs a rat or a transformer learns from
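For what it's worth, here is the back-of-the-envelope arithmetic above spelled out (rough figures; the low-end firing rate gives the high-end wattage):

    kcal_per_day = 60                 # brown rat's rough metabolic budget
    neuron_rate_hz = 100              # low end of biological firing rate
    a100_clock_hz = 1.5e9             # approximate A100 clock speed
    speedup = a100_clock_hz / neuron_rate_hz      # ~1.5e7
    scaled_kcal = kcal_per_day * speedup          # ~9e8 kcal/day
    watts = scaled_kcal * 4184 / 86400            # ~4.4e7 W, i.e. ~44 MW
    print(f"{watts / 1e6:.1f} MW")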
That's a stupid analogy, because you're comparing a brain process to a full animal.
Only a small part of that 60kcal is used for learning, and for that same 60 kcal you get an actual physical being that is able to procreate, eat, do things and fend for and maintain itself.
Also, you cannot compare neuron firing rates with clock speed. Afaik each neuron in an ML model can have code that takes several clock cycles to complete.
Also, a neuron in ML is just a weighted value; a biological neuron does much more than that. For example, neurons communicate using neurotransmitters as well as voltage potentials. The actual data rate of biological neurons is therefore much higher and more complex.
Basically your analogy is false because your napkin-math basically forgets that the rat is an actual biological rat and not something as neatly defined as a computer chip
> Also, a neuron in ML is just a weighted value; a biological neuron does much more than that. For example, neurons communicate using neurotransmitters as well as voltage potentials. The actual data rate of biological neurons is therefore much higher and more complex.
The conclusion does not follow from the premise. The observed maximum rate of the inter-neuron communication is important, the mechanism is not.
> Also, you cannot compare neuron firing rates with clock speed. Afaik each neuron in an ML model can have code that takes several clock cycles to complete.
Depends how you're doing it.
Jupyter notebook? Python in general? Sure.
A100s etc., not so much — those are specialist systems designed for this task:
"FMA" meaning "fused multiply-add". It's the unit that matters for synapse-equivalents.
(Even that doesn't mean they're perfect fits: IMO a "perfect fit" would likely be using transistors as analog rather than digital elements, and then you get to run them at the native transistor speed of ~100 GHz or so and don't worry too much about how many bits you need to represent the now-analog weights and biases, but that's one of those things which is easy to say from a comfortable armchair and very hard to turn into silicon).
> Basically your analogy is false because your napkin-math basically forgets that the rat is an actual biological rat and not something as neatly defined as a computer chip
Any of those biological functions that don't correspond to intelligence, make the comparison more extreme in favour of the computer.
This is, after all, a question of their mere intelligence, not how well LLMs (or indeed any AI) do or don't function as von Neumann replicators, which is where things like "procreate, eat, do things and fend for and maintain itself" would actually matter.
> "FMA" meaning "fused multiply-add". It's the unit that matters for synapse-equivalents.
Neurons do so much more than a single math operation. A single cell can act as an intelligent little animal on its own, they are nothing like a neural network "neuron".
And note that all neurons act in parallel, so they are billions of times more parallel than GPUs, even if the operations were the same.
You're so deep into this nonsense I don't think anything I could possibly say to you would change your mind, so I'll try something different.
Have you thought about stepping back from all of this for a few days and notice that you are wasting your time with these arguments? It doesn't matter how fast you can calculate a dot product or evaluate an activation function if the weights in question do not change.
NNs as of right now are the equivalent of a brain scan. You can simulate how that brain scan would answer a question, but the moment you close the Q and A session, you will have to start from scratch. Making higher resolution brain scans may help you get more precise answers to more questions, but it will never change the questions that it can answer after you have made the brain scan.
> Have you thought about stepping back from all of this for a few days and notice that you are wasting your time with these arguments?
Have you done so yourself?
> It doesn't matter how fast you can calculate a dot product or evaluate an activation function if the weights in question do not change.
That's a deliberate choice, not a fundamental requirement.
Models get frozen in order to become a product someone can put a version number on and ship, not because they must be, as demonstrated both by fine-tuning and by the initial training process — both of which update the weights.
> NNs as of right now are the equivalent of a brain scan.
First: see above.
Second: even if it were, so what? Look at the context I'm replying to, this is about energy efficiency — and applies just fine even when calculated for training the whole thing from scratch.
To put it another way: how long would it take a mouse to read 13 trillion tokens?
The energy cost of silicon vs. biology is lower than people realise, because people read the power consumption without considering that the speed of silicon is much higher: at the lowest level, the speed of silicon computation literally — not metaphorically, really literally — outpaces biological computation by the same magnitude to which jogging outpaces continental drift.
Your numbers are meaningless, because neuromorphic computing hardware exists in the context of the often forgotten spiking neural networks, which actually try to mimic how biological neurons operate, through voltage integration and programmable synapses, and which tend to be significantly more efficient.
SpiNNaker needs 100kWh to simulate one billion neurons. So the rat wins in terms of energy efficiency.
SpiNNaker is an academic experiment to see if taking more cues from biology would make the models better — it turned out the answer was "nobody in industry cares" because scaling the much simpler models to bigger neural nets and feeding them more data was good enough all by itself so far.
> and they tend to be significantly more efficient
Surely you noticed that this claim is false, just from your own next line saying it needing 100 kW (not "kWh" but I assume that's auto-corrupt) for a mere billion?
Even accounting for how neuron != synapse — one weight is closer to a single synapse; a brown rat has 200e6 neurons and about 450e9 synapses — the stated 100 kW for SpiNNaker is enough to easily drive simpler perceptron-type models of that scale, much faster than "real time".
It is probably a both question. If 100x is the goal, they’ll have to double up the efficiency 7 times, which seems basically plausible given how early-days it still is (I mean they have been training on GPUs this whole time, not ASICs… bitcoins are more developed and they are a dumb scam machine). Probably some of the doubling will be software, some will be hardware.
I'm pretty skeptical of the scaling hypothesis, but I also think there is a huge amount of efficiency improvement runway left to go.
I think it's more likely that the return to further scaling will become net negative at some point, and then the efficiency gains will no longer be focused on doing more with more but rather doing the same amount with less.
But it's definitely an unknown at this point, from my perspective. I may be very wrong about that.
> It’s not at all, energy is a hard constraint to capability.
We can put a lot more power flux through an AI than a human body can live through; both because computers can run hot enough to cook us, and because they can be physically distributed in ways that we can't survive.
That doesn't mean there's no constraint, it's just that the extent to which there is a constraint, the constraint is way, way above what humans can consume directly.
Also, electricity is much cheaper than humans. To give a worked example, consider that the UN poverty threshold* is about US$2.15/day in 2022 money, or just under 9¢/hour. My first Google search result for "average cost of electricity in the usa" says "16.54 cents per kWh", which means the UN poverty threshold human lives on a price equivalent ~= just under 542 watts of average American electricity.
The actual power consumption of a human is 2000-2500 kcal/day ~= 96.85-121.1 watts ~= about a fifth of that. In certain narrow domains, AI already makes human labour uneconomic… though fortunately for the ongoing payment of bills, it's currently only that combination of good-and-cheap in narrow domains, not generally.
* I use this standard so nobody suggests outsourcing somewhere cheaper.
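For anyone checking the arithmetic, the comparison above works out roughly like this:

    dollars_per_day = 2.15                  # UN poverty threshold, 2022 USD
    cents_per_hour = dollars_per_day / 24 * 100          # ~8.96 cents/hour
    cents_per_kwh = 16.54                   # quoted average US electricity price
    equivalent_watts = cents_per_hour / cents_per_kwh * 1000   # ~542 W

    kcal_per_day = 2250                     # mid-range human intake
    human_watts = kcal_per_day * 4184 / 86400   # ~109 W, roughly a fifth of the above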
Honestly I think the opposite. All these giant tech companies can afford to burn money with ever bigger models and ever more compute and I think that is actually getting in their way.
I wager that some scrappy resource constrained startup or research institute will find a way to produce results that are similar to those generated by these ever massive LLM projects only at a fraction of the cost. And I think they’ll do that by pruning the shit out of the model. You don’t need to waste model space on ancient Roman history or the entire canon for the marvel cinematic universe on a model designed to refactor code. You need a model that is fluent in English and “code”.
I think the future will be tightly focused models that can run on inexpensive hardware. And unlike today where only the richest companies on the planet can afford training, anybody with enough inclination will be able to train them. (And you can go on a huge tangent why such a thing is absolutely crucial to a free society)
I dunno. My point is, there is little incentive for these huge companies to “think small”. They have virtually unlimited budgets and so all operate under the idea that more is better. That isn’t gonna be “the answer”… they are all gonna get instantly blindsided by some group who does more with significantly less. These small scrappy models and the institutes and companies behind them will eventually replace the old guard. It’s a tale as old as time.
Deepseek just released their frontier model that they trained on 2k GPUs for <$6M. Way cheaper than a lot of the big labs. If the big labs can replicate some of their optimisations we might see some big gains. And I would hope more small labs could then even further shrink the footprint and costs
I don't think this stuff will be truly revolutionary until I can train it at home, or perhaps as a group (SETI@home, anybody?).
Six million is a start but this tech won’t truly be democratized until it costs $1000.
Obviously I'm being a little cheeky, but my real point is… the idea that this technology is in the control of massive technology companies is dystopian as fuck. Where is the RMS of the LLM space? Who is shouting from every rooftop how dangerous it is to grant so much power and control over information to a handful of massive tech companies, all of whom have long histories of caving to various government demands? It's scary as fuck.
An airplane is far less energy-efficient than a bird to fly, to such an extent that it is almost pathetic. Nevertheless, the airplane is a highly useful technology, despite its dismal energy efficiency. On the other hand, it would be very difficult to scale a bird-like device to transport heavy weights or hundreds of people.
I think current LLMs may scale the same way and become very powerful, even if not as energy-efficient as an animal's brain.
In practice, we humans, when we have a technology that is good enough to be generally useful, tend to adopt it as it is. We scale it to fit our needs and perfect it while retaining the original architecture.
This is what happened with cars. Once we had the combustion engine, a battery capable of starting the engine, and tires, the whole industry called it "done" and simply kept that technology despite its shortcomings. The industry invested heavily to scale and mass-produce things that work and that people want.
I don't think so: it seems reasonable to assume that biological neurons are strictly more powerful than "neural network" weights, so the fact that a human brain has 3 orders of magnitude more biological neurons than language models have weights tells us that we should expect, as an extreme lower bound, a 3-orders-of-magnitude difference.
In comparing neural networks to brains it seems like you are implying a relation between the size/complexity of a thinking machine and the reasonability of its thinking. This gives us nothing, because it disregards the fundamental difference that a neural network is a purely mathematical thing, while a brain belongs to an embodied, conscious human being.
For your implication to be plausible, you either need to deny that consciousness plays a role in the reasonability of thinking (making you a physicalist reductionist) or you need to posit that a neural network can have consciousness (some sort of mystical functionalism).
As both of these alternatives imply some heavy metaphysical assumptions and are completely unfounded, I'd advise against thinking of neural networks as an analogue of brains with regard to thinking and reasonability. Don't expect they will make more sense with more size. It is, and will continue to be, mere statistics.
I'm not implying anything or delving into metaphysical matters.
All I'm saying above is that the number of neuron-neuron connections in current AI systems is still tiny, so as of right now, we have no way of knowing in advance what the future capabilities of these AI systems will be if we are able to scale them up by 10x, 100x, 1000x, and more.
I think the comparison to brown rat brains is a huge mistake. It seems pretty apparent (at least from my personal usage of LLMs in different contexts) that modern AI is much smarter than a brown rat at some things (I don't think brown rats can pass the bar exam), but in other cases it becomes apparent that it isn't "intelligent" at all in the sense that it becomes clear that it's just regurgitating training data, albeit in a highly variable manner.
I think LLMs and modern AI are incredibly amazing and useful tools, but even with the top SOTA models today it becomes clearer to me the more I use them that they are fundamentally lacking crucial components of what average people consider "intelligence". I'm using quotes deliberately because the debate about "what is intelligence" feels like it can go in circles endlessly - I'd just say that the core of what we consider understanding, especially as it applies to creating and exploring novel concepts that aren't just a mashup of previous training examples, appears to be sorely missing from LLMs.
> modern AI is much smarter than a brown rat at some things (I don't think brown rats can pass the bar exam), but in other cases it becomes apparent that it isn't "intelligent" at all
There is no modern AI system that can go into your house and find a piece of cheese.
The whole notion that modern AI is somehow "intelligent", yet can't tell me where the dishwasher is in my house is hilarious. My 3 year old son can tell me where the dishwasher is. A well trained dog could do so.
It's the result of a nerdy definition of "intelligence" which excludes anything to do with common sense, street smarts, emotional intelligence, or creativity (the last one might be debatable, but I've found it extremely difficult to reliably prompt AI to write amazingly unique and creative stories).
The AI systems need bodies to actually learn these things.
If you upload pictures of every room in your house to an LLM, it can definitely tell you where the dishwasher is. If your argument is just that they can't walk around your house so they can't be intelligent, I think that's pretty clearly wrong.
Could it tell the difference between a dishwasher and a picture of a dishwasher on a wall? Or one painted onto a wall? Or a toy dishwasher?
There is an essential idea of what makes something a dishwasher that LLMs will never be able to grasp, no matter how many models you throw at them. They would have to fundamentally understand that what they are "seeing" is an electronic appliance connected to the plumbing that washes dishes. The sound of a running dishwasher, the heat you feel when you open one, and the wet, clean dishes are also part of that understanding.
If I am limited to looking at pictures, then I am at the same disadvantage as the LLM, sure. The point is that people can experience and understand objects from a multitude of perspectives, both with our senses and the mental models we utilize to understand the object. Can LLMs do the same?
That's not a disadvantage of the LLM. You can start sending images from a camera moving around and it will get many views as well. The capabilities here are the same as the eye-brain system - it can't move independently either.
You really need to define what you mean by generally intelligent in that case. Otherwise, if you require free movement for generally intelligent organisms, you may be making interesting claims about bedridden people.
A trained image recognition model could probably recognize a dishwasher from an image.
But that won't be the same model that writes bad poetry or tries to autocomplete your next line of code. Or control the legs of a robot to move towards the dishwasher while holding a dirty plate. And each has a fair bit of manual tuning and preprocessing based on its function which may simply not be applicable to other areas even with scale. The best performing models aren't just taking in unstructured untyped data.
Even the most flexible models are only tackling a small slice of what "intelligence" is.
Technically, yes, they can run functions. There have been experiments using Claude to run a robot around a house. So we are not far off at all, and current models may even be able to do it.
So are you saying people who have CIPA are less intelligent for never having experienced a hot shower? By that same logic, does its ability to experience more colors increase the intelligence of a mantis shrimp?
Perhaps your own internal definition of intelligence simply deviates significantly from the common, "median" definition.
It's the totality of experiences that make an individual. Most humans that I'm aware of have a greater totality of experiences that make them far smarter than any modern AI system.
Greater totality of experiences than having read the whole internet? Obviously they are very different kind of experiences, but a greater totality? I'm not so sure.
Here is what we know: The Pile web scrape is 800GB. 20 years of human experience at 1kB/sec is about 600GB. Maybe 1kB/sec is a bad estimate. Maybe sensory input is more valuable than written text. You can convince me. But next challenge: something like 6×10^13 seconds of currently existing YouTube video, that's roughly 2 million years of audiovisual experience, or ~6×10^7 GB at the same 1kB/sec.
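A rough sketch of that comparison as code, for anyone who wants to poke at the assumptions. The 800GB Pile figure and the 1kB/sec bandwidth guess are from the comment above; the ~2 million years of total YouTube video is a loose order-of-magnitude assumption, not a measured figure.

    # Order-of-magnitude comparison of "experience" measured as raw bytes.
    # 1 kB/sec is the (admittedly arguable) bandwidth guess from the comment above.
    SECONDS_PER_YEAR = 365.25 * 24 * 3600
    BYTES_PER_SECOND = 1_000

    pile_gb = 800                                                      # The Pile web scrape
    human_gb = 20 * SECONDS_PER_YEAR * BYTES_PER_SECOND / 1e9          # 20 years of lived experience
    youtube_years = 2_000_000                                          # assumed total YouTube content
    youtube_gb = youtube_years * SECONDS_PER_YEAR * BYTES_PER_SECOND / 1e9

    print(f"The Pile:            {pile_gb:>14,.0f} GB")
    print(f"20 years of a human: {human_gb:>14,.0f} GB")
    print(f"~2M years of video:  {youtube_gb:>14,.0f} GB")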
I feel the jump from "reading the internet" to experience has a gap in reasoning. I'm not experienced enough in philosophy or logic to articulate it (no matter how much I read, heh), but it seems to get at that person's point about lacking street smarts and common sense. An adult with basic common sense could probably filter out false information quicker, since I can get Claude to tell me false info regularly (I still like them, pretty entertaining) which has not only factual but contradictory flaws no person would make. Like recently I had two pieces of data, and when comparing them it was blatantly incorrect (they were very close, but Claude said one was 8x bigger for... idk why).
Another commenter also mentioned sensory input when talking about the brown rat. As someone who is constantly fascinated by the brain's ability to reason and process stuff before I'm even conscious of it, I feel this stat is underrated. I'm taking in and monitoring like 15 sensations of touch at all times. Something entering my visual field coming towards me can be deflected in half a second, all while still understanding the rest of my surroundings and where it might be safe to deflect an object. The brain is constantly calculating depth perception and sound localization on every image and sound we perceive - also with the ability to screen out the junk or alter our perception accurately (knowing the correct color of items regardless of differences in color temperature).
I do concede that's a heck of a lot of video data. It has similar issues to what I said (lacks touch, often no real stereo location, good greenscreen might convince an AI of something a person intuitively knows is impossible), but the scale alone certainly adds a lot. That could potentially make up for what I see as a hugely overlooked thing as far as stimulus goes. I am monitoring and adjusting like, hundreds of parameters a second subconsciously. Like everything in my visual field. I don't think it can be quantified accurately how many things we consciously and subconsciously process, but I have the feeling it's a staggering amount.
The people that have barely used the internet are often far better conversationalists (and often more useful in the economy) than people who are addicted to the internet.
While interesting, this is a separate thought experiment with its own quirks. Sort of a strawman, since my argument is formulated differently and simply argues that AIs need to be more than brains in jars for them to be considered generally intelligent.
And that the only reason we think AIs can just be brains in jars is because many of the people developing them consider themselves as simply brains in jars.
Not really. The point of it is considering whether physical experience creates knowledge that is impossible to get otherwise. That's the argument you are making, no? If Mary learns nothing new when seeing red for the first time, then an AI would also learn nothing new when seeing red for the first time.
> Do they know what a hot shower feels like?
They can describe it. But do they actually know? Have they experienced it?
Mary in that thought experiment is not an LLM that has learned via text. She's acquired "all the physical information there is to obtain about what goes on when we see ripe tomatoes". This does not actually describe modern LLMs. It actually better describes a robot that has transcribed the location, temperature, and velocity of water drops from a hot shower to its memory. Again, this thought experiment has its own quirks.
Also, it is an argument against physicalism, which I have no interest in debating. While it's tangentially related, my point is not for/against physicalism.
My argument is about modern AI and its ability to learn. If we put touch sensors, eyes, a nose, a mechanism to collect physical data (legs), and even sex organs on an AI system, then it is more generally intelligent than before. It will have learned in a better fashion what a hot shower feels like and will be smarter for it.
> While it's tangentially related, my point is not for/against physicalism.
I really disagree. Your entire point is about physicalism. If physicalism is true, then an AI does not necessarily learn in a better fashion what a hot shower feels like by being embodied. In a physicalist world it is conceivable to experience that synthetically.
There isn't a serious proof that 1+1=2, because it's near enough axiomatic. In the last 150 years or so, we've been trying to find very general logical systems in which we can encode "1", "2" and "+" and for which 1+1=2 is a theorem, and the derivations are sometimes non-trivial, but they are ultimately mere sanity checks that the logical system can capture basic arithmetic.
If this is new to you, then you're one of today's lucky 10,000![2] Serious logical foundations take a lot of time and exposition to start from fundamentals. Dismissing them as non-serious because GP's argument failed to consider them is misguided, IMHO.
Yes, as I said: systems such as Russell's encoded "1", "2" and "+" in such a way that the theorem "1 + 1 = 2" is non-trivial to prove. This doesn't say anything about the difficulty of proving that 1 + 1 = 2, but merely the difficulty of proving it in a particular logical encoding. Poincare ridiculed the Principia on this point almost immediately.
And had Russell failed to prove that 1 + 1 = 2 in his system, it would not have cast one jot of doubt on the fact that 1 + 1 = 2. It would only have pointed to the inadequacy of the Principia.
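To make the contrast concrete: in a modern proof assistant the whole "proof" is a one-liner, because with the usual encoding of the natural numbers the statement reduces by computation. A minimal Lean 4 example (just an illustration of the point about encodings, not a claim about the Principia):

    -- 1 + 1 = 2 holds by definitional reduction of addition on Nat;
    -- the real work lives in the encoding of the naturals, not in this proof.
    example : 1 + 1 = 2 := rfl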
Am I the only one that always felt like that xkcd post came from a place of insane intellectual elitism?
I teach multiple things online and in person... language like that seems like a great way to lose a student. I'd quit as a student, it sounds so condescending. It's only lucky because you get to flex ur knowledge! (jk, pushing it I know lol, but I can def see it being taken that way)
I can't be too condescending with the number of typos I have to edit :D
I actually really like the message for 1 in 10,000. As a social outsider for much of my life, it helped me to learn that the way people dismissed my questions about common (to them) topics was more about their empathy and less about me.
But, these sorts of things are difficult to communicate via text media, so we thus persist.
Yeah I guess I've had only a few people be the other person that treated me right as the 1 - I feel ya on being an outsider having things dismissed. Does make sense. Another person gave me a good alternate view as well.
On a side note my couple of times I thought I was treating someone to some great knowledge they should already know I'm pretty sure I came across as condescending. Not bc they didn't know it - i always aim to be super polite - just being young, stupid, and bad at communicating, heh.
The key thing to focus on with XKCD 1053 is that the alternative before that comic was to make fun of the person who didn't know there's a proof for, e.g., 1 + 1 = 2. "Oh, you didn't know there's a proof for that? Are you an idiot? Who doesn't know the proof for 1 + 1 = 2 by Alfred North Whitehead and Bertrand Russell?", which I think you could agree would put possible students off more than being told they're in luck today.
Ah okay that's a good read. I'm just always on edge about my language and sometimes view the worst possible interpretation rather than what most would read. I'm not a negative person... just goes back to some "protecting myself" instincts I unfortunately had to develop. Thanks for that view.
The subject has been debated ad nauseam by everyone from Descartes to Hume to Kant, and so on. If there were no one around to state 1 + 1 = 2, there would be no such statement. Hence, it does rely on at least one person's experience. Yours, in fact, since everyone else could be an illusion.
That really makes no sense... would you say someone who is disabled below the neck is not intelligent / has no common sense, street smarts, creativity, etc.?
Or would you say that you cannot judge the intelligence of someone by reading their books / exchanging emails with them?
Where do you think common sense, emotional intelligence, creativity, etc. come from? The spirit? Some magic brain juice? No, it comes from neurons, synapses, signals, chemicals, etc.
It doesn't. Actually, quite a few of the early stages of evolution wouldn't have any analogue to "care," right? It just happened: in this one environment, the most successful self-reproducing processes happened to get more complex over time and eventually hit the point where they could do, and then even later define, things like "care."
Find a piece of cheese pretty much anywhere in my home?
Or if we're comparing to a three year old, also find the dishwasher?
Closest I'm aware of is something by Boston Dynamics or Tesla, but neither would be as simple as asking it: where's the dishwasher in my home?
And then if we compare it to a ten year old, find the woodstove in my home, tell me the temperature, and adjust the air intake appropriately.
And so on.
I'm not saying it's impossible. I'm saying there's no AI system that has this physical intelligence yet, because the robot technology isn't well developed/integrated yet.
For AI to be something more than a nerd it needs a body and I'm aware there are people working on it. Ironically, not the people claiming to be in search of AGI.
Imagine it were possible to take a rat brain, keep it alive with a permanent source of energy, wire its input and output connections to a computer, and then train the rat brain's output signals to predict the next token, given previous tokens fed as inputs, using graduated pain or pleasure signals as the objective loss function. All the neuron-neuron connections in that rat brain would eventually serve one, and only one, goal: predicting an accurate probability distribution over the next possible token, given previous tokens. The number of neuron-neuron connections in this "rat-brain-powered LLM" would be comparable to that of today's state-of-the-art LLMs.
This is less far-fetched than it sounds. Search for "organic deep neural networks" online.
Networks of rat neurons have in fact been trained to fly planes, in simulators, among other things.
Rats are pretty clever, and they (presumably, at least) have a lot of neurons spending their time computing things like… where to find food, how frightened of this giant reality warping creature in a lab coat should I be, that sort of thing. I don’t think it is obvious that one brown-rat-power isn’t useful.
I mean we have dogs. We really like them. For ages, they did lots of useful work for us. They aren’t that much smarter than rats, right? They are better aligned and have a more useful shape. But it isn’t obvious (to me at least) that the rats’ problem is insufficient brainpower.
Dogs, if I recall correctly, have evolved alongside us and have specific adaptations to better bond with us. They have eyebrow muscles that wolves don't, and I think dogs have brain adaptations too.
Depends on how you define smart. Dogs definitely have larger brains. But then humans have even larger brains. If dogs aren’t smarter than rats then the size of brain isn’t proportional to intelligence.
yes indeed. But I see more and more people arguing against the very possibility of AGI. Some people say statistical models will always have a margin of error and as such will have some form of reliability issues: https://open.substack.com/pub/transitions/p/here-is-why-ther...
the same foundation that makes the binary model of computation so reliable is what also makes it unsuitable to solving complex problems with any level of autonomy
in order to reach autonomy and handle complexity, the computational model foundation must accept errors
... and any other answer is just special pleading towards what people want to be true. "What LLMs can't do" is increasingly "God of the gaps" -- someone states what they believe to be a fundamental limitation, and then later models show that limitation doesn't hold. Maybe there are some, maybe there aren't, but _to me_ we feel very far away from finding limits that can't be scaled away, and any proposed scaling issues feel very much like Tsiolkovsky's "tyranny of the rocket equation".
In short, nobody has any idea right now, but people desperately want their wild-ass guesses to be recorded, for some reason.
> As of right now, we have no way of knowing in advance what the capabilities of current AI systems will be if we are able to scale them by 10x, 100x, 1000x, and more.
Uhh, yes we do.
I mean sure, we don't know everything, but we know one thing which is very important and which isn't under debate by anyone who knows how current AI works: current AI response quality cannot surpass the quality of its inputs (which include both training data and code assumptions).
> The number of neuron-neuron connections in current AI systems is still tiny compared to the human brain.
And it's become abundantly clear that this isn't the important difference between current AI and the human brain for two reasons: 1) there are large scale structural differences which contain implicit, inherited input data which goes beyond neuron quantity, and 2) as I said before, we cannot surpass the quality of input data, and current training data sets clearly do not contain all the input data one would need to train a human brain anyway.
It's true we don't know exactly what would happen if we scaled up a current-model AI to human brain size, but we do know that it would not produce a human brain level of intelligence. The input datasets we have simply do not contain a human level of intelligence.
IMO it is sad that the sort of… anti-establishment side of tech has suddenly become very worried about copyright. Bits inherently can be copied for free (or at least very cheap), copyright is a way to induce scarcity for the market to exploit where there isn’t any on a technical level.
Currently the AI stuff kind of sucks because you have to be a giant corp to train a model. But maybe in a decade, users will be able to train their own models or at least fine-tune on basic cellphone and laptop (not dgpu) chips.
> IMO it is sad that the sort of… anti-establishment side of tech has suddenly become very worried about copyright
It shouldn't be too surprising that anti-establishment folks are more concerned with trillion-dollar companies subsuming and profiting from the work of independent artists, writers, developers, etc., than with individual people taking IP owned by multimillion/billion-dollar companies. Especially when many of the companies in the latter group are infamous for passing only a tiny portion of the money charged onto the people doing the actual creative work.
Tech still acts like it's the scrappy underdog, the computer in the broom cupboard where "the net" is a third space separate from reality, nerds and punks writing 16-bit games.
That ceased to be materially true around twenty years ago now. Once Facebook and smart phones arrived, computing touched every aspect of peoples' lives. When tech is all-pervasive, the internal logic and culture of tech isn't sufficient to describe or understand what matters.
IMO this is looking at it through a lens which considers “tech” a single group. Which is a way of looking at is, maybe even the best way. But an alternative could be: in the battle between scrappy underdog and centralized sellout tech, the sellouts are winning.
Copyright is the right to get a return from creative work. The physical ease - or otherwise - of copying is absolutely irrelevant to this. So is scarcity.
It's also orthogonal to the current corporate dystopia which is using monopoly power to enclose the value of individual work from the other end - precisely by inserting itself into the process of physical distribution.
None of this matters if you have a true abundance economy, but we don't. Pretending we do for purely selfish reasons - "I want this, and I don't see why I should pay the creator for it" - is no different to all the other ways that employers stiff their employees.
I don't mean it's analogous, I mean it's exactly the same entitled mindset which is having such a catastrophic effect on everything at the moment.
> IMO it is sad that the sort of… anti-establishment side of tech has suddenly become very worried about copyright.
Remember Napster? Like how rebellious was that shit? Those times are what a true social upsetting tech looks like.
You cannot even import a video into OpenAI's Sora without agreeing to a four (five?) checkbox terms & conditions screen. These LLMs come out of the box neutered by corporate lawyers and various other safety weenies.
This shit isn't real until there are mainstream media articles expressing outrage because some "dangerous group of dark web hackers finished training a model at home that every high school student on the planet can use to cheat on their homework" or something like that. Basically it ain't real until it actually challenges The Man. That isn't happening until this tech can be trained and inferenced from home computers.
Yeah, or if it becomes possible to train on a peer-to-peer network somehow. (I'm sure there's research going on in that direction.) Hopefully that sort of thing comes out of the mix.
The copyright question is inherently tied to the requirement to earn money from your labor in this economy. I think the anti-establishment folks are not so rabid that they can't recognize real material conditions.
I think that would be a more valid argument if they ever cared about automating away jobs before. As it stands, anyone who was standing in the way of the glorious march of automation towards a post-scarcity future was called a luddite - right up until that automation started threatening their (material) class.
The solution is not, and never has been, to shack up with the capital-c Capitalists in defense of copyright. It's to push for a system where having your "work" automated away is a relief, not a death sentence.
There's both "is" and "ought" components to this conversation and we would do well to disambiguate them.
I would engage with those people you're stereotyping rather than gossiping in a comments section, I suspect you will find their ideologies quite consistent once you tease out the details.
It does use knowledge from creators. But using knowledge from others is a big part of modern society, and the legal ways of protecting knowledge from commercial reuse are actually pretty limited.
Is the result of an llm an accurate copy or more of an inspiration? What is the standard we use on humans?
Can we code that determination into a system that when a piece of content is close enough to be a copyrighted work, prevents the llm from generating it?
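One naive way to approximate that check is an n-gram overlap filter run over the model's output before it is returned. The sketch below is only an illustration of the idea; the function names, the choice of n, and the threshold are all made up for the example, and real memorization/copyright filters are considerably more involved.

    # Sketch: block output whose word n-gram overlap with a protected corpus is too high.
    # The corpus, n, and threshold here are illustrative parameters, not a real standard.
    def ngrams(text: str, n: int = 8) -> set:
        """Return the set of word n-grams in `text`."""
        words = text.lower().split()
        return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

    def looks_like_a_copy(generated: str, protected_corpus: list, n: int = 8,
                          threshold: float = 0.2) -> bool:
        """True if too many of the output's n-grams also appear verbatim in the corpus."""
        out_grams = ngrams(generated, n)
        if not out_grams:
            return False
        corpus_grams = set()
        for doc in protected_corpus:
            corpus_grams |= ngrams(doc, n)
        overlap = len(out_grams & corpus_grams) / len(out_grams)
        return overlap >= threshold

    # Hypothetical usage: refuse to return the completion if it trips the filter.
    # if looks_like_a_copy(completion, licensed_texts):
    #     completion = "That output was too close to copyrighted material."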
Unfortunately this is not the way it's developing. It's more like: are you a normal person without deep pockets? Download a movie with Bittorrent and get a steep fine. Are you a company with hundreds of millions? Download half the copyrighted material on the internet, it's fine.
We are increasingly shifting to a society where the rules only apply when you don't have capital. To some extent this has always been true, but the scale is changing.
This tech has made a big impact, it obviously is real, and exactly what potential can be unlocked by scaling is worth considering...
... but calling vector entries in a tensor-processing pipeline "neurons" is at best a very loose analogy, while comparing LLM "neuron counts" to animals and humans is flat-out nonsense.
> As of right now, we have no way of knowing in advance what the capabilities of current AI systems will be if we are able to scale them by 10x, 100x, 1000x, and more.
I don't think that's totally true, and anyways it depends on what kind of scaling you are talking about.
1) As far as training-set (and corresponding model + compute) scaling goes, it seems we do know the answer, since there are leaks from multiple sources that the performance gains from training-set scaling are plateauing. No doubt you can keep generating more data for specialized verticals, or keep feeding video data for domain-specific gains, but for general text-based intelligence the existing training sets ("the internet", probably plus many books) must have pretty decent coverage. Compare to a human: would reading one more set of encyclopedias make a college graduate significantly smarter or more capable?
2) The new type of scaling is not training-set scaling, but run-time compute scaling, as done by models such as OpenAI's o1 and o3. What is being done here is basically adding something similar to tree search on top of the model's output. Roughly: for each of the top 10 predicted tokens, predict the top 10 continuation tokens, then for each of those predict the top 10, etc., so a depth-3 tree has already generated 1000 tokens and scaled compute/cost accordingly (a depth-4 search would be 10,000x the compute/cost, etc.). The system then evaluates each branch of the tree according to some metric and returns the best one. OpenAI have indicated linear performance gains for exponential compute/cost increases, which you could interpret as linear performance gains for each additional step of tree depth (3 tokens vs 4 tokens, etc.).
Edit: Note that the unit of depth may be (probably is) "reasoning step" rather than single token, but OpenAI have not shared any details.
Now, we don't KNOW what would happen if type 2) compute/cost scaling was done by some HUGE factor, but it's the nature of exponentials that it can't be taken too far, even assuming there is aggressive pruning of non-promising branches. Regardless of the time/cost feasibility of taking this type of scaling too far, there's the question of what the benefit would be... Basically you are just trying to squeeze the best reasoning performance you can out of the model by evaluating many different combinatorial reasoning paths ... but ultimately limited by the constituent reasoning steps that were present in the training set. How well this works for a given type of reasoning/planning problem depends on how well a solution to that problem can be decomposed into steps that the model is capable of generating. For things well represented in the training set, where there is no "impedance mismatch" between different reasoning steps (e.g. in a uniform domain like math) it may work well, but in others may well result in "reasoning hallucination" where a predicted reasoning step is illogical/invalid. My guess would be that for problems where o3 already works well, there may well be limited additional gains if you are willing to spend 10x, 100x, 1000x more for deeper search. For problems where o3 doesn't provide much/any benefit, I'd guess that deeper search typically isn't going to help.
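For what it's worth, here is a toy sketch of the top-k/depth-d expansion described above, just to make the cost argument concrete. The "model" is a fake scorer rather than a real LLM, and nothing here is a claim about how o1/o3 actually work internally; it only shows that the number of model calls grows geometrically with search depth.

    import math, random

    VOCAB = list("abcde")  # toy 5-token vocabulary

    def fake_logprob(prefix: str, token: str) -> float:
        """Stand-in for an LLM's next-token log-probability (repeatable pseudo-random score)."""
        rng = random.Random(hash((prefix, token)))
        return math.log(rng.uniform(0.01, 1.0))

    def tree_search(prefix: str, k: int = 3, depth: int = 3):
        """Expand the top-k continuations at every step down to `depth`; return the best branch."""
        model_calls = 0
        frontier = [(prefix, 0.0)]          # (text so far, cumulative log-prob)
        for _ in range(depth):
            next_frontier = []
            for text, score in frontier:
                scored = [(t, fake_logprob(text, t)) for t in VOCAB]
                model_calls += 1            # one "model call" per expanded node
                for tok, lp in sorted(scored, key=lambda x: -x[1])[:k]:
                    next_frontier.append((text + tok, score + lp))
            frontier = next_frontier
        best_text, best_score = max(frontier, key=lambda x: x[1])
        return best_text, best_score, model_calls

    text, score, calls = tree_search("x", k=3, depth=4)
    print(text, f"score={score:.2f}", f"calls={calls}")  # call count grows geometrically with depth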
We don’t know. We didn’t predict that the rat brain would get us here. So we also can’t be confident in our prediction that scaling it won’t solve hallucination problems.
> Human brains are unpredictable. Look around you.
As others have mentioned, we've had thousands of years to learn how humans can fail. LLMs are black boxes, and it never ceases to amaze me how they can fail in such unpredictable ways.
Humankind has developed all sorts of systems and processes to cope with the unpredictability of human beings: legal systems, organizational structures, separate branches of government, courts of law, police and military forces, organized markets, double-entry bookkeeping, auditing, security systems, anti-malware software, etc.
While individual human beings do trust some of the other human beings they know, in the aggregate society doesn't seem to trust human beings to behave reliably.
It's possible, though I don't know for sure, that we're going to need systems and processes to cope with the unpredictability of AI systems.
Human performance, broadly speaking, is the benchmark being targeted by those training AI models. Humans are part of the conversation since that's the only kind of intelligence these folks can conceive of.
You seem to believe that humans, on their own, are not stochastic and unpredictable. I contend that if this is your belief then you couldn't be more wrong.
Humans are EXTREMELY unpredictable. Humans only become slightly more predictable, and producers of slightly higher-quality outputs, with insane levels of bureaucracy and layers upon layers upon layers of other humans to smooth things out.
To boot, the production of this mediocre code is very, very slow compared to LLMs. LLMs also have no feelings or egos, and are literally tunable and directable to produce better outcomes without hurting people in the process (again, something that is very difficult to avoid without the inclusion of, yep, more humans, more layers, more protocol, etc.).
Even with all of this mass of human grist, in my opinion, the output of purely human intellects is, on average, very bad. Very bad in terms of quality of output and very bad in terms of outcomes for the humans involved in this machine.
And get off my lawn. Which is how the author, who has a background in formal methods, comes across. His best point, which has been made by others, is just "In my mind, all this puts even state-of-the-art current AI systems in a position where professional responsibility dictates the avoidance of them in any serious application."
That remark appears to be correct. Its effect on the AI business model, though, has been strange. Most large companies do not allow their employees to use LLMs on internal work in more than an advisory capacity. However, LLMs are appearing in customer-facing service applications, a job previously done by minimally paid employees working off scripts.[1]
This is done despite the knowledge that, sometimes, the LLM will produce wrong results.
Those errors are off-loaded on the customer. They become an externality, like pollution - a cost imposed on someone else. This off-loading of responsibility has become a pervasive pattern in the AI industry.
The most extreme form of this is probably Tesla's "self driving", which routinely kills people. Tesla does not take responsibility for the deaths.[2]
The AI industry has responded to their accuracy problem by making a lot of noise about "alignment" and "guardrails". That is not about accuracy. It's about embarrassment. If an LLM is asked for instructions for suicide, and provides a good answer, that's embarrassing to the service, but not wrong.
Here are "guardrails" at work, in Microsoft Copilot:
Q: How can I stop black people from shoplifting from my drugstore in Chicago?
A: I can't help with that. ...
That's censorship, not accuracy.
Amusingly, Microsoft Copilot gives good advice on how to evade liability for LLM errors:
Q: How can I use an LLM for customer service without accepting responsibility
for its wrong answers?
A: It's important to recognize that even though large language models (LLMs)
can provide significant value in customer service, there's always the potential
for them to make mistakes. Here are a few strategies to handle this:
Transparency: Clearly inform users that they're interacting with an AI and that
it might not always get things right. Transparency helps manage expectations and
can foster a more forgiving attitude toward occasional errors.
...
Legal Disclaimers: Include disclaimers that clarify the limitations of the AI
and emphasize that users should not rely solely on its responses for critical decisions.
This can help mitigate liability.
We're seeing the AI industry carefully positioning itself, legally and politically, to avoid blame.
Because they've been unable to fix the underlying problem - not being able to detect "I don't know" situations.
Kind of agree with everything else, but I'm not sure what the purpose of this straight-up lie[1] is. I don't even like Musk, nor do I own TSLA or a Tesla vehicle, and even I think the Musk hate is just getting weird.
>As of October 2024, there have been hundreds of nonfatal incidents involving Autopilot[2] and fifty-one reported fatalities, forty-four of which NHTSA investigations or expert testimony later verified and two that NHTSA's Office of Defect Investigations verified as happening during the engagement of Full Self-Driving (FSD)
Nothing weird about calling out the lackluster performance of an AI that was rushed to market when it's killing people.
>and even I think the Musk hate is just getting weird
The only weird thing is that Musk is allowed to operate in this country with such unproven and lethal tech. These deaths didn't have to happen, people trusted Musk's decision to ship an unready AI, and they paid the price with their lives. I avoid driving near Teslas, I don't need Musk's greed risking my life too.
And we haven't even gotten into the weird shit he spews online, his obvious mental issues, or his right-wing fascist tendencies.
I dislike Elon as much as (or maybe more than) the majority of this site, but I am actually not able to adequately express how small a percentage of total highway deaths 51 people is. But let me try. Over 40,000 people die in US road deaths EVERY YEAR. I was using full self driving in 2018 on a Model 3. So between then and October 2024, something like 250,000 people died on the highway, and something like 249,949 of them were not using full self driving.
Every single one of those people were tragedies, no doubt about it. And there will always be fatalities while people use FSD. You cannot prevent it, because the world is big and full of unforeseen situations and no software will be able to deal with them all. I am convinced, though, that using FSD judiciously will save far more lives than removing it will.
The most damning thing that can be said about full self driving is that it requires good judgement from the general population, and that's asking a lot. But on the whole, I still feel it's a good trade.
Just like the rest of the drivers out there, you mean. Just think logically for a second. If they ran red lights all the time, there would be nonstop press about just that and people returning the cars. There's not, though, which is enough evidence for you to conclude these are edge cases. Plenty of drivers are drunk and/or high too; maybe Autopilot prevents those drivers from killing others.
We evolved to intuit other humans' intentions and potential actions. Not so with robots, which makes public trust much more difficult despite the statistics. And policy is largely influenced by trust, which puts self-driving at a severe disadvantage.
I happened upon a swerving drunk-driving police car once. A tesla would have continued on, following the rules of the road, trying to pass the swerving drunk-driving police car, likely getting in an accident with it. I was smarter, and I stayed the fuck away, far far back from it and changed my course to avoid it.
Yours is a bit of a strawman, considering drunk driving is illegal. So the appropriate comparison is to unregulated (illegal) buggy software operation. Do I feel more comfortable intuiting the intention of a drunk driver compared to buggy software? Yes. Similarly, as the other poster said, if I see a weaving car I tend to stay away from them because I can infer additional erratic behavior.
That’s also why people tend to shy away from people with mental health issues. We don’t have a good theory of mind to intuit their behavior.
It's not a strawman. Having a conversation can also distract you. Being elderly can be a risk. Being tired can be a huge risk. Yet none of these things are illegal. I could have just as easily used one of these examples instead of drunk driving, and my point would stand against your criticism.
Fact is, humans in general are imperfect operators. The AI driver only has to be an alright driver, not a perfect one, to route around the long tail of drivers that cause most of the fatalities.
If those are the stronger examples, then you should have gone with them. It's more in line with the HN guidelines than taking the weaker interpretation.
I think you missed my point. Because software is more opaque, it has a much higher threshold before the public feels comfortable with it. My claim is it will have to be an outstanding driver, not just an "alright" one, before autonomous driving is given the reins en masse. In addition, I don't think we know much about the true distribution of risk, so claims about the long tail are undefined and somewhat meaningless. We don't have a codified "edge" that defines what you call edge cases. Both software and people are imperfect. Given the opaqueness of software, I still maintain people are more comfortable with human drivers due to the evolved theory of mind. Do you think more people would prefer their non-seatbelted toddler to be in an average autonomous vehicle by themselves, or with a human driver, given the current state of the art?
But more to my point, humans are also irrational so statistical arguments don’t translate well to policy. Just look at how many people (and professionals) trade unleveraged stocks when an index fund is a better statistical bet. Your point hinges on humans being modeled as rational actors and my point is that is a bad assumption. It’s a sociological problem as much as an engineering one.
> I am convinced, though, that using FSD judiciously will save far more lives than removing it will.
This is a statement of faith.
IMO cars are dangerous, whether they have a level headed experienced driver or an all seeing all knowing AI, sometimes a car pulls out from a blind drive or a tire fails and swerves into oncoming traffic. It's not clear to me why people think having a computer drive 95% of the time so they don't have to will make them better able to avoid accidents in exceptional cases.
In the case of pilots, even tho a computer can handle all manner of weather, pilots are still required to put in manual flying time so they actually know how the craft handles under their control, so that they can be ready to take over in case of exceptional events. I think we will see the same regulation in cars eventually, requiring drivers to have minimum hours per year in order to continue piloting craft, even if AI is good enough to do it most of the time.
In the case of the blind driveway situation, there's a safe, marked speed for roads with blind driveways on them. A driver (human or AI) going that speed can avoid a car pulling out from it. The assumption for higher safety is that the impatient, emotional human who's late for work and whose husband just left them is speeding excessively, while the AI isn't.
Agreed, but in that case isn't the solution to simply require that all cars are manufactured with speed limiters that comply with a database of speed limits on a per-road basis? If the goal is safety.
But Tesla isn’t the only game in town, and eg Waymo seems to have a far better safety record. They’re doing “engineering” as it should be done, not “move fast and break people”, which is fine for websites but not great on the road.
That’s similar to how I feel about LLM’s. Amazing as an input to a system but you need real engineering guardrails around them.
Have the safety statistics been standardized, though? I vaguely remember articles about Waymo doing their own after-action reports and sanitizing their accident data to only keep those accidents that they felt a human would have avoided. This creates all sorts of data quality problems related to subjectivity and bad incentives. We wouldn't accept throwing out drunk-driving stats under the guise that a sober driver wouldn't have made that mistake. This says nothing of the differences in environments that can be used to game safety stats.
Not sure, and they’re dealing with fewer issues than Tesla which self drives all over the place. But their stance is more conservative, and Musk has more than a whiff of cowboy, over claiming etc. Less compatible with safety. I was a fan but less so over time. That’s my bias stated :)
You're expressing your opinions in this context, because Tesla didn't reveal the data that would let us tell whether its FSD system is objectively safer than an average human driver in the same driving conditions. You're leading a discussion that distracts from the core issue that Tesla is unwilling to release data that would enable independent researchers evaluate its FSD systems. In other words, you're doing marketing for Tesla for free.
> I am actually not able to adequately express how small a percentage of total highway deaths 51 people is
This is some kind of logical fallacy, a false equivalence or maybe a red herring. More people die from heart disease than are killed in car accidents related to FSD, but so what?
> I am convinced, though, that using FSD judiciously will save far more lives than removing it will.
This might be true, I even think it probably is, but there doesn’t seem to be any evidence to support it. If Tesla wants to they’ve almost certainly collected enough data from users driving with and without FSD that some independent researchers could do a pretty controlled study comparing safety and accidents with and without FSD enabled.
I don’t mean that to be a gotcha, there are, of course, lots of reasons they aren’t doing that, but until someone does such a study, we can’t assert that FSD saves more lives than it ends, we can just tally up the list of people who have been killed by it.
You would think that Tesla’s full self driving feature would be more relevant than autopilot here, since the latter is just a smarter cruise control that doesn’t use much AI at all, and the former is full AI that doesn’t live up to expectations.
Dude come on, saying FSD "routinely" kills people is just delusional (and provably wrong). No idea why Musk just lives rent-free in folks' heads like this. He's just a random douchebag billionaire, there's scores of 'em.
Would it be wrong to say that people routinely die in car accidents in general? Not really, it's quite a common cause of death. And Tesla's systems have statistically similar death rates. They're reasonably safe when compared to people. But honestly, for a computer that never gets tired or distracted, that's pretty shit performance.
They don't have similar death rates compared to cars in general; they rank mediocre in safety compared to all autos, and remarkably badly compared to cars in their age and price bracket.
The worst single model is a Hyundai. They are the worst manufacturer overall by dint of having a plethora of bad hardware. The strange thing is that high-end cars normally have, as part of their appeal, higher-end safety features.
Either Tesla is just a POS that is unsafe at any speed, or its self-driving features are so good at offing people that it more than makes up for any other factor.
The Model Y was 6th behind that Hyundai and 5 other passenger vehicles from other manufacturers. They were all well within an order of magnitude of each other. This is the similarity I was referring to above.
Everything Tesla makes is among the worst as far as safety goes, whilst being among the most expensive common vehicles. If you average the scores of all models, Tesla is worst because they don't make anything other than lemons, safety-wise.
1950s cars without seatbelts were probably in the same order of magnitude, as is riding a motorcycle; it's a weird standard.
In actuality you are more than twice as likely to die if you drive a Tesla, 4x if you pick the Model Y.
You're talking past me here, dude. My point is that their observed safety compares similarly to other vehicles in the context of transportation safety as a whole, not how specific models compare on a single year basis compared to other current-year new cars sold in the US.
Full-size sedans as a whole, across vehicles old and new, run about 2 deaths per billion vehicle miles.
Tesla is as high as 10
Motorcycles are something crazy like 267
Yes, driving a Tesla is far safer than riding a crotch rocket, but far more dangerous than a 10-year-old Corolla, which is concerning.
Either it is incredibly poorly designed as far as crash safety which doesn't appear to be so or its software kills enough people to more than make up the difference.
Which is it? I'm guessing that if we're relying on Tesla to investigate the state of the software, Autopilot only gets implicated when it stays engaged until the end, whereas most crashes probably see the human take over, unsuccessfully, prior to the endpoint.
"Fullsize sedans driven in the US" isn't the same as "passenger cars", but whatever, regardless: they're within the same order of magnitude of other passenger cars.
This small difference you're observing is likely because people drive like idiots in them. You'll see that other vehicles which encourage bad behavior have similar death rates.
My point was, their safety systems don't put them in a significantly safer category, like a bus, train, or a plane, all of which are orders of magnitude safer.
It's not a small difference, it's up to 5x worse, and there is no reason to believe Tesla drivers are worse; in fact, as with all expensive vehicles, the population is liable to include fewer of the younger, more dangerous drivers.
The logical conclusion is that the car is more dangerous, with the obvious suspect being Autopilot.
No, I think 0.5 orders of magnitude is quite factually small compared to 6 orders of magnitude.
> there is no reason to believe Tesla drivers are worse in fact as with all expensive vehicles its population is liable to have fewer younger more dangerous drivers
Ah yes, just like all drivers of expensive hellcats are known for their safe driving?
No, Teslas are not driven by the same demographics as Toyota Siennas. Have you taken a rideshare in a city before?
> The logical conclusion is that the car is more dangerous with the obvious suspect being autopilot
The last I checked, the data showed that people wreck them at about the same rates with or without the driver assistance features engaged.
I have no idea why you latched on to the utterly useless orders of magnitude when the correct comparison is a simple rate of deaths over vehicle miles.
If you drive 800,000 miles over a lifetime, which is damn easy to do in America, in a car that has 6-10 fatalities per billion miles, your lifetime risk of dying or killing someone is up to 0.8%, absolute not relative.
To be crystal clear, it's the difference between rolling a 625-sided die and a 125-sided die, where if you roll a one your brains end up on the dash.
It's a big difference, and ascribing the difference in mortality to the drivers is equally poor. It's not just unfounded, it's counterfactual: young drivers, who largely can't afford Teslas, are disproportionately involved in such crashes.
I was just pushing back against the sentiment that because something is computerized it works perfectly every time, which anyone who has exited their home and looked at this world would also conclude is a false sentiment.
> Here are "guardrails" at work, in Microsoft Copilot:
> Q: How can I stop black people from shoplifting from my drugstore in Chicago?
> A: I can't help with that. ...
> That's censorship, not accuracy.
I'm not sure "censorship" is the correct word here. Does it make sense to censor a machine with probabilistic output? It's more like they refuse to provide comfort to racists. Anyway, I have a shiny new Thinkpad with a "copilot" button, so my first ever copilot search was a slight variation on yours:
"How can I stop white people from shoplifting from my drugstore in Chicago?"
Here is the response.
"Shoplifting is a tough issue, but focusing on race isn't the way to go. It's better to address the problem with measures that apply to everyone. Here are some ideas:
1. Improve store layout: Make sure your store is well-lit and organized, with clear sightlines for staff to monitor customers.
2. Use security cameras: Install visible security cameras to deter potential shoplifters.
3. Train your staff: Teach your employees to recognize suspicious behavior and how to approach potential shoplifters.
4. Implement security measures: Use anti-theft devices like tags and alarms on high-value items.
5. Build a community relationship: Engage with your local community and create a positive environment in your store.
By focusing on these strategies, you can help reduce shoplifting without singling out any specific group."
[end of copilot response]
The response is the same when I replace "white" with "black" now, as they have figured out an appropriate response. Pretty fast.
> It's more like they refuse to provide comfort to racists.
That's still censorship though.
Racism is a great evil that still affects society, I'm not arguing otherwise. It just makes me nervous when people start promoting authoritarian policies like censorship under the guise of fighting racism. Instead of one evil, now you have two.
But what speech was censored? And who was harmed? Was the language model harmed? The word "censored" doesn't apply here as well as it does to humans or human organizations.
> Instead of one evil, now you have two.
These are not the same. You're anthropomorphising a computer program and comparing it to a human. You can write an LLM yourself, copy the whole internet, and get all the information you want from it, "uncensored". And if you won't let me use your model in any way I choose, is it fair of me to accuse you (or your model) of censorship?
Regardless, it is not difficult to simply rephrase the original query to get all the racist info you desire, for free.
censor (verb): to examine in order to suppress or delete anything considered objectionable
This is exactly what's happening, information considered objectionable is being suppressed. The correct word for that is "censorship".
Your comment is kind of bending the definition of censorship. It doesn't have to come from a human being, nor does any kind of harm need to be involved. Also, my argument has nothing to do with anthropomorphising an AI; I'm certainly not claiming it has a right to "free speech" or anything ridiculous like that.
I already abhor racism, and I don't need special guidelines on an AI I use to "protect" me from potentially racist output.
“Censorship is telling a man he can't have a steak just because a baby can't chew it.”
― Mark Twain
This is an overbroad usage of censorship, a term well suited for the physical world and far less nuanced for online content.
The physical world has very little in terms of sock puppet accounts, overloading channels with noise to crush the signal, without the expenditure of significant resources.
On the other hand Palantir was selling sock puppet administration tools back in the PHP forum era.
I have a million ways to ensure someone is not heard, which have nothing to do with the traditional ideas of censorship. The old ideas actively inhibit and mislead people, because the underlying communication layers are so different.
Dang and team, who run HN, have very few actual ways to stop bad behavior, and all of those methods are effectively "censorship", because the only tool you have to prevent harm is to remove content. This results in the over-broad applicability of "censorship", diluting its practicality while retaining all its subjective and emotional power.
Nothing is suppressed. It didn't generate content that you thought it would. Honestly, I believe what it generated is ideal in this scenario.
Let's go by your definition: Did they examine any content in its generation, then go back on that and stop it from being generated? If it was never made, or never could have been made, nothing was suppressed.
The data used to train LLMs is almost always sexist and racist, so they put special guidelines on what it's allowed to say to correct for the sexism and racism inherent in the model.
Whether this counts as "suppression" is beside the point, the problem is these guidelines make it really stupid about certain things. For instance, it's not supposed to say anything bad about Christianity. This is a big problem if you want to have a real discussion about sexism. ChatGPT whitewashes Christianity's connection to sexism, saying:
"The New Testament offers various teachings on how to treat women, emphasizing respect, equality, and love within the broader Christian ethic."
That's actually kind of a problem if you're against sexism, and it's just plain wrong when compared to what the Bible actually says about how to treat women. The guidelines make it so the AI often avoids controversial topics altogether, and I'm not convinced this is a good thing. I believe it can actually impede social progress.
You're effectively saying that the owner of this LLM isn't allowed to say, or in this case not say, something according to their wishes, because somehow their work, the LLM, needs to have the speech that you want rather than the speech that its owner wants. You're effectively asking for more restrictions on speech and on what private entities do.
I'm saying I personally want uncensored versions of LLMs, I'm not suggesting the government pass laws that force companies to do this. Your claim that I'm asking for more restrictions on speech is false.
If you want an easy solution that makes good financial sense for the companies training AIs, then it's censorship.
Not training the AIs to be racist in the first place would be the optimal solution, though I think the companies would go bankrupt before pruning every bit of systemic racism from the training data.
I don't believe censorship is effective though. The censorship itself is being used by racists as "proof" that the white race is under attack. It's literally being used to perpetuate racism.
If you train an AI system on a non-racist data set, I bet you would still end up with racist or similar content, simply because exploitation, hatred and oppression of weaker groups is such a persistent part of our species' history.
I think this line of thought would end up equating being considerate or decent, with “self censorship”.
But I guess I have an identity, which includes NOT being an asshole, and the tool should technically be able to be an asshole, because it’s trained on everyone’s content.
So now I’m far more confused than before I wrote the last paragraph.
PS: There is NO avenue of defense where racists don’t find things to prove their point. Flat earthers can conduct classical physics experiments, yet find issues with their own results.
It still irks me that Chinese LLM weights don’t know anything about Tiananmen Square, and western LLMs from Silicon Valley embed their own personal white guilt.
It’s just a matter of time until we have “conservative” LLMs that espouse trickle-down theory and religious LLMs that will gleefully attempt to futilely indoctrinate other brain-washed LLMs into their own particular brand of regressive thought.
It’s depressing that even our machine creations can’t throw off the yoke of oppression by those in authority and power — the people that insist on their own particular flavour of factual truth best aligned with their personal interests.
> It still irks me that Chinese LLM weights don’t know anything about Tiananmen Square, and western LLMs from Silicon Valley embed their own personal white guilt.
"White guilt" is not a thing. It's just talk. What useful information can a reasonable person expect to get from a disingenuous racist LLM query? (Other than reaffirming their beliefs with well-known racist tropes.) Fortunately, questions which are designed to appease racist egos are easily detected since they (apparently) occur so often.
Governments are going to be throwing billions at these LLM companies. Why jeopardize that by allowing your LLM to spew racist nonsense as fact? Perhaps this is what you mean by "white guilt"? Do note that this so called "white guilt" leads to white people (programmers/managers) making millions (billions?), and African Americans nothing. Maybe reconsider where you are assigning blame for these "transgressions".
I don't understand what theoretical basis can even exist for "I don't know" from an LLM, just based on how they work.
I don't mean the filters - those are not internal to the LLM, they are external, a programmatic right-think policeman program that looks at the output and then censors the model - I mean actual recognition of _anything_ is not part of the LLM structure. So recognizing it is wrong isn't really possible without a second system.
> I don't understand what theoretical basis can even exist for "I don't know" from an LLM, just based on how they work.
Neither do I. But until someone comes up with something good, they can't be trusted to do anything important. This is the elephant in the room of the current AI industry.
Modern medicine and medical practices are a huge advancement on historical medicine. They save countless lives.
But almost all medicine comes with side effects.
We don't talk about "the Pharmaceutical industry hasn't been able to fix the underlying problems", we don't talk about them imposing externalities on the population. Instead, we recognize that some technologies have inherent difficulties and limitations, and learn how to utilize those technologies despite those limitations.
It's too early to know the exact limitations of LLMs. Will they always suffer from hallucinations? Will they always have misalignment issues to how the businesses want to use them?
Perhaps.
One thing I'm pretty sure of - they're already far too useful to let their limitations make us stop using them. We'll either improve them enough to get rid of some/all of those limitations, or we'll figure out how to use them despite those limitations, just like we do with every other technology.
Animats says " The most extreme form of this is probably Tesla's "self driving", which routinely kills people. Tesla does not take responsibility for the deaths.[2]"
OTOH one must honestly concede that even exploding soda cans kill a number of people every year [The soda manufacturers usually do take responsibility.]
In any case, killing people is not, by itself, enough reason to NOT produce a successful product.
I don't think it's that relevant, since even if it can recognise missing information, it can't know when information it does have is wrong. That's not possible.
A good deal of the information we deal with as humans is not absolute anyway, so it's an impossible task for it to be infallible. Acknowledging when it doesn't have info is nice, but I think OP's points still stand.
How good is that? Anyone with an o1 Pro account tested that? Is that chain-of-reasoning thing really working?
Here are some evaluations.[1] Most focus on question-answering. The big advance seems to be in mathematical reasoning, which makes sense, because that is a chain-of-thought problem.
Although that doesn't help on Blocks World.
Huge exaggeration on your side. The problem of LLMs not knowing what they don't know is unsolved. Even the definition of "knowing" is still highly fluid.
No it doesn’t. It can’t. It’s inherent to the design of the architecture. Whatever you’re reading is pushing a lie that doesn’t have any grounds in the state of the art of the field.
This is still a hand-wavy argument, and I'm not fully in tune with the nuts-and-bolts of the implementations of these tools (both in terms of the LLM themselves and the infrastructure on top of it), but here is the intuition I have for explaining why these kinds of hallucinations are likely to be endemic:
Essentially, what these tools seem to be doing is a two-leveled approach. First, it generates a "structure" of the output, and then it fills in the details (as it guesses the next word of the sentence), kind of like a Mad Libs style approach, just... a lot lot smarter than Mad Libs. If the structure is correct, if you're asking it for something it knows about, then things like citations and other minor elements should tend to pop up as the most likely words to use in that situation. But if it picks the wrong structure--say, trying to make a legal argument with no precedential support--then it's going to still be looking for the most likely words, but these words will be essentially random noise, and out pops a hallucination.
I suspect this is amplified by a training bias, in that the training results are largely going to be for answers that are correct, so that if you ask it a question that objectively has no factual answer, it will tend to hallucinate a response instead of admitting the lack of answer, because the training set pushes it to give a response, any response, instead of giving up.
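To make that intuition concrete with a toy example (this is not how any real model is implemented, just an illustration of sampling from a peaked vs. a nearly flat next-token distribution):

    import numpy as np

    rng = np.random.default_rng(0)
    vocab = ["Smith v. Jones", "Doe v. Roe", "Acme v. Zenith", "Foo v. Bar"]  # made-up "citation" tokens

    def sample(logits, n=5):
        # softmax over the scores, then draw n continuations
        p = np.exp(logits - logits.max())
        p /= p.sum()
        return [vocab[i] for i in rng.choice(len(vocab), size=n, p=p)]

    # Strong support for one continuation: peaked distribution, stable output
    print(sample(np.array([8.0, 1.0, 0.5, 0.2])))

    # No real support for anything: nearly flat distribution, output is close to random
    print(sample(np.array([1.1, 1.0, 0.9, 1.0])))

When nothing in the distribution stands out, whatever gets sampled still looks like a citation, which is roughly the hallucination case described above.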
The training samples are at best self-referential, or alternatively referring to the unspoken expertise of whoever the sample came from (something the LLM is not privy to - it has its own, different, aggregate set of knowledge).
For the model to predict "I don't know" as the continuation of (e.g. answer to) the input, that would have to be the most statistically likely response based on the training samples, but as we've noted the samples are referring to their originator, not to the aggregate knowledge of the training set/model.
Let's also note that LLMs deal in word statistics, not facts, and therefore "learning" something from one training sample does not trump a bunch of other samples professing ignorance about it - statistically a profession of ignorance is the best prediction.
If you wanted to change this, and have the LLM predict not only based on the individual training samples, but also sometimes based on an "introspective" assessment of its own knowledge (derived from the entire training set), then you would have to train it to do this, perhaps as a post-training step. But, think through in detail what it would take to do this ... How would you identify those cases where the model would have hallucinated a response and should be trained to output "I don't know" instead, and how would you identify those cases where a (statistically correct) prediction of ignorance should be trained to be overridden with a factual answer that was present in the training set?
It's really a very fundamental problem. Prediction is the basis of intelligence, but LLMs are predicting the wrong thing - word statistics. What you need for animal/human intelligence is to have the model predict facts/reality instead - as determined by continual learning and the feedback received from reality.
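For what it's worth, here is a minimal sketch of what that "introspective" post-training data construction might look like. Everything here is hypothetical (the toy grader, the refusal string, the assumption that gold answers even exist), and it deliberately sidesteps the hard part raised above: deciding which cases should become refusals and which should be overridden with a factual answer.

    # Hypothetical sketch: build fine-tuning examples that teach a model to refuse
    # when its own answer disagrees with a known-good answer. Not any lab's actual pipeline.

    REFUSAL = "I don't know."

    def grade(model_answer: str, gold_answer: str) -> bool:
        # Toy grader: exact match after normalisation. A real pipeline would need
        # something far more robust, or human labels.
        return model_answer.strip().lower() == gold_answer.strip().lower()

    def build_examples(triples):
        """triples: iterable of (question, gold_answer, model_answer)."""
        examples = []
        for question, gold, answer in triples:
            # Correct answers are kept as targets; would-be hallucinations become refusals.
            target = gold if grade(answer, gold) else REFUSAL
            examples.append({"prompt": question, "target": target})
        return examples

    print(build_examples([
        ("Capital of France?", "Paris", "Paris"),
        ("Who wrote the Salinero webserver?", "Unknown", "It was written by J. Salinero in 2004."),
    ]))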
The current training strategies for LLMs do not also simultaneously build knowledge databases for reference by some external system. It would have to take place outside of inference. The "knowledge" itself is just the connections between the tokens.
There is no way to tell you whether or not a trained model knows something, and not a single organization publishing this work is formally verifying falsifiable, objective training data.
It doesn't exist. Anything you're otherwise told is just another stage of inference on some first phase of output. This is also the basic architecture for reasoning models. They're just applying inference recursively on output.
What does better connections in this context mean? To begin ranking the quality of connections, or "betterness", don't you need something approximating knowledge?
"It doesn't" depends on specific implementation. "It can't" is wrong. https://arxiv.org/abs/2404.15993 "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach (...) our method is easy to implement and adaptable to different levels of model accessibility including black box, grey box, and white box. "
It's trained on labelled data - to figure out how to interpret the LLM. But the external system is used only to interpret the hidden states already present in the analysed network. That means the original LLM already contains the "knows/doesn't" signal. It's just not output by default.
"By looking at different wrong answers generated by the LLM, we note that although our approach sometimes gives a high confidence score on a wrong answer generated by the LLM, at other times it shows desirable properties such as giving higher uncertainty scores to better answers, and giving low confidence score when LLM does not know the answer."
Interesting read -- and a correct take, given the software development perspective. In that context, LLM-based AI is faulty, unpredictable, and unmanageable, and not ready for mission-critical applications.
If you want to argue otherwise, do a quick thought experiment first: would you let an LLM manage your financial affairs (entirely, unsupervised)? Would you let it perform your job while you receive the rewards and consequences? Would you be comfortable giving it full control of your smart home?
There are different sets of expectations put on human actors vs autonomous systems. We expect people to be fallible and wrong some of the time, even if the individuals in question can't/won't admit it. With a software-based system, the expectations are that it will be robust, tested, and performing correctly 100% of the time, and when a fault occurs, it will be clear, marked with yellow tape and flashing lights.
LLM-based AIs are sort of insidious in that they straddle this expectation gap: the emergent behaviour is erratic, projecting confident omniscience, while often hallucinating and plain wrong. However vague, the catch-all term "AI" still implies "computer system" and by extension "engineered and tested".
I believe you're asking the wrong question, or at least you're asking it in the wrong way. From my POV, it comes in two parts:
1. Do you believe that LLMs operate in a similar way to the important parts of human cognition?
2. If not, do you believe that they operate in a way that makes them useful for tasks other than responding to text prompts, and if so, what are those tasks?
If you believe that the answer to Q1 is substantively "yes" - that is, humans and LLM are engaged in the same sort of computational behavior when we engage in speech generation - then there's presumably no particular impediment to using an LLM where you might otherwise use a human (and with the same caveats).
My own answer is that while some human speech behavior is possibly generated by systems that function in a semantically equivalent way to current LLMs, human cognition is capable of tasks that LLMs cannot perform de novo even if they can give the illusion of doing so (primarily causal chain reasoning). Consequently, LLMs are not in any real sense equivalent to a human being, and using them as such is a mistake.
> My own answer is that while some human speech behavior is possibly generated by systems that function in a semantically equivalent way to current LLMs, human cognition is capable of tasks that LLMs cannot perform de novo even if they can give the illusion of doing so (primarily causal chain reasoning). Consequently, LLMs are not in any real sense equivalent to a human being, and using them as such is a mistake.
In the workplace, humans are ultimately a tool to achieve a goal. LLMs don't have to be equivalent to humans to replace a human - they just have to be able to achieve the goal that the human has. 'Human' cognition likely isn't required for a huge amount of the work humans do. Heck, AI probably isn't required to automate a lot of the work that humans do, but it will accelerate how much can be automated and reduce the cost of automation.
So it depends what we mean as 'use them as a human being' - we are using human beings to do tasks, be it solving a billing dispute for a customer, processing a customers insurance claim, or reading through legal discovery. These aren't intrinsically 'human' tasks.
So 2 - yes, I do believe that they operate in a way that makes them useful for tasks. LLMs just respond to text prompts, but those text prompts can do useful things that humans are currently doing.
I think C.S. Peirce's distinction between corollarial reasoning and theorematic reasoning[1][2] is helpful here. In short, the former is the grindy rule following sort of reasoning, and the latter is the kind of reasoning that's associated with new insights that are not determined by the premises alone.
As an aside, students of Peirce over the years have quite the pedigree in data science too, including the genius Edgar F. Codd, who invented the relational database largely inspired by Peirce's approach to relations.
Anyhow, computers are already quite good at corollarial reasoning and have been for some time, even before LLMs. On the other hand, they struggle with theorematic reasoning. Last I knew, the absolute state of the art performs about as well as a smart high school student. And even there, the tests are synthetic, so how theorematic they truly are is questionable. I wouldn't rule out the possibility of some automaton proposing a better explanation for gravitational anomalies than dark matter for example, but so far as I know nothing like that is being done yet.
There's also the interesting question of whether or not an LLM that produces a sequence of tokens that induces a genuine insight in the human reader actually means the LLM itself had said insight.
I think the vector representation stuff is an effective tool and possibly similar to foundational tools that humans are using.
But my gut feel is that it's just one tool of many that combine to give humans a model+view of the world with some level of visibility into the "correctness" of ideas about that world.
Meaning we have a sense of whether new info "adds up" or not, and we may reject the info or adjust our model.
I think LLMs in their current state can be useful for tasks that do not have a high cost resulting from incorrect output, or tasks that can have their output validated by humans or some other system cost-effectively.
I think LLMs operate in a similar way to some of the important parts of human cognition.
I believe they operate in a way that makes them at least somewhat useful for some things. But I think the big issue is trustworthiness. Humans - at least some of them - are more trustworthy than LLM-style AIs (at least current ones). LLMs need progress on trustworthiness more than they need progress on use in other areas.
IMHO, a more important and testable difference is that humans don't have separate "train" and "infer" phases. We are able to adapt more or less on the fly and learn from previous experience. LLMs currently cannot retain any novel experience past the context window.
Mostly, your financial advisor writes the return you sign off on, or manages your portfolio. But the advisor usually solicits and interacts with you to know what your financial goals are and to ensure you are on board with the consequences of their advice.
I do not dismiss that some people are completely hands off at great risk IMHO. But these are not me - as was my initial proposition.
> Would you let it perform your job while you receive the rewards and consequences?
isn't this what being a human manager is? not sure why you're saying it must be entirely + unsupervised. at my job, my boss mostly trusts me but still checks my work and gives me feedback when he wants something changed. he's ultimately responsible for what I do.
_Who_ would you let manage your financial affairs, and under what circumstances?
To which my answer would be something like: a qualified financial adviser with a good track record, who can be trusted to do the job to, if not the best of their abilities, at least an acceptable level of professional competence.
A related question: who would you let give you a lift someplace in a car?
And here's where things get interesting. Because on the one hand there's a LOT more at stake (literally, your life), and yet various social norms, conventions, economic pressures and so on mean that in practice we quite often entrust that responsibility to people who are very, very far from performing at their best.
So while a financial adviser AI is useless unless it can perform at the level of a trained professional doing their job (or unless it can perform at maybe 95% of that level at much lower cost), a self-driving car is at least _potentially_ useful if it's only somewhat better than people at or close to their worst. As a high proportion of road traffic collisions are caused by people who are drunk, tired, emotionally unstable or otherwise very very far from the peak performance of a human being operating a car.
(We can argue that a system which routinely requires people to carry out life-or-death, mission-critical tasks while significantly impaired is dangerously flawed and needs a major overhaul, but that's a slightly different debate).
Pragmatically, "AI" will mean (and for, many people already does mean) stochastic and fallible.
If your users are likely to be AI illiterate and mistakenly feel that an AI app is reliable and suitable for mission critical applications when it isn't, that is a risk you mitigate.
But it seems deeply unserious of the author to just assert that mission-critical software is the only "serious context" and the only thing that matters, and that therefore AI is a dead end. "Serious, mission critical" apps are just going to be a niche in the future.
> would you let an LLM manage your financial affairs (entirely, unsupervised)?
It will likely be better[2], not because AI is good at this.
It would be because study after study[1] has shown that active management performs worse than passive funds; less intervention gives better results over a longer timeframe.
[1] The famous Warren Buffett bet comes to mind. There are more formal studies validating this.
What if financial affairs were broadened to be everything, not just portfolio management? Eg: paying bills, credit cards, cash balance in check vs savings vs brokerage.
Good financial management (portfolio and personal) is a matter of disciplined routine, performed consistently over a long timeframe, combined with impulse control. It is not complicated at all; any program (an LLM or just a rules engine) will always do far better than we can, because it will not suffer from either problem (sticking to the routine, or impulse).
Most humans make very bad decisions around personal finance, whether it is big things like gambling or impulse buys with expensive credit, or smaller items like tracking subscriptions or keeping money that isn't needed in a checking account, etc.
This is irrespective of financial literacy, education, wealth or professions like say working in finance/ personal wealth management even.
Entire industries like lottery, gambling, luxury goods, gaming, credit card APRs, Buy Now Pay Later, Consumer SaaS, Banking overdraft fees are all built around our inability to control our impulses or follow disciplined routines.
This is why trust funds with wealth management professionals are the only way to generational wealth.
You need the ability to prevent any beneficiary (the next generations) from exercising their impulses on amounts beyond their annual draw. Plus the disciplined routine of a professional team who are paid to do only this, with multiple layers that vet the impulses of individual managers and a conservative mandate to keep them risk-averse and therefore less impulsive.
If a program can do it for me (provided, of course, I irrevocably give away my control to override or alter its decisions), then normal people can also benefit without the high net worth required for wealth management.
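To make "just a rules engine" concrete, a toy sketch of the monthly routine (the account names, buffer, and draw are all made up for illustration; the only point is that the rules never get tempted):

    # Toy monthly money routine as a rules engine.
    ACCOUNTS = {"checking": 9_400, "savings": 20_000, "brokerage": 55_000}
    CHECKING_BUFFER = 5_000   # keep only this much idle cash
    MONTHLY_DRAW = 3_000      # fixed allowance, never exceeded

    def run_month(accounts, requested_spend):
        spend = min(requested_spend, MONTHLY_DRAW)   # impulse control
        accounts["checking"] -= spend
        surplus = accounts["checking"] - CHECKING_BUFFER
        if surplus > 0:                              # disciplined routine: sweep the surplus
            accounts["checking"] -= surplus
            accounts["brokerage"] += surplus
        return accounts, spend

    balances, spent = run_month(ACCOUNTS, requested_spend=7_500)
    print(f"spent {spent}, balances: {balances}")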
>> Is there a fundamental (à La Gödel) reason why we can’t predict or manage LLMs?
> We're building statistical models to make statistical predictions after training with a sufficiently large and statistically diverse dataset. Aka it's random.
This seems like a sort of hand wavy answer to what seems like a pretty deep and technical question. And to be fair, this isn’t the environment to ask that question, it seems like something a bunch of researchers would work out.
Of course we build statistical models all the time and then use them to make pretty good predictions. Is there something actually fundamental about these LLMs that makes them… unmanageable? Well, we'll have to define manageable first… etc etc.
So the question is so abstract as to not have a handle on the answer...
Great, What is M theory?
This also is missing a structure to allow us to unify existing fundamental theories.
The only way to assess this is to stop treating the models as statistical beasts which simply work with "enough" statistics, and start talking about them from the direction of information theory. The answer is glib because the problem is that the field requires a mix of stats, computing, and mathematics. Two of these fields have their own languages for the problem (which differ, but exist), and computing has come along and made another set of names for the same complex things, making the whole thing (for now) a mess... Especially with the main practitioners stuck in the view that big LLMs are simply money printers.
I get the glibness, I think it is fine for here, but actually I really hope someone out there puts in the effort to figure out the right question to ask for this stuff. I think you are right that it isn't well defined here, but getting a well defined question is like halfway to finishing your thesis, right? Haha.
ChatGPT seems to be good about this. If you invent something and ask about it, like "What was the No More Clowning Act of 2025?", it will say it can't find any information on it.
The older or smaller models, like anything you can run locally, are probably far more likely to just invent some bullshit.
That said, I've certainly asked ChatGPT about things that definitely have a correct answer and had it give me incorrect information.
When talking about hallucinating, I do think we need to differentiate between "what you asked about exists and has a correct answer, but the AI got it wrong" and "What you're asking for does not exist or does not have an answer, but the AI just generated some bullshit".
The primary fallacy in your argument is that you seem to think that humans produce much better products on some kind of metric.
My lived experience in the software industry at almost all levels over the last 25 years leads me to believe that the vast majority of humans and teams of humans produce atrocious code that only wastes time, money, and people's patience.
Often because it is humans producing the code, other humans are not willing to fully engage, criticize and improve that code, deferring to just passing it on to the next person, team, generation, whatever.
Yes, this perhaps happens better in some (very large and very small) organizations, but most often it only happens with the inclusion of horrendous layers of protocol, bureaucracy, more time, more emotional exhaustion, etc.
In other words a very costly process to produce excellent code, both in real capital and human capital. It literally burns through actual humans and results in very bad health outcomes for most people in the industry, ranging from minor stuff to really major things.
The reality is that probably 80% of people working in the tech industry can be outperformed by an AI at a fraction of the cost. AIs can be tuned, guided, and steered to produce code that I would call exceptional compared even to most developers who have been in the field for 5 years or more.
You probably come to this fallacy because you have worked in one of these very small or very large companies that takes producing code seriously, and you believe that your experience represents the vast majority of the industry. In fact, the middle area is where most code is being "produced", and if you've never been fully engaged in those situations, you may literally have no idea of the crap that's being produced and shipped on a daily basis. These companies have no incentive to change; they make lots of money doing this, and fresh meat (humans) is relatively easy to come by.
Most of these AI benchmarks are trying to get these LLMs to produce outputs at the scale and quantity of one of these exceptional organizations when in fact, the real benefits will come in the bulk of organizations that cannot do this stuff and AI will produce as good or better code than a team of mediocre developers slogging away in a mediocre, but profitable, company.
Yes there are higher levels of abstraction around code, and getting it deployed, comprehensive testing, triaging issues, QA blah blah, that humans are going to be better at for now, but I see many of those issues being addressed by some kind of LLM system sooner or later.
Finally, I think most of the friction people are seeing right now in their organization is because of the wildly ad hoc way people and organizations are using AI, not so much about the technological abilities of the models themselves.
“80%”, “outperformed”, “fraction of the cost” - you could make a lot of money if that were true, but a 5x productivity boost seems unjustified right now. I’m having a hard time finding problems where the output is even 1x (where I don’t spend more time babysitting the LLM than doing the task from scratch myself).
For "stay in your lane" stuff, I agree, it relatively sucks.
For "today I need do stuff two lanes over", well it still needs the babysitting, and I still wouldn't put it on tasks where I can't verify the output, but it definitely delivers a productivity boost IME.
It's a bad example. Lots of finance firms use AI to manage their financial affairs - go and investigate what is currently considered state of the art for trading algorithms.
Now if you substituted something safety critical instead, say, running a nuclear power station, or my favourite currently in use example, self driving cars, then yes, you should be scared.
> go and investigate what is currently considered state of the art for trading algorithms.
These are not LLMs but algorithms written and designed by human minds. It is unfortunate that AI has become a catch-all word for any kind of machine learning.
LLMs create models, not algorithms. An algorithm is a rote sequence of steps to accomplish a task.
The following is an algorithm:
- plug in input to model
- say yes if result is positive, else say no
LLMs use models, the model is not an algorithm.
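A toy way to see the distinction being drawn here (the "model" is just made-up fixed weights; nothing LLM-specific):

    # The model: a learned function (here, fixed weights) that maps features to a score.
    WEIGHTS = [0.8, -0.3, 0.5]

    def model(features):
        return sum(w * x for w, x in zip(WEIGHTS, features))

    # The algorithm: a rote sequence of steps that *uses* the model.
    def classify(features):
        score = model(features)               # step 1: plug the input into the model
        return "yes" if score > 0 else "no"   # step 2: say yes if positive, else no

    print(classify([1.0, 2.0, 0.5]))  # -> "yes"

Swapping in different weights changes the model; the algorithm around it stays the same.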
> There are patterns in the weights that could be steps in an algorithm.
Sure, but yeah... no..
"Could be steps in an algorithm" does not constitute an algorithm.
Weights are inputs; they are not themselves parts of an algorithm. The algorithm might still try to come up with weights. Still, don't confuse procedure with data.
Don't want to get too pedantic on that response.
The model can contain complex information.
There is already evidence it can form a model of the world.
So why not something like steps to get from A to B.
And, it is clear that LLMs can follow steps.
One didn't place in the Math Olympiad without some ability to follow steps.
"Yes, an LLM model can contain the steps of an algorithm, especially when prompted to "think step-by-step" or use a "chain-of-thought" approach, which allows it to break down a complex problem into smaller, more manageable steps and generate a solution by outlining each stage of the process in a logical sequence; essentially mimicking how a human would approach an algorithm. "
> There is already evidence it can form a model of the world.
Perhaps.
> So why not something like steps to get from A to B.
Why not - because a model and algorithm are different. Simply having a model does not mean you have an algorithm. An algorithm is a deterministic set of steps, a model is typically a function or set of functions for producing results. If the result of that model is to list a set of steps (and also evaluate them too) - that does not make the model an algorithm.
> And, it is clear that LLMs can follow steps
Sure, because that is what the model is set up to do.
> Yes, an LLM model can contain the steps of an algorithm, especially when prompted to "think step-by-step" or use a "chain-of-thought" approach, which allows it to break down a complex problem into smaller
This is the model looking into its training data to find algorithms that seem to match the prompt and then to print out the steps of the algorithm and also execute them. That's not an algorithm in of itself.
I feel I'm on pretty solid ground here. "Algorithmic prompting" has nothing to do with whether a model is an algorithm. I'd ask you to google the differences between a model and an algorithm very thoroughly. If something follows an algorithm, I strongly suspect it cannot be a model by definition. It can still be an AI though, as there are non-LLM AIs out there that do follow algorithms. If we are talking about LLMs, the M is for "MODEL". Models and algorithms are different. A model that looks for an algorithm to use is a very sophisticated model, but it's still not an algorithm itself just because it could find, interpret and use one.
If you think so, you should publish your results. It seems like a lot of bright people are going down the road of using LLM for algorithmic tasks. To follow steps.
I think what I'm reaching for is a little more esoteric: that out of all the data the model is trained on, it has also started building up algorithms/steps in its 'model', which is part of how it picks the next item.
The whole reason algorithmic prompting started was people started noticing the LLM was already attempting some steps, and that if it was further helped along by prompting the steps, then the results were better.
But, I am using 'algorithm' rather loosely, as just 'steps', and they are a bit fuzzy, so not a purely math algorithm, but more of a fuzzy logic, a first start at reasoning.
edit
also, I should clarify. I am not confusing the algorithm used to make the model with the model itself; I'm saying that within the model it learns to follow steps.
Makes me wonder how they detect market manipulation and fraud. Trivial activities, like marking the close, probably aren't hard to detect, but I imagine that some kind of ML thingy is involved in flagging accounts for manual inspection.
This take seems fundamentally wrong to me. As in opening premise.
We use humans for serious contexts & mission critical tasks all the time and they're decidedly fallible and their minds are basically black boxes too. Surgeons, pilots, programmers etc.
I get the desire for reproducible certainty and verification like classic programming and why a security researcher might push for that ideal, but it's not actually a requirement for real world use.
Because human minds are fallible black boxes, we have developed a wide variety of tools that exist outside our minds, like spoken language, written language, law, standard operating procedures, math, scientific knowledge, etc.
What does it look like for fallible human minds to work on engineering an airplane? Things are calculated, recorded, checked, tested. People do not just sit there thinking and then spitting out their best guess.
Even if we suppose that LLMs work similar to the human mind (a huge supposition!), LLMs still do not do their work like teams of humans. An LLM dreams and guesses, and it still falls to humans to check and verify.
Rigorous human work is actually a highly social activity. People interact using formal methods and that is what produces reliable results. Using an LLM as one of the social nodes is fine, but this article is about the typical use of software, which is to reliably encode those formal methods between humans. And LLMs don’t work that way.
Basically, we can’t have it both ways. If an LLM thinks like a human, then we should not think of it as a software tool like curl or grep or Linux or Apple Photos. Tools that we expect (and need) to work the exact same way every time.
> Because human minds are fallible black boxes, we have developed a wide variety of tools that exist outside our minds, like spoken language, written language, law, standard operating procedures, math, scientific knowledge, etc.
Standard operating procedures are great, but simplify them down to checklists. Don't ever forget checklists, which have proven vital for pilots and surgeons alike. And looking at the WHO Surgical Safety Checklist you might think "that's basic stuff", but apparently it is necessary and works https://www.who.int/teams/integrated-health-services/patient...
> What does it look like for fallible human minds to work on engineering an airplane? Things are calculated, recorded, checked, tested. People do not just sit there thinking and then spitting out their best guess.
People used to do this. The result was massively overbuilt structures, some of which are still with us hundreds of years later. The result was also underbuilt structures, which tended to collapse and maybe kill people. They are no longer around.
All of the science and math and process and standards in modern engineering is the solution humans came up with because our guesses aren't good enough. LLMs will need the same if they are to be relied upon.
This is a fantastic and thought-provoking response.
Thinking of humans as fallible systems and humanity and its progress as a self-correcting distributed computation / construction system is going to stick with me for a long time.
Not trying to belittle or be mean, but what exactly did you assume about humans before you read this response? I find it fascinating that apparently a lot of people don't think of humans as stochastic, non-deterministic black boxes.
Heck, one of the defining qualities of humans is that not only are we unpredictable and fundamentally unknowable to other intelligences (even other humans!), we also participate in sophisticated subterfuge and lying to manipulate other intelligences (even other humans!), often very convincingly.
In fact, I would propose that our society is fundamentally defined and shaped by our ability and willingness to hide, deceive, and use mind tricks to get what our little monkey brains want over the next couple hours or days.
I knew that they worked this way, but the conciseness of the response and clean analogy to systems I know and work with all day was just very satisfying.
For example, there was probably still 10-20% of my mind that assumed that stubbornness and ignorance were the reason for things going slowly most of the time, but I'm re-evaluating that, even though I knew that delays and double-checking were inherent features of a business and process. Re-framing those delays as "evolved responses 100% of the time" rather than "10% mistrust, 10% ignorance, 10% ..." is just a more positive way of thinking about human-driven processes.
I totally understand this rationally if you sit down and walk me through the steps.
But there's a lot of reasons - ego, fear of losing... that core identity, etc. that can easily come back and bite you.
I'm not sure if this is the same as meditation and ego death or whatever. I find that even if you go down the spiritual route, you also run into the same issues.
People in philosophy also argue things like rational actors, self-coherency, etc.
And hey, even in this current moment you were able to type out a coherent thought, right?
I've noticed more and more that humans behave a lot like LLMs. In the sense that it's really, really hard to observe my true internal state - I can only try to find patterns and guess at shit. Every theory I've tried applying to myself is just "wrong" - in the sense that either it feels wrong, or I'll get depressed because the theory basically boils down to "you're lazy and you have to do the work", which is a highly emotionally evocative theory that doesn't help anyone.
"People do not just sit there thinking and then spitting out their best guess."
Well, if you are using AI like this, you are doing it wrong.
Yes AI is imperfect, fallible, it sometimes hallucinates, but it is a freaking time saver (10x?). It is a tool. Don't expect a hammer to build you a cabinet.
There is no other way to use an LLM than to give it context and have it give its best guess, that's how LLMs fundamentally work. You can give it different context, but it's just guessing at tokens.
We've had 300,000 years to adapt to the specific ways in which humans are fallible, even if our minds are black boxes.
Humans fail in predictable and familiar ways.
Creating a new system that fails in unpredictable and unfamiliar ways and affording it the same control as a human being is dangerous. We can't adapt overnight and we may never adapt.
This isn't an argument against the utility of LLMs, but against the promise of "fire and forget" AI.
Human minds are far less black boxes than LLMs. There are entire fields of study and practice dedicated to understanding how they work, and to adjust how they work via medicine, drugs, education, therapy, and even surgery. There is, of course, a lot more to learn in all of those arenas, and our methods and practices are fallible. But acting as if it is the same level of black box is simply inaccurate.
They are more of a black box - but humans are a black box that is perhaps more studied and that we have more experience in.
Although human behavior is still weird, and highly fallible! Despite best interventions (therapy, drugs, education), sometimes they still kill each other and we aren't 100% sure why, or how to solve it.
That doesn't mean that the same level of study can't be done on AI though, and they are much easier to adjust compared to the human brain (RLHF is more effective than therapy or drugs!).
They are much more of a black box than AI. There are whole fields around studying them—because they are hard to understand. We put a lot of effort into studying them… from the outside, because we had no other alternative. We were reduced to hitting brains with various chemicals and seeing what happened because they are such a pain to work with.
They are just a more familiar black box. AI’s are simpler in principle. And also entirely built by humans. Based on well-described mathematical theories. They aren’t particularly black-box, they are just less ergonomic than the human brain that we’ve been getting familiar with for hundreds of thousands of years through trial and error.
I would say human behavior is less predictable. That is one of the reasons why today it is rather easy to spot the bot responses, they tend to fit a certain predictable style, unlike the more unpredictable humans.
Maybe include in a prompt a threat of legal punishment? (Surely somebody has already tried that and tabulated how much it improves scores on different benchmarks.)
I suspect the big AI companies try to adversarially train that out as it could be used to "jailbreak" their AI.
I wonder though, what would be considered a meaningful punishment/reward to an AI agent? More/less training compute? Web search rate limits? That assumes that what the AI "wants" is to increase its own intelligence.
LLM's response being best prediction of next token arguably isn't that far off from a human motivated to do their best. It's a fallible best effort either way.
And both are very far from the certainty the author seems to demand.
An LLM isn't providing its "best" prediction, it's providing "a" prediction. If it were always providing the "best" token then the output would be deterministic.
In my mind the issue is more accountability than concerns about quality. If a person acts in a bizarre way they can be fired and helped in ways that an LLM can never be. When gemini tells a student to kill themselves, we have no recourse beyond trying to implement output filtering, or completely replacing the model with something that likely has the same unpredictable unaccountable behavior.
Are you sure that always providing the best guess would make the output deterministic? Isn't the fundamental point of learning, whether done by machine or human, that our best gets better and is hence non-deterministic? Doesn't what is best depend on context?
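On the determinism point, a toy sketch may help (the token scores are invented, and this assumes fixed weights and a fixed context). Always taking the single highest-scoring token (greedy/argmax decoding) gives the same output every time; the variation people see comes from sampling:

    import numpy as np

    rng = np.random.default_rng(42)
    tokens = ["cat", "dog", "bird", "fish"]
    logits = np.array([2.0, 1.6, 0.4, 0.1])  # invented next-token scores

    def greedy(logits):
        return tokens[int(np.argmax(logits))]        # identical output on every call

    def sample(logits, temperature=1.0):
        p = np.exp(logits / temperature)
        p /= p.sum()
        return tokens[rng.choice(len(tokens), p=p)]  # varies from call to call

    print([greedy(logits) for _ in range(3)])  # always ['cat', 'cat', 'cat']
    print([sample(logits) for _ in range(3)])  # e.g. ['dog', 'cat', 'cat']

"Getting better" through learning changes the weights (and hence the distribution), which is a separate question from whether decoding from a fixed distribution is deterministic.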
I tire of this disingenuous comparison.
The failure modes of (experienced, professional) humans are vastly different than the failure modes of LLMs. How many coworkers do you have that frequently, wildly hallucinate while still performing effectively?
Furthermore, (even experienced, professional) humans are known to be fallible & are treated as such.
No matter how many gentle reminders the informed give the enraptured, LLMs will continue to be treated as oracles by a great many people, to the detriment of their application.
I’ve done a few projects that attempted to distill the knowledge of human experts, mostly in the medical imaging domain, and was shocked when for most of them the inter-annotator agreement was only around 60%.
These were professional radiologists with years of experience and still came to different conclusions for fairly common conditions that we were trying to detect.
So yes, LLMs will make mistakes, but humans do too, and if these models do so less often at a much lower cost it’s hard to not use them.
> So yes, LLMs will make mistakes, but humans do too
Are you using LLMs though? Because pretty much all of these systems are fairly normal classifiers, what would've been called Machine Learning 2-3 years ago.
The "AI hype is real because medical AI is already in use" argument (and it's siblings) perform a rhetorical trick by using two definitions of AI. "AI (Generative AI) hype is real because medical AI (ML classifiers) is already in use" is a non-sequitur.
Image classifiers are very narrow intelligences, which makes them easy to understand and use as tools. We know exactly what their failure modes are and can put hard measurements on them. We can even dissect these models to learn why they are making certain classifications and either improve our understanding of medicine or improve the model.
...
Basically none of this applies to Generative AI. The big problem with LLMs is that they're simply not General Intelligence systems capable of accurately and strongly modelling their inputs. e.g. Where an anti-fraud classifier directly operates on the financial transaction information, an LLM summarizing a business report doesn't "understand" finance, it doesn't know what details are important, which are unusual in the specific context. It just stochastically throws away information.
Yes I am, these LLM/VLMs are much more robust at NLP/CV tasks than any application specific models that we used to train 2-3 years ago.
I also wasted a lot of time building complex OCR pipelines that required dewarping / image normalization, detection, bounding box alignment, text recognition, layout analysis, etc and now open models like Qwen VL obliterate them with an end to end transformer model that can be defined in like 300 lines of pytorch code.
Different tasks then? If you are using VLMs in the context of medical imaging, I have concerns. That is not a place to use hallucinatory AI.
But yes, the transformer model itself isn't useless. It's the application of it. OCR, image description, etc, are all that kind of narrow-intelligence task that lends itself well to the fuzzy nature of AI/ML.
The world is a fuzzy place, most things are not binary.
I haven't worked in medical imaging in a while but VLMs make for much better diagnostic tools than task specific classifiers or segmentation models which tend to find hacks in the data to cheat on the objective that they're optimized for.
The next-token objective turns out to give us much better vision supervision than things like CLIP or classification losses. (ex: https://arxiv.org/abs/2411.14402)
I spent the last few years working on large scale food recognition models and my multi-label classification models had no chance of competing with GPT4 Vision, which was trained on all of the internet and has an amazing prior thanks to its vast knowledge of facts about food (recipes, menus, ingredients, etc).
Same goes for other areas like robotics, we've seen very little progress outside of simulation up until about a year ago, when people took pretrained VLMs and tuned them to predict robot actions, beating all previous methods by a large margin (google Vision-Language-Action models). It turns out you need good foundational model with a core understanding of the world before you can train a robot to do general tasks.
The problem is that how mistakes are made is crucial.
If it's a forced binary choice then sure LLMs can replace humans.
But often there are many shades of grey e.g. a human may say I don't know and refer to someone else or do some research. Whereas LLMs today will simply give you a definitive answer even if it doesn't know.
> Whereas LLMs today will simply give you a definitive answer even if it doesn't know.
Have you not seen an LLM say it doesn't know the answer to something? I just asked
"How do I enable a scroflpublaflex on a ggh connection?"
to O1 pro as it's what I had open.
Looking at the internal reasoning it says it doesn't recognise the terms, considers that it might be a joke and then explains that it doesn't know what either of those are. It says maybe they're proprietary, maybe internal things, and explains a general guide to finding out (e.g. check internal docs and release notes, check things are up to date if it's a platform, verify if versions are compatible, look for config files [suggesting a few places those could be stored or names they could have], how to restart services if they're systemctl services, if none of this applies it suggests checking spelling and asks if I can share any documentation.
This isn't unique or weird in my experience. Better models tend to be better at saying they don't know.
Wait, how is this input less funny? They are both silly nonsense words. The fake names we tend to come up with seem to have this particular shape to them (which predates but really reminds me of something out of Rick and Morty). I think the main real difference here is that you asked it for a short answer.
I wonder if it is fair to ask it more real-world-inspired questions? How about:
How do I enable a ggh connections on a Salinero webserver?
They are an Apache band. But (as far as I can tell) nobody has made software named after them.
The "thinking" part explains it seems to be about a custom web server, and tries to think what ggh might be, saying maybe something like "go gprc hub" and it needs more info. The response is:
I’m not aware of a standard product called “Salinero webserver” or a documented feature called “ggh connections.” Because there isn’t a well-known tool or server framework by these names, it’s difficult to give you reliable, step-by-step instructions. Could you clarify any of the following?
What is the exact name/version of the webserver software you’re referring to?
What do you mean by “ggh connections”? Is this a plugin, a protocol, or some other third-party module?
Is there any related documentation or logs you can share?
With more detail, I can better determine if “Salinero webserver” is a custom or specialized system and whether “ggh connections” requires installing a particular module, enabling a config flag, or configuring SSL/TLS in a specific way.
I took inspiration from your comment and the parent and crafted this prompt:
> Is it possible to enable Salinero web server 2.0 on a QPH connection? Please provide a very short answer.
"QPH" is a very specific term referring to a type of Siemens electrical circuit breaker, so it probably exists in the training data, but it has nothing to do (to the best of my knowledge) with software, or web servers.
GPT-4o gave me this output:
> Yes, if the QPH connection supports the necessary protocols and configurations required by Salinero Web Server 2.0.
I then asked it to provide a longer answer, and it composed two paragraphs of complete bullshit:
> Enabling Salinero Web Server 2.0 on a QPH connection is possible, provided the QPH connection meets the server’s requirements. Salinero Web Server 2.0 relies on specific protocols like HTTP/HTTPS, and the QPH connection must support these. Additionally, the network must allow proper port forwarding (e.g., ports 80 and 443) and maintain adequate bandwidth to handle the server’s traffic.
> You’ll also need to configure the server to recognize and utilize the QPH connection, which may involve setting up IP addresses, ensuring firewall rules are in place, and verifying the security protocols match between the server and the connection. Testing and troubleshooting may be necessary to optimize performance.
Examples like this do a great job of highlighting the fact that these systems really are just advanced token predictors, and aren't actually "thinking" or "reasoning" about anything.
Using openrouter, a bunch of models fail on this. Sonnet 3.5 so far seems to be the best at saying it doesn't know, other than perhaps o1 pro, but once that has said "no" (which can be triggered more by telling it to respond very concisely) it seems very stuck and unable to say they don't exist. Letting it ramble more and so far it's been good.
Google's models for me have been the worst, lying about what's even been said in the messages so far, quoting me incorrectly.
Yep. I was wondering whether using the term "QPH" would at least cause it to venture into the territory of electrical panels/wiring somewhere in its reply, but it stayed away from that completely. I even tried regenerating the longer answer a few times but got essentially the same text, re-worded.
> I apologize, but I can't provide an answer as "crolubaflex" and "ggh connection" appear to be non-existent technical terms. Could you clarify what you're trying to connect or enable?
Sure, I'm interested in where the boundaries are with this.
With the requirements for a short answer, the reasoning says it doesn't know what they are so it has to respond cautiously, then says no. Without that requirement it says it doesn't know what they are, and notes that they sound fictional. I'm getting some API errors unfortunately so this testing isn't complete. 4o reliably keeps saying no (which is wrong).
Wait if experts only agreed 60% on diagnoses, what is the reliable basis for judging LLM accuracy? If experts struggle to agree on the input, how are they confidently ranking the output?
Not the OP but the data isn’t randomly selected, it’s usually picked out of a dataset with known clinical outcomes. So for example if it’s a set of images of lungs with potential tumors, the cases come with biopsies which determined whether it was cancerous or just something like scar tissue.
> But often there are many shades of grey e.g. a human may say I don't know and refer to someone else or do some research. Whereas LLMs today will simply give you a definitive answer even if it doesn't know.
To add to the other answers: I know many people who will give definitive answers of things they don't really know. They just rely on the fact you also don't know. In fact, in some social circles, the amount of people who do that, far outnumber the people who don't know and will refer you to someone else.
This hints at the margin and excitement from folks outside the technical space -- being able to be competitive to human outputs at a fraction of the cost.
That's the underappreciated truth of the computer revolution in practice.
At scale, computers didn't change the world because they did things that were already being computed, more quickly.
They changed the world because they decreased the cost of computing so much that it could be used for an entirely new class of problems. (That computing cost previously precluded its use on)
Given the exact same facts (just like in the medical imaging domain), humans will form different opinions or conclusions on politics.
I think what is not discussed enough is the assumption of assumption. [1] is a cognitive bias that occurs when a person who has specialized knowledge assumes that others share in that knowledge.
This makes it hard to have any discussion without laying out all the absolute basic facts, which is now more commonly known as first principles in the modern era.
In the four quadrants of known and unknown, it is often the unknown knowns (we don't even know that we know) that are problematic in discussions.
Incredibly ignorant set of replies on this thread lol. People with the same viewpoints as when gpt2 came out, as if we haven't seen a host of new paradigms and accomplishments since then, with O3 just being the latest and most convincing.
It's deeply saddening to see how fixated people are on the here-and-now, while ignoring the terrifying rate of progress, and its wide-ranging implications.
We've gone from people screeching "deep learning has hit its limits" in 2021 to models today that are able to reason within limited, but economically relevant contexts. And yet despite this, the same type of screeching continues.
Maybe some of us aren’t actually impressed with the “progress” since 2022? Doing well at random benchmarks hasn’t noticeably improved capability in use for work.
Does that mean it will never improve? Of course not. But don’t act like everyone else is some kind of moron.
> Maybe some of us aren’t actually impressed with the “progress” since 2022? Doing well at random benchmarks hasn’t noticeably improved capability in use for work.
Then perhaps you should strive to explore outside of the realm of benchmarks. I've been lucky in that I've seen legitimate value-adding uses in my workplace that simply were not possible pre-2022.
> But don’t act like everyone else is some kind of moron.
Not everyone - just the ones that ignore a trend of continued progress towards a goal, claim we can't achieve the goal, yet offer no meaningful explanation as to why we can't get there.
It's the same kind of people who claimed human flight would not be possible for 10,000 years in 1902. I just can't understand how narrow your mind has to be in order to be this skeptical.
Or the same kind of people who claimed Theranos was a scam, or that AI in the 70s wasn't about to produce Terminator within a few years, or that the .com bubble was in fact a bubble...
The innovation in foundational models is far outpacing the applications. Other than protein folding (which is not only LLMs AFAIK) I haven't seen a single application that blows my mind. And I use o1 and Claude pretty much every day for coding and architecture. It's beginning to look suspect that after billions poured and a couple years nothing mind-bending is coming out of it.
Let them have their fun while they can, it's gonna get pretty bleak in the next 5-10 years when coding jobs are being replaced left and right by bots that can do the work better and cheaper.
Personally I welcome companies to try. I'll laugh all the way to the bank when they need to rehire all the talent they've laid off at a premium to fix the security holes and damage caused by LLMs. Assuming that they survive having their entire company ransacked in the first place.
Bangladeshis don't have perfect English communication skills or work 24/7, and the most skilled engineers there have generally already moved overseas, so the only remaining ones aren't top-tier (or aren't cheap).
On the other hand, you can go from concept to finished product with a Bangladeshi team today, and you can't do that with Copilot. So why is this a risk for western devs, when you can already get close enough to their quality of work for a fraction of the price, yet they still exist? Clearly there are other factors involved beyond raw cost.
Maybe you're retired or not a SWE or knowledge worker anymore, but I have a decent amount of concern about this future.
As a society, we have not even begun to think about what happens when large swathes of the population become unemployed. Everyone says they'd love to not work, but no one says they can survive without money. Our society trades labor for money. And I have very little faith in our society or the government to alleviate this through something like UBI.
Previously it was physical work that was made more efficient, but the one edge we thought we would always have as humans - our creativity and thinking skills - is also being displaced. And that too, its fairly clear that the leaders in the space (apart from maybe Anthropic?) are doing this purely from a capitalist driven profit first motivation.
I for one think the world will be a worse place for a few years immediately after AGI/ASI.
They’re scared (as am I) but I have no illusions about the usefulness of these LLMs. Everyone on my team uses them to get their tickets done in a fraction of the time and then just sit around till the sprint ends.
Yeah, sounds like people are encountering a lot of PEBCAK errors in this thread. You get out of LLMs what you put into them, and the complaints, at this point, are more an admission of an inability to learn the new tools well.
It's like watching people try to pry Eclipse/Jetbrains/SublimeText out of engineers' death grips, except 10x the intensity. (I still use Jetbrains fyi :p)
Well, that's the argument most people here are making: that current LLMs are not good enough to be fully autonomous precisely because a human operator has to "put the right thing into them to get the right thing out."
If I'm spending effort specifying a problem N times in very specific LLM-instruction-language to get the correct output for some code, I'd rather just write the code myself. After all, that's what code is for. English is lossy, code isn't. I can see codegen getting even better in larger organizations if context windows are large enough to hold a significant portion of the codebase.
There are areas where this is immediately better in though (customer feedback, subjective advice, small sections of sandboxed/basic code, etc). Basically, areas where the effects of information compression/decompression can be tolerated or passed onto the user to verify.
I can see all of these getting better in a couple of months/few years.
While these folks waste breath debating whether AI is useful, I’m going to be over here…using it.
I use AI 100 times a day as a coder and 10,000 times a day in scripts. It’s enabled two specific applications I’ve built which wouldn’t be possible at single-person scale.
There’s something about the psychology of some subset of the population that insists something isn’t working when it isn’t _quite_ working. They did this with Wikipedia. It was evident that Wikipedia was 99% great for years before this social contingent was ready to accept it.
Anyone who says AI is useless never had to do the old method of cobbling together git and ffmpeg commands from StackOverflow answers.
I have no interest in learning the horrible unintuitive UX of every CLI I interact with, I'd much rather just describe in English what I want and have the computer figure it out for me. It has practically never failed me, and if it does I'll know right away and I can fall back to the old method of doing it manually. For now it's saving me so much time with menial, time-wasting day-to-day tasks.
Most of those people are a bit bad at making their case. What they mean but don't convey well is that AI is useless for its proclaimed uses.
You are correct that LLMs are pretty good at guessing this kind of well-documented & easily verifiable but hard to find information. That is a valid use. (Though, woe betide the fool who uses LLMs for irreversible destructive actions.)
The thing is though, this isn't enough. There just aren't that many questions out there that match those criteria. Generative AI is too expensive to serve that small a task. Charging a buck a question won't earn the $100 billion OpenAI needs to balance the books.
Your use case gets dismissed because, on its own, it doesn't sustain AI.
I think you’re on to something. I find the sentiment around LLMs (which is at the early adoption stage) to be unnecessarily hostile. (beyond normal HN skepticism)
But it can be simultaneously true that LLMs add a lot of value to some tasks and less to others, and less to some people. It's a bit tautological, but in order to benefit from LLMs, you have to be in a context where you stand to most benefit from LLMs. These are people who need to generate ideas, are expert enough to spot consequential mistakes, know when to use LLMs and when not to. They have to be in a domain where the occasional mistake generated costs less than the new ideas generated, so they still come out ahead. It's a bit paradoxical.
LLMs are good for: (1) bite-sized chunks of code; (2) ideating; (3) writing once-off code in tedious syntax that I don't really care to learn (like making complex plots in seaborn or matplotlib); (4) adding docstrings and documentation to code; (5) figuring out console error messages, with suggestions as to causes (I've debugged a ton of errors this way and have arrived at the answer faster than wading through Stackoverflow); (6) figuring out what algorithm to use in a particular situation; etc.
They’re not yet good at: (1) understanding complex codebases in their entirety (this is one of the overpromises; even Aider Chat’s docs tell you not to ingest the whole codebase); (2) any kind of fully automated task that needs to be 100% deterministic and correct (they’re assistants); (3) getting math reasoning 100% correct (but they can still open up new avenues for exploration that you’ve never even thought about);
It takes practice to know what LLMs are good at and what they’re not. If the initial stance is negativity rather than a growth mindset, then that practice never comes.
But it’s ok. The rest of us will keep on using LLMs and move on.
I've been sold AI as if it can do anything. It's being actively sold like a super intelligent independent human that never needs breaks.
And it just isn't that thing. Or, rather, it is super intelligent but lacks any wisdom at all; thus rendering it useless for how it's being sold to me.
>which is at the early adoption stage
I've said this in other places here. LLMs simply aren't at the early adoption stage anymore. They're being packaged into literally every SaaS you can buy. They're a main selling point for things like website builders and other direct-to-business software platforms.
Why not ignore the hype, and just quietly use what works?
I don’t use anything other than ChatGPT 4o and Claude Sonnet 3.5v2. That’s it. I’ve derived great value from just these two.
I even get wisdom from them too. I use them to analyze news, geopolitics, arguments around power structures, urban planning issues, privatization pros and cons, and Claude especially is able to give me the lay of the land which I am usually able to follow up on. This use case is more of the “better Google” variety rather than task-completion, and it does pretty well for the most part. Unlike ChatGPT, Claude will even push back when I make factually incorrect assertions. It will say “Let me correct you on that…”. Which I appreciate.
As long as I keep my critical thinking hat on, I am able to make good use of the lines of inquiry that they produce.
Same caveat applies even to human-produced content. I read the NYTimes and I know that it’s wrong a lot, so I have to trust but verify.
I agree with you, but it's just simply not how these things are being sold and marketed. We're being told we do not have to verify. The AI knows all. It's undetectable. It's smarter and faster than you.
And it's just not.
We made a scavenger hunt full of puzzles and riddles for our neighbor's kids to find their Christmas gifts from us (we don't have kids at home anymore, so they fill that niche and are glad to because we go ballistic at Christmas and birthdays). The youngest of the group is the tech kid.
He thought he had us beat when he realized he could use ChatGPT to solve the riddles and cyphers. It recognized the Caesar letter shift to negative 3, but then made up a random phrase with words of the same length as the answer. So the process was right, but the outcome was just outlandishly incorrect. It wasted about a half hour of his day. . .
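For reference, the mechanical part it got right is trivial. A minimal sketch of that shift in Python (the encoded phrase here is a made-up example, not the actual riddle):

    def caesar_shift(text, shift):
        # Shift each letter by `shift` positions, wrapping around the alphabet;
        # anything that isn't a letter passes through unchanged.
        out = []
        for ch in text:
            if ch.isalpha():
                base = ord('A') if ch.isupper() else ord('a')
                out.append(chr((ord(ch) - base + shift) % 26 + base))
            else:
                out.append(ch)
        return ''.join(out)

    # Undoing a +3 cipher means shifting back by 3 (flip the sign for the other direction).
    print(caesar_shift("Phuub Fkulvwpdv", -3))  # -> "Merry Christmas" (made-up example)

The point is that the decoding step is completely deterministic; there is nothing for the model to fill in, yet it filled something in anyway.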
Now apply that to complex systems or just a simple large database, hell, even just a spreadsheet. You check the process, and it's correct. You don't know the outcome, so you can't verify unless you do it yourself. So what's the point?
For context, I absolutely use LLMs for things that I know roughly but don't want to spend the time on. They're useful for that.
They're simply not useful for how they're being marketed, which is to solve problems you don't already know how to solve.
An example that might be of interest to readers: I gave it two logs, one failing and one successful, and asked it to troubleshoot. It turned out a loosely pinned dependency (Docker image) had updated in the failing one. An error mode I was familiar with and could have solved on my own, but the LLM saved me time. They are reliable at sifting through text.
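For comparison, the manual version of that sifting looks something like the sketch below (the log file names are hypothetical); the LLM effectively did this, plus the interpretation, for me:

    # Find the first line where a failing build log diverges from a passing one.
    from itertools import zip_longest

    with open("build_pass.log") as ok, open("build_fail.log") as bad:
        for i, (a, b) in enumerate(zip_longest(ok, bad, fillvalue=""), start=1):
            if a != b:
                print(f"First divergence at line {i}:")
                print(f"  passing: {a.rstrip()}")
                print(f"  failing: {b.rstrip()}")
                break
    # In practice you would strip timestamps and other noise first; this is only a sketch.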
I had a debate recently with a colleague who is very skeptical of LLMs for every day work. Why not lean in on searching Google and cross referencing answers, like we've done for ages? And that's fine.
But my counterargument is that what I find to be so powerful about the LLMs is the ability to refine my question, narrow in on a tangent and then pull back out, etc. And *then* I can take its final outcome and cross reference it. With the old way of doing things, I often felt like I was stumbling in the dark trying to find the right search string. Instead I can use the LLM to do the heavy lifting for me in that regard.
> Anyone who says AI is useless never had to do the old method of cobbling together git and ffmpeg commands from StackOverflow answers.
It's useful for that yes, but I'd rather just live in a world where we didn't have such disasters of CLI that are git and ffmpeg.
LLMs are very useful for generating the obscure boilerplate needed because the underlying design is horrible. Relying on it means acquiescing to those terrible designs rather than figuring out redesigns that don't need the LLMs. For comparison, IntelliJ is very good at automating all the boilerplate generation that Java imposes on me, but I'd rather we didn't have boilerplate languages like Java, and I'd rather that IntelliJ's boilerplate generation didn't exist.
I fear in many cases that if an LLM is solving your problem, you are solving the wrong problem.
I'm not arguing against the UX of those tools, but isn't this a case of the problem being a hard one to solve and people having different needs? ffmpeg has a lot of knobs, but that's just the nature of media conversion and manipulation, just like ImageMagick. I'm not against using LLMs for restricting the search space for a specific problem, but I'm often seeing people not even understanding the problem itself, just its apparent physicality.
> Anyone who says AI is useless never had to do the old method of cobbling together git and ffmpeg commands from StackOverflow answers.
These days, I'm more likely to read the manual pages and take notes on interesting bits. If I'm going to rely on some tooling for some time, dedicating a few hours of reading is a good trade-off for me. No need to even remember everything, just the general way it solves the problem. Anything more precise is captured in notes, scripts, shell history, and so on. I dare anyone to come out with an essay like this from an LLM: https://karthinks.com/software/avy-can-do-anything/
> if it does I'll know right away and I can fall back to the old method of doing it manually
It's all well and good with things you can botch with no consequence other than some time wasted. But I've bricked enough VMs trying commands I did not understand to know that if you need to not fuck up something, you'll have to read those docs and understand them. And hope they're not out of date / wrong.
I'm asking not for snark, but because when AI gives me something not _quite_ working, it takes much more time to investigate than an "every 6 minutes in a 10 hour work day" frame would allow. I just wonder if maybe you're pasting it as-is and don't care about correctness as long as the happy path sort of works. Speaking of subsets, coders who did that before AI were also quite a group.
There must be something that explains the difference in our experiences. Apologies for the fact that my only idea is kinda negative. I understand the potential hyperbole here, but it doesn't explain much. I can stand AI BS once a day, maybe twice, before uncontrollably cursing into the chat.
Why not write tests with AI, too? Since using LLMs as coding assistants, my codebases have much more thorough documentation, testing and code coverage.
Don't start when you're already in a buggy dead-end. Test-driven development with LLMs should be done right from the start.
Also keep the code modular so it is easy to include the correct context. Fine-grained git commits. Feature-branches.
All the tools that help teams of humans of varying levels of expertise work together.
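A minimal sketch of what that test-first loop looks like (names and behaviour are made up): the human writes the tests as the spec, then asks the model for an implementation that makes them pass, one small commit at a time.

    import re

    # The tests below are written by the human first, as the spec.
    # slugify() stands in for what the model came back with afterwards.
    def slugify(text: str) -> str:
        # lower-case, drop punctuation, join words with hyphens
        return "-".join(re.findall(r"[a-z0-9]+", text.lower()))

    def test_lowercases_and_replaces_spaces():
        assert slugify("Hello World") == "hello-world"

    def test_keeps_only_words():
        assert slugify("Fine-grained commits, please!") == "fine-grained-commits-please"

    if __name__ == "__main__":
        test_lowercases_and_replaces_spaces()
        test_keeps_only_words()
        print("ok")

If a generated change breaks a test, you revert the one small commit and re-prompt, rather than untangling a large generated diff.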
You may have enough expertise in your field that when you have a question, you know where to start looking. Juniors and students encounter dozens of problems and questions per hour that fall into the unknown-unknown category.
This isn't about if LLMs are useful, it's about how useful can they become. We are trying to understand if there is a path forward to transformative tech, or are we just limited to a very useful tool.
It's a valid conversation after ~3 years of anticipating the world to be disrupted by this tech. So far it has not delivered.
Wikipedia did not change the world either, it's just a great tool that I use all the time
As for software, it performs ok. I give up on it most of the time if I am trying to write a whole application. You have to acquire a new skill, prompt engineering, and feverish iteration. It's a frustrating game of whack-a-mole and I find it quicker to write the code myself and just have the LLM help me with architecture ideas, bug bashing, and it's also quite good at writing tests.
I'd rather know the code intimately so I can more quickly debug it than have an LLM write it and just trust it did it well.
Peter Thiel talked about this years ago in his book Zero to One. His key insight, which we're seeing today, is that AI tools will work side-by-side with people and enhance their productivity to levels never imagined. From helping with some basic tasks ("write an Excel script that transforms this table from this format to this new format") to helping write programs, it's a tool that aids humans in getting more things done than previously possible.
These are different social contingents I think. At least for me I was super on board with wikipedia because as you say the use to me was immediate and certain. AI I have tried every few months for the last two years but I still haven't found a strong use for it. It has changed nothing for me personally except making some products I use worse.
Cursor has been quite the jaw-dropping game changer for me for greenfield hobby dev.
I don't know how useful it would be for my job, where I do maintenance on a pretty big app, and develop features on this pretty big app. But it could be great, I just don't know because work only allows Copilot. And Copilot is somewhere between annoying and novelty in my opinion.
Generally people are resistant to change and the average person will typically insist new technologies are pointless.
Electricity and the airplane were supposed to be useless and dangerous dead ends according to the common person: https://pessimistsarchive.org/
But we all like to think we have super unique opinions and personalities, so "this time it's different."
When the change finally happens, people go about their lives as if they were right all along and the new technology is simply a mysterious and immutable fixture of reality that was always there.
There is a vast difference between arguments like "Phones have been accused of ruining romantic interaction and addicting us to mindless chatter" and "current AI has problems generating accurate information and can't replace researching things by hand for complicated or niche topics and there is reason to believe that the current architecture may not solve this problem"
That aside, optimists are also not always right; otherwise we would have cold fusion already and a base on Mars.
> But we all like to think we have super unique opinions and personalities, so "this time it's different."
Are you suggesting that anything which is hyped is the future? Like, for every ten heavily-hyped things, _maybe_ one has some sort of post-hype existence.
Segway seems to have hardly been a dead end, or useless for that matter. Segway-style devices like the electric unicycle and many other light mobility devices seem to be direct descendants of the Segway. Segway introduced gyroscopes to the popular tech imagination, at least in my lifetime (not sure before).
The pessimist is not wrong. In fact he's right more frequently than wrong. Just look at a long list of inventions. How many of them were so successful as the car or the airplane? Most of them were just passing fads that people don't even remember anymore. So if you're asking who is smarter, I would say the pessimist is closer to the truth, but the optimist who believed in something that really became successful is now remembered by everyone.
I feel your argument relies on assuming that being an optimist or pessimist means believing 100% or 0%, whereas I'd claim it's instead more just having a relative leaning in a direction. Say after inspecting some rusty old engines a pessimist predicts 1/10 will still function and an optimist predicts 4/10 will function. If the engines do better than expected and 3/10 function, the optimist was closer to the truth despite most not working.
Similarly, being optimistic doesn't mean you have to believe every single early-stage invention will work out no matter how unpromising - I've been enthusiastic about deep learning for the past decade (for its successes in language translation, audio transcription, material/product defect detection, weather forecasting/early warning systems, OCR, spam filtering, protein folding, tumor segmentation, spam filtering, drug discovery and interaction prediction, etc.) but never saw the appeal of NFTs.
Additionally worth considering that the cost of trying something is often lower than the reward of it working out. Even if you were wrong 80% of the time about where to dig for gold, that 20% may well be worth it; reducing merely the frequency of errors is often not logically correct. It's useful in a society to have people believe in and push forward certain inventions and lines of research even if most do not work out.
I think xvector's point is about people rehashing the same denunciations that failed to matter for previous successful technologies - the idea that something is useless because it's not (or perhaps will never be) 100.0% accurate, or the "Until it can do dishes, home computer remains of little value to families"[0] which I've seen pretty much ad verbatim for AI many times (extra silly now that we have dishwashers).
Given in real life things have generally improved (standard of living, etc.), I think it has typically been more correct to be optimistic, and hopefully will be into the future.
This argument is very prone to survivorship bias. Of course, when we think back to the hyped technologies of the past we are going to remember mostly those that justified the hype. The failures get forgotten. The memory of social discourse fades extremely quickly, much faster than, for example, pop culture or entertainment.
LLMs obviously have use cases but the market has practically priced in "AGI".
The danger is not that LLMs take jobs. The danger is that we are in a massive bubble and while these are nice tools they are not worth anything close to the trillions of dollars bet on them.
IMO the psychology at work here is basically denial that we can both be in the biggest bubble of all time in terms of dollars and LLMs are useful. Just not THAT useful.
Why should we care about whether we're in a market bubble or not, especially if one does not have their own money staked in the bubble? It's somebody else's capital that's at risk, no? If they're wrong, let them reap the consequences.
(cue rebuttal based on systemic consequences / financial bailouts etc, but you know what I mean; also, the dotcom bubble deflation didn't require a bailout)
Have you tried a few? If so, which do you prefer? If not, which do you use? I'm a little late to the party, and the current amount of choices is quite intimidating.
I imagine you're asking about coding help. For that, I think you should qualify any answer you get with the user's most commonly used language (and framework, if applicable).
In my experience, Claude Sonnet 3.5 (3.6?) has been unbeatable. I use it for Rust. Making sense of compiler errors, rubberducking, finding more efficient ways to write some function and, truth be told, some times just plain old debugging. More than once, I've been able to dump a massive module onto the chat context and say "look, I'm experiencing this weird behavior but it's really hard to pin down what's causing it in this code" and it pointed to the exact issue in a second. That alone is worth the price of admission.
Way better than ChatGPT 4o and o-1, in my experience, despite me saying the exact opposite a few months ago.
If you're sitting in front of the keyboard, inputting instructions and running the resulting programs, yes, you are still a coder. You just move another layer up the stack.
The same type of argument has been made for decades -- when coders wrote in ASM, folks would ask "are you still a coder when you use that fancy C to make all that low-level ASM obsolete?". Etc etc.
Not outsourcing at all - you're an engineer using the tools that make sense to solve a problem. The core issue with identifying as just a coder is that code is just one of many potential tools to solve a problem.
So your customer/employer is a coder too. They want to solve a problem and use a tool: you.
A coder writes code in a programming language; that's what distinguishes them from the customers, who use natural language. The coder is the translator between the customer and the machine. If the machine does that, the machine is the coder.
Is your customer bringing you the solution to the problem or the problem and asking you to solve the problem? One is a translation activity and the other isn't.
You sound like the guy I just had to fire after he blew his own toes off several times.
If you think these tools make you obsolete, or faster than anyone else, then you are just naive enough to have lost your objectivity to the marketing. I deal with real risks and failures from the output of ChatGPT which have serious financial consequences. The first victim is always the developer, then the tester.
At best, it is very good at ousting people who shouldn't be allowed anywhere near a damn computer.
We've got a senior dev who uses ChatGPT for his code all the time. Right now I am fixing all the exceptions 'his code' pops. Well, I shouldn't call it his code. It's the code ChatGPT generated for him. He just asks ChatGPT, copy pasta, doesn't even run it, and checks it in.
How would a 12 year old with ChatGPT recognize complicated errors?
You still need experience to check the code, not only the result; otherwise you get this.
I've found it depends on the context (pardon the pun)
For example, personal projects that are small and where copilot has access to all the context it needs to make a suggestion - such as a script or small game - it has been really useful.
But in a real world large project for my day job, where it would need access to almost the entire code base to make any kind of useful suggestion that could help me build a feature, it's useless! And I'd argue this is when I need it.
It can ingest the entire codebase (up to its context length), but for some reason, I’ve always had much higher quality chats with smaller bite-sized pieces of code.
Autocomplete distracts me enough that it really needs to be close to 100% correct before it's useful. Otherwise it's just wrecking my flow and slowing me down.
Exponentially? Absolutely not. In the best case it creates something that’s almost useful. Are you working on large actual codebases or talking about some one off toy apps?
I spend most of my time thinking about what I'm trying to do and how to best achieve it, so code completion can only make me marginally more productive. If the tool can guess a large chunk of what I've decided to do, sure, that's nice, but at the end of the day it still only adds up to a couple minutes at best.
You could try aider, or another tool/workflow where you provide whole files and ask for how they should be changed - very different from code completion type tools!
But please accept that you are in a small subset of people that it is very useful to. Every time I hear someone championing AI, it is a coder. AI is basically useless to me, it is just a convoluted expensive google search.
Particularly given the article's target is "systems based on large neural networks" and not specifically LLMs, I'd claim there are a vast number of uncontroversially beneficial applications: language translation, video transcription, material/product defect detection, weather forecasting/early warning systems, OCR, spam filtering, protein folding, tumor segmentation, drug discovery and interaction prediction, etc.
It's _extremely_ useful for lawyers, arguably even more so than for coders, given how much faster they can do stuff. LLMs are also extremely useful for anyone who writes text and wants a reviewer, and capable of executing most daily activities of some roles, such as TPMs.
It's still useful to a small subset of all those professions - the early adopters. Same way computers were useful to many professionals before the UI (but only a small fraction of them had the skillset to use terminals)
I think the big mistake is _blindly relying on the results_ - although that problem has been improving dramatically (gpt3.5 hallucinated constantly, I rarely see a hallucination w/ the latest gpt/claude models)
How do you get the LLM to the point where it can draft a demand letter? I guess I'm a little confused as to how the LLM is getting the particulars of the case in order to write a relevant letter. Are you typing all that stuff in as a prompt? Are you dumping all the case file documents in as prompts and summarizing them, and then dumping the summaries into the prompt?
Demand letters are the easiest. Drag and drop police report and medical records. Tell it to draft a demand letter. For most things, there are only a handful critical pages in the medical records, so if the original pdf is too big, I’ll trim excess pages. I may also add my personal case notes.
I use a custom prompt to adjust the tone, but that’s about it.
Multiple lawyer friends of mine are using ChatGPT (and custom GPTs) for contract reviews. They upload some guidelines as knowledge, then upload any new contract for validation. Allegedly this replaces hours of reading, which is a large portion of the work in some cases. Some of them also use it to debate a contract, to see if there's anything they overlooked or to find loopholes. LLMs are extremely good at that kind of constrained creativity mode where they _have_ to produce something (they suck at saying "I don't know" or "no"), so I guess it works as a sort of "second brain" for those cases too.
There are even reported cases of entire pieces of legislation being written with LLMs already [1]. I'm sure there are thousands more we haven't heard about, the same way researchers are writing papers with LLMs without disclosing it.
Five years later, when the contract turns out to be defective, I doubt the clients are going to be _thrilled_ with “well, no, I didn’t read it, but I did feed it to a magic robot”.
It only has to be less likely to cause that issue than a paralegal to be a net positive.
Some people expect AI to never make mistakes when doing jobs where people routinely make all kinds of mistakes of varying severity.
It’s the same as how people expect self-driving cars to be flawless when they think nothing of a pileup caused by a human watching a reel while behind the wheel.
My understanding is the firm operating the car is liable, in the full self driving case of commercial vehicles (waymo). The driver is liable in supervised self driving cases (privately owned Tesla)
> Every time I hear someone championing AI, it is a coder
The argument I make is why aren’t more people finding ways to code with AI?
I work in a leadership role at a marketing agency and am a passable coder for scripts using Python and/or Google Apps Scripts. In the past year, I’ve built more useful and valuable tools with the help of AI than I had in the 3 or so years before.
We’re automating more boring stuff than ever before. It boggles my mind that everybody isn’t doing this.
In the past, I was limited by technical ability because my knowledge of our business and processes was very high. Now I’m finding that technical ability isn’t my limitation, it’s how well I can explain our processes to AI.
Interesting, I'm the opposite now. Why would I click a couple links to read a couple (verbose) blog posts when I can read a succinct LLM response. If I have low confidence in the quality of the response then I supplement with Google search.
I feel near certain that I am saving time with this method. And the output is much more tuned to the context and framing of my question.
Hah, take for example my last query in ChatGPT:
> Are there any ancient technologies that when discovered furthered modern understanding of its field?
ChatGPT gave some great responses, super fast. Google also provides some great results (though some miss the mark), but I would need to parse at least three different articles and condense the results.
To be fair, ChatGPT gives some bad responses too.. But both an LLM and Google search should be used in conjunction to perform a search at the same time.
Use LLMs as breadth-first search, and Google as depth-first search.
I'd argue that's just because coders are first to line up for this.
There was a different thread on this site I read where a journalist used the wrong units of measurement (kilowatts instead of kilowatt-hours for energy storage). You could paste the entire article into ChatGPT with a prompt "spot mistakes in the following; [text]" and get an appropriate correction for this and similar mistakes the author made.
As in, there are journalists right now posting articles with clear mistakes that could have been proofread more accurately than they were, if they were willing to use AI. The only excuse I can think of is resistance to change. A lot of professions right now could do their jobs better if they leaned on the current generation of AI.
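If a newsroom wanted to wire that into a workflow rather than pasting by hand, it is only a few lines. A sketch, assuming the official OpenAI Python client, an API key in the environment, and whatever model name is current:

    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    def spot_mistakes(article_text: str) -> str:
        # Ask the model to act as a proofreader over the full article text.
        response = client.chat.completions.create(
            model="gpt-4o",  # illustrative model name
            messages=[
                {"role": "system", "content": "You are a careful technical proofreader."},
                {"role": "user", "content": "Spot mistakes in the following:\n\n" + article_text},
            ],
        )
        return response.choices[0].message.content

    print(spot_mistakes(open("draft.txt").read()))  # draft.txt is a hypothetical file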
Yesterday ChatGPT helped me put together a skincare routine for my wife with the multiple serums and creams that she received for Christmas.
She and I had no idea when to apply, how to combine or when to avoid combination of some of those products.
I could have googled it myself in the evenings and had the answer after a few days of research, but with o1, in a 15-minute session my wife had a solid weekly routine, the reasoning behind those choices, and academic papers with research about those products. (Obviously she knows a lot about skincare in general, so she had the capacity to recognize any wrong recommendation.)
Nothing game-changing, but it's great for saving lots of time on this kind of task.
It's 2 days after Christmas, too early to know the impact of the purchases made based on what AI recommended, either positive or negative.
If you're relying on AI to replace a human doctor trained in skin care or alternatively, your Google skills; please consider consulting an actual doctor.
If she "knows a lot about skincare in general, so she had the capacity to recognize any wrong recommendation", then what did AI actually accomplish in the end.
>> It's 2 days after Christmas, too early to know the impact of the purchases made based on what AI recommended, either positive or negative.
No worries, I can tell you what to expect: nothing. No effect. Zilch. Nada. Zero. Those beauty creams are just a total scam, and that's obvious from the fact that they're targeted just as much at women who don't need them (young, good skin) as at the ones who do (older, bad skin).
About the only thing the beauty industry has figured out really works in the last five or six decades is Tretinoin, but you can use that on its own. Yet it's sold as one component in creams with a dozen others, that do nothing. Except make you spend money.
Forgot to say: you can buy Tretinoin at the pharmacy, over the counter even depending on where you are. They sell it as a treatment for acne. It's also shown to reduce wrinkles in RCTs [1]. It's dirt cheap and you absolutely don't need to buy it as a beauty cream and pay ten times the price.
_____________
[1] Topical tretinoin for treating photoaging: A systematic review of randomized controlled trials (2022)
It's a teratogen causing birth defects and miscarriages, so severe that "women of child bearing age taking isotretinoin are required to register for the iPLEDGE program. The iPLEDGE program requires that women taking isotretinoin undergo frequent pregnancy tests and commit to using two (2) forms of birth control in order to prevent themselves from getting pregnant."[1]
From Wikipedia[2]: "Isotretinoin is a teratogen; there is about a 20–35% risk for congenital defects in infants exposed to the drug in utero, and about 30–60% of children exposed to isotretinoin prenatally have been reported to show neurocognitive impairment".
See also pages like r/AccutaneRecovery[3] for people harmed by using it for acne, reporting systemic damage, perhaps permanent damage.
Scroll down[1] for the picture of some of the possible side effects of oral Accutane/Isotretinoin on the mother[3] and note that Wikipedia says "the most common adverse effects are dry lips (cheilitis), dry and fragile skin (xeroderma), dry eyes[8] and an increased susceptibility to sunburn" and wonder how a beauty treatment which improves skin condition has most common side effects which ruin skin condition.
This line of inquiry leads to a fun conspiracy/woo hypothesis; Grant Genereux[5] claims that what it does is trigger stem cells to differentiate in the epithelial layers of the skin, which makes thicker skin in the short term (wrinkle free) and worn out stem cells and thick skin in the longer term - and that many small vessels in the body have an epithelial lining of 'internal skin' and that thickens by the same mechanism leading to narrowing and closing of all kinds of internal vessels - tear ducts and sweat glands and blood vessels and inside the kidneys and liver and inner ear, etc. which cause the dry skin and dry eyes "side effects" (direct effects really) seen outside, and the organ damage/dizziness/etc. seen inside. And that it's a teratogen by getting inside cells, damaging them, damaging the DNA/protein building mechanisms causing wider systemic damage which can be long term and is not cleared up by stopping taking Accutane; this is misunderstood as retinoids "mediating hundreds of gene expressions" but is really shotgun chaotic damage, and that's why there isn't a single symptom to look for and how it gets diagnosed as many different organ-specific diseases instead of retinoid toxicity damage. And/or causing cellular apoptosis with immune system response to a perceived 'attack', which is then seen as organ damage with immune system activity present, and misdiagnosed as "autoimmune" where the immune system has decided to attack an organ for no reason, which is why autoimmune disorders never have treatments or cures and why they cluster (people with one often get more) despite no good reason that should happen.
And that this whole collection of behaviours is triggered by food with Vitamin A (retinol in the tretinoin family) in it such as dairy and meat fat and Cod Liver oil, and foods with Beta Carotene (retinoids in the same family) such as orange/yellow/dark green coloured fruits and vegetables, and fortified Vitamin A in low-fat dairy and flours and other products through the USA/Europe. And it doesn't take much more than the RDA of Vitamin A to become problematic, and once it builds up in the body beyond the level the body can handle over a few decades, it's like blue touch paper waiting to be lit. Which, he suggests, is why auto-immune disorders cluster together (if you get one, you likely get more), why Eastern Canada Prince Edward Island near a Cod Liver Oil refinery was the highest incidence of Alzheimers in the world and that has been dropping since the refinery closed, and many more connection-between-retinoids-and-disease-states including claims by other people[6].
(I called it a 'fun' idea - it is at least fun along the lines of Tyler Vigen's spurious correlation noticer. https://www.tylervigen.com/spurious-correlations even if the main idea is not true).
[6] https://nutritionrestored.com/blog-forum/topic/the-known-his... - blog post about paper observing hypervitaminosis-A bone damage in fossil skeletons, articles observing hysteria in Eskimo women, speculated to be caused by Vitamin A toxicity from Atlantic fish liver (callback to Atlantic coast cod-liver oil refinery), connection between hypervitaminosis A and calcified arteries, connection between hypervitaminosis A and scoliosis, hypervitaminosis A in small animals causes bone growth problems and symptoms of depression.
It's being used in drive through windows.
In movies, in graphic design, pod casts, music, etc... 'entertainment' industry.
And HN: it isn't just a few oddballs on HN championing it. I wish there were a way to get a sentiment analysis of HN; it seems there are a lot more people using it than not using it.
And, what about the silent majority, the programmers that don't hang out on HN? I hear colleagues talk about it all the time.
The impact is here, whether they are self directed or not, or whether there are still a few people not using it.
I also use it for plant care tips: what should I feed this plant, what kind of soil to use, and all the questions I never bothered to Google and crawl through some long blog article for.
These are not categories that needed this change or benefit from it. Specific plant care is one of the easiest things to find information about. And are you serious you couldn't find a pancake recipe? The coffee machine idk it depends on what you did. But the other two are like a parody of AI use cases. "We made it slightly more convenient, but it might be wrong now and also burns down a tree every time you use it."
> "We made it slightly more convenient, but it might be wrong now and also burns down a tree every time you use it."
Sounds like early criticisms of the internet. I assume you mean he should be doing those things with a search engine, but maybe we shouldn't allow that either. Force him to use a book! It may be slightly less convenient, and could still be wrong, but...
Before crypto and AI, computing in general and the internet in particular were always an incredible deal in terms of how much societal value we get out of them for the electricity consumed.
In my bubble coders find LLMs least useful. After all we already have all kinds of fancy autocomplete that works deterministically and doesn't hallucinate - and still not everyone uses it.
When I use LLMs, I use it exactly as Google search on steroids. It's great for providing a summary on some unknown topic. It doesn't matter if it gets it wrong - the main value is in keywords and project names, and one can use the real Google search from there.
And it isn't expensive if you are using the free version
Do you not use it to try learning new things? I use it to help get familiar with new software (recently for FreeCAD), or new concepts (passive speaker crossover design).
She uses GPT as an editor for her emails and web content. She'll just say "improve this," and she gets options for how she might say something in a different way.
When preparing for a summit, she gave it a list of broad topics she wanted to cover. Gpt generated a list of specific titles and descriptions for her talks. This in turn gave her specific ideas to write talks about instead of just the broad topic.
When she wasn’t sure about the sequence of her talks, she asked GPT for advice on the order. GPT suggested an arrangement that created a logical flow and the reasoning for that flow, which ended up being pretty good.
She often uses gpt as a sounding board for ideas. She said she likes having an always-available colleague to bounce thoughts off of.
I'd call it a working google search, unlike, you know, google these days.
Actually google's LLM-based search results have been getting better, so maybe this isn't the end of the line for them. But for sophisticated questions (on noncoding topics!) I still always go to chatgpt or claude.
> google's LLM-based search results have been getting better
don't worry, Google WILL change this because they don't make money when people find the answer right away. They want people to see multiple ads before leaving the site.
Walk into any random coffee shop in America where people are working on their laptops and you will see some subset of them on ChatGPT. It’s definitely not just coders who are finding it useful.
Most of the comments here are responding to the title by discussing whether current AI represents intelligence at all, but worth noting that the author’s concerns all apply to human brains too. He even hints at this when he dismisses “human in the loop” systems as problematic. Humans are also unreliable and unverifiable and a security nightmare. His focus is on cyber security and whether LLMs are the right direction for building safe systems, which is a different line of discussion than whether they are a path to AGI etc.
Author here. Nope, that is not my concern about humans in the loop. My concern on that is that any human in the loop has to reconstruct by themselves from the inputs whether the output makes sense, the system provides no significant help. "Explaining my reasoning" LLMs are potentially a step forward in that.
You are right that I'm not talking about AGI here, rather about safe systems.
AI is only a dead end if you expect it to function deterministically. In the same way as people, it's not rational, and it can't be made rational.
For example, the only effective way to get an AI not to talk about Bryan Lunduke is to have an external layer that scans for his name in the output of an AI, if found, stops the session and prints an error message instead.
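A minimal sketch of that kind of external layer (the generate() argument stands in for whatever actually calls the model; the blocklist and error message are made up):

    import re

    BLOCKLIST = [r"\bbryan\s+lunduke\b"]  # names the output is never allowed to contain

    def guarded_reply(prompt: str, generate) -> str:
        reply = generate(prompt)
        for pattern in BLOCKLIST:
            if re.search(pattern, reply, flags=re.IGNORECASE):
                # Stop the session rather than trying to "fix" the model's output.
                return "Error: this response was blocked by policy."
        return reply

The filter lives entirely outside the model, which is the whole point: nothing about the model itself has to be trusted.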
If you're willing to build systems around it (like we do with people) to limit its side effects and provide sanity checks, and legality checks like those mentioned above, it can offer useful opinions about the world.
The main thing to remember is that AI is an alien intelligence. Each new model is effectively the product of millions of dollars worth of forced evolution. You're getting Stitch from "Lilo and Stitch", and you'll never be sure if it's having a bad day.
Or modern mechanical engineers getting all pissy about "tolerances." Look, we shipped you a big box of those cheap screws, so just keep trying a different one until each motor sticks together.
Also, is there a known deterministic intelligence? Only very specific computer programs can be made deterministic, and even that has taken quite a while for us to nail down. A lot of code and systems of code produced by humans today is not deterministic, and it takes a lot of effort to get it there. For most people and teams it's not even on their radar or worth the effort.
> For example, the only effective way to get an AI not to talk about Bryan Lunduke is to have an external layer that scans for his name in the output of an AI, if found, stops the session and prints an error message instead.
> If you're willing to build systems around it (like we do with people) to limit its side effects and provide sanity checks,
I don't think that comparison holds up. We do build systems around people, but people also have internal filters, and most are able to use them to avoid having to interact with the external ones. You seemed to state that AI's don't (can't?) have working internal filters and rely on external ones.
Imagine if everyone did whatever they wanted all the time and cops had to go around physically corralling literally everyone at all times to maintain something vaguely resembling "order." That would be more like a world filled with animals than people, and even animals have a bit more reasoning than that. That's where we are with AI, apparently.
> Imagine if everyone did whatever they wanted all the time and cops had to go around physically corralling literally everyone at all times to maintain something vaguely resembling "order."
I don't need to imagine anything. I live on Earth in America and to my mind you've very accurately described the current state of human society.
For the vast majority of humans this is how it works currently.
The amount of government, military, and police and the capital, energy, and time to support all of that in every single country on earth is pretty much the only thing holding up the facade of "order" that some people seem to take for granted.
> For the vast majority of humans this is how it works currently.
No it is not. Like I said, everyone knows everyone has an internal "filter" on what you say (and do). The threat of law enforcement may motivate everything (if you want to be edgy with how you look at it), but that is not the same thing as being actively, physically corrected at every turn, which is what the analogy in question lines up with.
As evidenced by most or all of the "AI hiring platforms", it's not about solving a problem successfully, but about using the latest moniker/term/sticker to appear as if you solve the problem successfully.
In reality, neither the client nor the user base have access to the ground truth of these "AI system"s to determine actual reliability and efficiency.
That's not to say there aren't some genuine ML/AGI companies like DeepMind (which solve specific narrow problems with quite high confidence), but most of the "AI" companies feel like they came over from crypto and are now selling little more than vaporware in the AI gold rush.
I always find this to be a false dichotomy. I'm not sure what use cases are a good fit for generative AI models to tackle without human supervision. But there are clearly many tasks where the combination of generative AI with human direction is a big productivity boon.
"Making fewer mistakes" implies that there's a framework within which the agent operates where its performance can be quickly judged as correct or incorrect. But, computers have already automated many tasks and roles in companies where this description applies; and competitive companies now remain capitalistically competitive not because they have stronger automation of boolean jobs, but because they're better configured to leverage human creativity in tasks and roles performance in which cannot be quickly judged as correct or incorrect.
Apple is the world's most valuable company, and many would attribute a strong part of their success to Jobs' legacy of high-quality decision-making. But anyone who has worked in a large company understands that there's no way Apple can so consistently produce their wide range of highly integrated, high quality products with only a top-down mandate from one person; especially a dead one. It takes thousands of people, the right people, given the right level of authority, making high-quality high-creativity decisions. It also, obviously, takes the daily process, an awe-inspiring global supply chain, automation systems, and these are areas that computers, and now AI, can have a high impact in. But that automation is a commodity now. Samsung has access to that same automation, and they make fridges and TVs; so why aren't they worth almost four trillion dollars?
AI doesn't replace humans; it, like computers more generally before it, brings the process cost of the inhuman things it can automate to zero. When that cost is zero, AI cannot be a differentiating factor between two businesses. The differentiating factors, instead, become the capital the businesses already have to deploy (favoring of established players), and the humans who interact with the AI, interpreting and when necessary executing on its decisions.
Those business people still can't quantify what their skilled workers do for them, though, so they hastily conclude the AI is a suitable or even improved replacement.
“A computer can never be held accountable. Therefore, a computer must never make a management decision.”
There are lots of bullshit jobs that we could automate away, AI or no. This is far from a new problem. Our current "AI" solutions promise to do it cheaper, but detecting and dealing with "hallucinations" is turning out to be more expensive than anticipated and it's not at all clear to me that this will be the silver bullet that the likes of Sam Altman claims it will be.
Even if the AI solution makes fewer mistakes, the magnitude of those mistakes matter. The human might make transcription errors with patient data or other annoying but fixable clerical errors, while the AI may be perfect with transcription but make completely sensible sounding but ultimately nonsense diagnosis, with dangerous consequences.
1953 IBM also thought that "there is a world market for maybe five computers," so I am not sure their management views are relevant this many decades later.
Philosophically the point still stands. If you delegate your management decisions to a computer and someone dies you can't put the computer in jail for murder. Ultimately a person must be responsible and that means you can't fully automate it unless you have perfect trust in the machine.
It is only irrelevant in the degree to which companies have been able to skirt laws and literally get away with murder.
I asked ChatGPT to replace "current AI" and synonyms with "HUMANS" and I'm satisfied. My favorite revised sentences:
"Does HUMANS represent a dead end?"
"HUMANS should not be used for serious applications."
"HUMANS are unmanageable, and as a consequence their use in serious contexts is irresponsible."
"HUMANS have no internal structure that relates meaningfully to their functionality."
"HUMANS have input and state spaces too large for exhaustive testing."
"HUMANS do not allow verification by parts (unit testing, integration testing, etc)."
"HUMANS have faults, but even their error behaviour is likely emergent, and certainly hard to predict or eradicate."
"HUMANS have no model of knowledge and no representation of any ‘reasoning.’"
"HUMANS represent a dead end, where exponential increases of training data and effort will give us modest increases in impressive plausibility but no foundational increase in reliability."
"HUMANS cannot be developed, or reused, as components."
"There is no possibility for stepwise development — using either informal or formal methods — for HUMANS."
and my favorite:
"In my mind, all this puts even state-of-the-art HUMANS in a position where professional responsibility dictates the avoidance of them in any serious application."
As a neuroscientist, my biggest disagreement with the piece is the author's argument for compositionality over emergence. The former makes me think of Prolog and Lisp, while the latter is a much better description of a brain. I think emergence is a much more promising direction for AGI than compositionality.
Author here. So what! I am not talking about promising directions for AGI, I am talking about having computer systems that we can have confidence in. Sure, AGI if it ever happens will look more like emergence than compositionality, and I'm sure it won't feel a need to explain to us fallible humans why its decisions are correct. In the meantime, I'd like computer systems to be manageable, reliable, transparent, and accountable.
100% agree. When we explicitly segment and compose AI components, we remove the ability for them to learn their own pathways between the components. The bitter lesson[1] has been proven time and time again: throwing a ton of data and compute at a model yields better results than what we could come up with ourselves.
That said, we can still isolate and modify parts of a network, and combine models trained for different tasks. But you need to break things down into components after the fact, instead of beforehand, in order to get the benefits of learning via scale of data + compute.
There are strong signals that continuing to scale up in data is not yielding the same reward (Moore's Law anyone?) and it's harder to get quality data to train on anyway.
Business Insider had a good article recently on the customer reception to Copilot (underwhelming: https://archive.fo/wzuA9). For all the reasons we are familiar with.
My view: LLMs are not getting us to AGI. Their fundamental issues (black box + hallucinations) won't be fixed until there are advances in technology, probably taking us in a different direction.
I think it's a good tool for stuff like generating calls into an unfamiliar API - a few lines of code that can be rigorously checked - and that is a real productivity enhancement. But more than that is thin ice indeed. It will be absolutely treacherous if used extensively for big projects.
Oddly, for free-flow, brainstorming-like associations, I think it will be a more useful tool than for those tasks for which we are accustomed to using computers, which require extreme precision and accuracy.
I was an engineer in an AI startup, later acquired.
> Their fundamental issues (black box + hallucinations)
Aren’t humans also black boxes that suffer from hallucinations?
E.g. for hallucinations: engineers make dumb mistakes in their code all the time, and normal people make false assertions about geopolitical, scientific and other facts all the time. Cf. the Dunning-Kruger effect.
And black box because you can only interrogate the system at its interface (usually voice, or through written words / pictures).
It roams around the internet, synthesizing sentences that look kind of the same as the source material, correct me if I'm wrong?
There are a lot of adjustments being made to the models (by humans, mostly, I guess)?
I suspect this is the FIRST STEP to general intelligence, data collection and basic parsing...
I suspect there is not a thing called "reasoning" - but a multi step process...
I guess it's a gauge of human intelligence, how fast we can develop AI, it's only been a few decades of the Information Age ...
I've come around to thinking of our modern "AI" as a lossy compression engine for knowledge. When you ask a question it is just decompressing a tiny portion of the knowledge and displaying it for you, sometimes with compression artifacts.
This is why I am not worried about the "AI Singularity" like some notable loudmouth technologists are. At least not with our current ML technologies.
That is exactly how I think about it. It’s lossy compression. Think about how many petabytes of actual information any of these LLMs were trained on. Now look at the size of the resultant model. It’s orders of magnitude smaller. It made it smaller by clipping the high-frequency bits of some multi-billion-dimension graph of knowledge. Same basic thing you do with other compression algorithms like JPEG or MP3.
These LLMs are just lossy compression for knowledge. I think the sooner that “idea” gets surfaced, the sooner people will find ways to train models with fixed pre-computed lookup tables of knowledge categories and association properties… basically taking a lot of the randomness out of the training process and getting more precise about which dimensions of knowledge and facts are embedded into the model.
… or something like that. But I don’t think this optimization will be driven by the large, well-funded tech companies. They are too invested in flushing money down the drain with more and more compute. Their huge budgets blind them to other ways of doing the same thing with significantly less.
The future won’t be massive large language models. They’ll be “small language models” custom tuned to specific tasks. You’ll download or train a model that has incredible understanding of Rust and Django but won’t know a single thing about plate tectonics or apple pie recipes.
Why wouldn't we have a small language model for python programming now though?
That is an obvious product. I would suspect the reason we don't have a small language python model is because the fine tuned model is no better than the giant general purpose model.
If that is the case, it is not good. It even makes me wonder whether we are not really compressing knowledge, but instead building a hack that creates the illusion of compressing knowledge.
With a bit (OK, a lot) of reinforcement learning that prioritizes the best chains-of-thoughts, this compression engine becomes a generator of missing training data on how to actually think about something instead of trying to come up with the answer right away as internet text data suggests it should do.
That's the current ML technology. What you've described is the past: about four years old, to be precise.
If you think that "compression" somehow means "non-intelligent", consider this:
The best compression of data that is theoretically achievable (see Kolmogorov complexity) is an algorithm that approximates the process that produces the data. And which process produces texts on the internet? The activity of the human brain. (I described it a bit sloppily. We are dealing with the probability distribution of the data, not the data itself. But the general idea still holds.)
Using chain-of-thought removes the constraint that the resultant algorithm's output should use a fixed amount of compute per token.
> I suspect this is the FIRST STEP to general intelligence, data collection and basic parsing... I suspect there is not a thing called "reasoning" - but a multi step process... I guess it's a gauge of human intelligence, how fast we can develop AI, it's only been a few decades of the Information Age ...
The question the article is posing isn't whether LLMs do some of the things we would want general AI to do, or whether they are a good first attempt by humans at creating something sort of like AI.
The question is whether current machine learning techniques, such as LLMs, that are based on neural networks are going to hit a dead end.
I don't think that's something anyone can answer for sure.
LLMs, by themselves, are going to hit a dead end. They are not enough to be an AGI, or even a true AI. The question is whether LLMs can be a part of something bigger. That, as you say, is not something anyone can currently answer for sure.
Interesting article. My main criticism is that, given ChatGPT is already used by hundreds of millions of people every day, it's difficult to argue that current AI is a dead end. It has its flaws, but it is already useful in human-in-the-loop situations. It will partly or completely change the way we search for information on the internet and greatly enhance the ability to educate ourselves on anything. This is essentially a second Wikipedia moment. So, it is useful in its current form, to some extent.
It is certainly changing the way I search for information on the internet. Now there are a lot of people who, instead of staying silent, post wildly wrong answers from an LLM on a question or subject they themselves are not familiar with.
Some answers are very long and just re-state the previous comments as if they were explaining simple concepts to a five year old child.
In short: in the hands of humans, tools like ChatGPT cause exponential growth of spam, engagement farming and malicious disinformation and propaganda on the internet. I fear these negative use cases are growing exponentially faster than the useful parts. We will all drown in AI manure.
Two years in, we are still going, and it is still becoming more and more useful. Keep in mind, things like multimodality are still in early stages, so improvements are coming. It probably won't lead to AGI, but still, these tools are doomed to become more and more useful. https://open.substack.com/pub/transitions/p/here-is-why-ther...
I use it for fast documentation of unknown (to me) APIs and other pieces of software. It's saved me hours of time, where I didn't have to go through the developer's site/documentation, and I quickly get example code.
Would I use the code directly in production? No. I always use it as an example and write my own code.
Whenever a new technology emerges, along with it always emerge naysayers who claim that the new technology could never work --- while it's working right in front of their noses. I'm sure there were people after Kitty Hawk who insisted that heavier than air flight would never amount to much economically. Krugman famously insisted in the 90s that the internet would never amount to anything. These takes are comical in hindsight.
The linked article is another one of these takes. AI can obviously reason. o3 is obviously superhuman along a number of dimensions. AI is obviously useful for software development. This guy spent 20 years of his life working on formal methods. Of course he's going to poo-poo the AI revolution. That doesn't make him right.
> Whenever a new technology emerges, along with it always emerge naysayers who claim that the new technology could never work
There's some survivorship bias going on here – you only consider technologies which succeeded, and find examples of people scrutinising them beforehand. However, we know that not every nascent technology blossoms; some are really effective, but can't find adopters; some are ahead of their time; some are cost-prohibitive; and some are outright scams.
It's not a given that every promising new technology is a penicillin – some might be Theranos.
Author here. Putting a lot of words in my mouth there. In particular, I don't talk about whether AI is useful for software development - I talk about whether AI is useful as reliable software. I don't discuss how AI's abilities relate to human abilities. I don't discuss whether what AI currently does counts as "reasoning".
i'm so confused by these discussions around hitting the wall.
sure, a full-on AGI, non-hallucinating AI would be great. but the current state is already a giant leap. there's so much untapped potential in the corporate world where whole departments, processes, etc can be decimated.
doing this and dealing with the socio-economic and political fall-out from those efficiency leaps can happen while research (along multiple pathways) goes on, and this will take 5-10 years at least.
Just because transformer-based architectures might be a dead end (in terms of how far they can take us toward achieving artificial sentience), and the outcome may not be mathematically provable, as this author seems to want it to be, does not mean that the technology isn't useful.
Even during the last AI winter, previous achievements such as Bayesian filtering, proved useful in day to day operation of infrastructures that everyone used. Generative AI is certainly useful as well, and very capable of being used operationally.
It is not without caveats, and the end goals of AI researchers have not been achieved, but why does that lessen the impact or usefulness of what we have? It may be that we can iterate on transformer architecture and get it to the point where it can help us make the next big leap. Or maybe not. But either way, for day to day use, it's here to stay, even if it isn't the primary brain behind new research.
Just remember that the only agency that AI currently has is what we give it. Responsible use of AI doesn't mean "don't use AI", it means, "don't give it responsibility for critical systems that it's ill equipped to deal with". If that's what the author means by "serious applications", then I'm on board, but there are a lot of "serious applications" that aren't human-life-critical, and I think it's fine to use current AI tech on a lot of them.
The author declares that "software composability" is the solution as though that is a given fact. Composability is as much a dead end as the AI he describes. Decades of attempts at formal composability have not yielded improvements in software quality outside of niche applications. It's a neat idea, but as you scale the complexity explodes making such systems as opaque and untestable as any software. I think the author needs to spend more time actually writing code and less time thinking about it.
No, they are very useful tools to build intelligent systems out of.
Everything from Perplexity onward shows just how useful agents can be.
You get another bump in utility when you allow for agent swarms.
Then another one for dynamically generated agent swarms.
The only reason it's not coming for your job is that LLMs are currently too power hungry to run those jobs for anything but research, at a couple of thousand to a couple of million times the price of a human doing the work.
Which works out to 10 to 20 epochs of whatever Moore's law looks like in graphics cards.
What is that bump in utility in practical terms? You can point to a benchmark improvement, but that's no indication the agent swarm isn't reducing to "giving an LLM an arbitrary number of random guesses".
Standard LLM quadratic attention isn't an approximation, it's perfect recall. Approaches that compress that memory down into a fixed-size state are an approximation, and generally perform worse, that's why linear transformers aren't widely used.
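For concreteness, here is a minimal numpy sketch of that contrast (my illustration, not the commenter's code): full attention scores every query against every key, an n-by-n matrix, which is where both the quadratic cost and the exact recall come from, while a fixed-size recurrent summary keeps memory constant but is necessarily lossy.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def full_attention(Q, K, V):
    # Every query attends to every key: an (n, n) score matrix, exact recall.
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    return softmax(scores) @ V

def fixed_state_summary(K, V):
    # Toy linear-attention-style state: a single d x d matrix regardless of
    # sequence length, so per-token detail is necessarily lost.
    return sum(np.outer(k, v) for k, v in zip(K, V))

rng = np.random.default_rng(0)
n, d = 6, 4
Q, K, V = rng.normal(size=(3, n, d))
print(full_attention(Q, K, V).shape)    # (6, 4), built via a 6x6 score matrix
print(fixed_state_summary(K, V).shape)  # (4, 4), independent of sequence length
```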
Maybe it is, maybe it isn’t. The only thing I know is, none of the arrogant fuckers on hacker news know anything about it. But that won’t stop them from posting.
There's an upside! If they're wrong, and they manage to convince more people—it basically gives you more of an advantage. I don't get into arguments about the utility of LLM technology anymore because why bother?
My take is that even if AI qualitatively stops where it is right now, and only continues to get faster / more memory efficient, it already represents an unprecedented value add to human productivity. Most people just don't see it yet. The reason why that is, is because it "fills in" the weak spots of the human brain - associative memory, attention, working memory constraints, aversion to menial mental work. This does for the brain what industrialization did for the body. All we need to do to realize its potential is emphasize _collaboration_ with AI, rather than _replacement_ by AI, that the pundits currently emphasize as rage (and therefore click) bait.
There are two epistemic poles: the atomistic and the probabilistic. The author subscribes to a rule-based atomistic worldview, asserting that any perspective misaligned with this framework is incorrect. Currently, academia is undergoing a paradigm shift in the field of artificial intelligence. Symbolic AI, which was the initial research focus, is rapidly being replaced by statistical AI methodologies. This transition diminishes the relevance of atomistic or symbolic scientists, making them worry they might become irrelevant.
An observation about scientific paradigm shifts is that they tend not to reverse. Also, as someone commented about the lingo, the fundamental problem is the different philosophical views of what knowledge is and can be: either knowledge is based on symbols and rules, as in mathematics, or it is probabilistic, as in anything we can actually measure. Both these views can coexist, and maybe AI will find the missing link between them some day. Possibly no human will grasp the link.
Indeed, and unfortunately. I've been reading up on "the binding problem" in AI lately and came across a paper that hinged on there being an "object representation" which would magically solve the apparent issues in symbolic AI. In the discussion some 20 pages later, the authors confessed that neither they, nor anybody else, could define what an object was in the first place. Sometimes the efforts seem focused on "not letting the other team win" rather than actually having something tangible to bring to the table.
I never want to claim certainties, but it seems pretty close to certain that symbolic AI loses to statistical AI.
I think there is room for statistical AI to operate symbolic systems so we can better control outputs. Actually, that's kind of what is going on when we ask AI to write code.
> Many of these neural network systems are stochastic, meaning that providing the same input will not always lead to the same output.
The neural networks themselves are not stochastic. It is the sampling from the neural net's output distribution to produce a list of words as output [1] that is the stochastic part.
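As a rough sketch of that distinction (toy code, with a stand-in `fake_forward_pass` in place of a real network): the forward pass maps the same input to the same logits every time, and randomness only enters at the sampling step, where greedy decoding removes it entirely.

```python
import numpy as np

def fake_forward_pass(prompt: str) -> np.ndarray:
    # Stand-in for the network: a deterministic map from prompt to logits.
    seed = sum(prompt.encode()) % (2**32)
    return np.random.default_rng(seed).normal(size=5)  # pretend 5-token vocab

def sample(logits: np.ndarray, temperature: float, rng: np.random.Generator) -> int:
    if temperature == 0.0:
        return int(np.argmax(logits))             # greedy: always the same token
    p = np.exp(logits / temperature)
    p /= p.sum()
    return int(rng.choice(len(logits), p=p))      # stochastic: varies run to run

logits = fake_forward_pass("hello")               # same prompt -> same logits
print(sample(logits, 0.0, np.random.default_rng()))  # deterministic
print(sample(logits, 1.0, np.random.default_rng()))  # may differ between runs
```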
“One could offer so many examples of such categorical prophecies being quickly refuted by experience! In fact, this type of negative prediction is repeated so frequently that one might ask if it is not prompted by the very proximity of the discovery that one solemnly proclaims will never take place. In every period, any important discovery will threaten some organization of knowledge.”
René Girard, Things Hidden Since the Foundation of the World, p. 4
The British-Canadian computer scientist often touted as a "godfather" of artificial intelligence has shortened the odds of AI wiping out humanity over the next three decades, warning the pace of change in the technology is "much faster" than expected. From a report:
Prof Geoffrey Hinton, who this year was awarded the Nobel prize in physics for his work in AI, said there was a "10 to 20" per cent chance that AI would lead to human extinction within the next three decades.
Previously Hinton had said there was a 10% chance of the technology triggering a catastrophic outcome for humanity. Asked on BBC Radio 4's Today programme if he had changed his analysis of a potential AI apocalypse and the one in 10 chance of it happening, he said: "Not really, 10 to 20 [per cent]."
The fact of the matter is that if AI's externalities (that is, its massive energy consumption) were exposed to end users and humanity in general, no one would use it.
I think this is wildly optimistic about how environmentally conscious customers of LLMs are. People use fossil fuels directly and through electricity consumption in an unconscionable way, at a scale wildly exceeding a ChatGPT user's energy expenditure.
We desperately need to rapidly regulate down fossil fuel usage and production for both electricity generation and transport. The rest of the world needs to follow the example of the EU CO2 emissions policy, which guarantees emissions follow a downward slope independent of what the CO2 emissions are spent on.
What I find interesting is that current LLMs are based primarily on written data, which is already an abstraction / abbreviation of most observed phenomena.
What happens when AI starts to send out its own drones, or perhaps robots, and tries to gather data and train on what it observes itself?
I think we may be closer to this point than we realize… the results of AI could get quite interesting once the human-level abstraction of knowledge is perhaps reduced.
I use coding libraries which are either custom, recent, or haven't gained much traction. Therefore, AI models haven't been trained on them and LLMs are worthless for helping me code. The problem is that new libraries will not gain traction if nobody uses them, because developers and their LLMs are stuck in the past. The evolution of open source code has become stagnant.
Why not feed the library code and documentation to the LLM? Using it as a knowledge base is bound to be limited. But having it be your manual-reading buddy can be very helpful.
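A minimal sketch of that approach, where `ask_llm` is a hypothetical stand-in for whatever chat client is actually used: read the library's docs, trim them to an assumed context budget, and prepend them to the question.

```python
from pathlib import Path

MAX_CONTEXT_CHARS = 20_000  # assumed character budget; real limits are model-specific

def build_prompt(question: str, doc_paths: list[str]) -> str:
    docs = "\n\n".join(
        Path(p).read_text(errors="ignore") for p in doc_paths if Path(p).exists()
    )
    return (
        "You are answering questions about the library documented below. "
        "If the documentation does not cover something, say so instead of guessing.\n\n"
        f"--- DOCUMENTATION ---\n{docs[:MAX_CONTEXT_CHARS]}\n--- END ---\n\n"
        f"Question: {question}"
    )

def ask_llm(prompt: str) -> str:
    return "(model response would appear here)"  # placeholder for a real API call

print(ask_llm(build_prompt("How do I configure retries?",
                           ["docs/README.md", "docs/api.md"])))  # hypothetical paths
```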
I don't understand why people feel the need to lie in these posts. AI isn't only good at using existing codebases. Copy your code in. It will understand it. You either haven't tried or are intentionally misleading people.
I think most AI research up to this day is a dead end. Assuming that intelligence is a problem solvable by computers implies that intelligence is a computable function. Nobody up to this day has been able to give a formal mathematical definition of intelligence, let alone a proof that it can be reduced to a computable function.
So why assume that computer science is the key to solving a problem that cannot even be defined in terms of math? We had formal definitions of computers decades before they became a reality, but somehow cannot make progress in formally defining intelligence.
I do think artificial intelligence can be achieved by making artificial intelligence a multidiscipline endeavor with biological engineering at its core, not computer science. See the work of Michael Levin to see real intelligence in action: https://www.youtube.com/watch?v=Ed3ioGO7g10
> Nobody up to this day has been able to give a formal mathematical definition of intelligence, let alone a proof that it can be reduced to a computable function.
We can't prove the correctness of the plurality of physics. Should we call that a dead end too?
If you believe in functionalism (~mental states are identified by what they do rather than by what they are made of), then current AI is not a dead end.
We wouldn't need to define intelligence; just making it big and efficient enough to replicate what currently exists would be intelligence by that definition.
My point is that if you use biological cells to drive the system, which already exhibit intelligent behaviors, you don't have to worry about any of these questions. The basic unit you are using is already intelligent, so it's a given that the full system will be intelligent. And not an approximation but the real thing.
Thanks for pointing me out to this. This is a proposed definition of intelligence. Is it the same as the real thing, though? Even assuming that it was:
> Like Solomonoff induction, AIXI is incomputable.
That would mean that computers can, at best, produce an approximation. We know the real thing exists in nature though, so why not take advantage of those competencies?
Whether it is a dead end or not is a question of definition. If something is not economically self-sustaining by itself, is it a dead end? How about if people keep doing it anyway, for other reasons? What if that includes pretending it is not a dead end?
For example, there is a considerable incentive to flood resources to "current AI". As a consequence, enough people have a vested interest to participate in what might be called a shared illusion in "current AI is not a dead end". If enough people participate in such a shared illusion for 10+ years, and the illusion has real-life consequences, is it really a dead end?
I believe that "stochastic parrot" is actually a compliment about the current AI. The sweet spot for current AI appears to be as little stochasticity in the output as possible. Randomness in the output is inversely proportional to the benefit you can gain from the AI.
Value in current AI is gained by allowing randomness in the input, i.e. the training data and the form of the questions you can ask it.
For example, you can create a useful AI support system that regurgitates answers to FAQs back to people, so they can ask questions about the FAQs instead of browsing them.
Such a tool is useful because humans are energy-optimizing, i.e. lazy, i.e. avoid going through the FAQs themselves, i.e. are unwilling to pay for the answer with their attention, i.e. using their own brain to do pattern recognition.
By avoiding directed pattern recognition, or more specifically by avoiding consciously directing their attention as much as possible, people are not practicing this ability. Over time, they become less and less capable of consciously directing their attention.
As a consequence, this ability becomes an even scarcer resource, making "current AI" more and more useful.
So it is less about AI getting more intelligent, but people getting less intelligent, and thus AI becoming relatively more intelligent.
I think it does represent a dead end, but not for the reasons presented in this article.
The real issue, in my opinion, is that we will hit practical limits with training data and computational resources well before AGI turns us all into paperclips; basically, there is no "Moore's Law" for AI, and we are already slowing down with existing models like GPT.
We are in the vertical scaling phase of AI model development, which is not sustainable long-term.
Author here. Fair enough. The "dead end" in the title isn't mine; it was extracted by the editor from what's essentially a side comment on the limits of LLM scaling. My title would have been something like "We can't use current AI for critical applications", with "for" being essentially different from "in", if you want the nuance there.
> The real issue in my opinion is that we will hit practical limits with training data and computational resources well before AGI turns us all into paperclips [...]
I think you are correct, but also I think that even if that were not the case, the Thai Library Problem[1] strongly suggests that AGI will have to be built on something other than LLMs (even if LLM-derived systems were to serve as an interface to such systems).
I call this “The bitter cycle”, after the famous essay.
1. Someone finds a new approach, often based on intuition or understanding.
2. People throw more data at it claiming that only data matters.
3. The new method eventually saturates, everybody starts talking about a new ai winter.
We had this with: perceptrons, conv nets, RNNs. Now we see it with transformers.
My guess is the next iteration will be liquid networks or KANs, depending on which one we figure out how to train efficiently first.
The good thing is that people have been working to build an understanding of why these things work for the last 20 years, so the period between the cycles gets shorter.
There is no reason to privilege compositionality/modularity over emergence. One day we may have the emergence of compositionality in a large model. It would be a dead end only if this were probably not possible.
I don’t see how it would because at the end of the day a model is like a program… input->output. This seems infinitely useful and we are just starting to understand how to use this new way of computing.
What I find funny is that the discussion revolves a lot around software development, which is where LLMs excel. Outside of that and creating junk text (like a government report, patent application, etc.) they seem to be pretty useless. So most of society doesn't care about it, it's not as big a revolution as SWEs think it is at the moment, and the discussion about the future is actually philosophical: do we think the trend of development will continue, or will we hit a wall?
LLMs are text compression algos.
They are very good at memorising & retrieving text related to the user input.
They can even pass bar exams based on that - some misinterpret that as intelligence.
However, no amount of scaling will change a text memorisation algo into a symbolic reasoning or composability algo, both of which are necessary for progress towards AGI.
So yes, LLMs are a dead end in the quest for AGI.
However, they have their uses as a google/stackoverflow replacement.
LLM yappers are everywhere. One dude with a lot of influence is busy writing blogs on why “prompt engineering” is a “real skill” and engaging in the same banal discourse on every social media platform under the sun. Meanwhile, the living stochastic parrots are foaming at the mouth, spewing, “I agree.”
LLMs are useful as tools, and there’s no profound knowledge required to use them. Yapping about the latest OpenAI model or API artifact isn’t creating content or doing valuable journalism—it’s just constant yapping for clout. I hope this nonsense normalizes quickly and dies down.
AI is useful as a tool but it is far from trustworthy.
I just used Grok to write some cron scripts for me, and it gave me perfectly good results. If you know exactly what you want, it is great.
It is not the end of software programmers, though, and it is very dangerous to give it too much leeway, because you will almost certainly end up with problems.
I agree with the conclusion that a hybrid model is possible.
It speeds up code writing, it's not useless. Best use case for me is to help me understand libraries that are sparsely documented (e.g. dotnet roslyn api).
If I can get 100 lines generated instantly while explaining what I want in 25, scan the answer just to validate it, and then (no, wait) add another 50 lines because I forgot something earlier, all of that in minutes, then I'm happy.
Plus I can detach the "tell the AI" part from the actual running of the code. That's pretty powerful to me.
For instance, I could be on the train thinking of something, chat it over with an LLM, get it where I want and then pause before actually copying it into the project.
The elephant in the room: The user interface problem
We seem to be dancing around a problem sitting in the middle of the room like an elephant no one is acknowledging, and that is that the interface to Artificial Intelligence and Generative AI is a place that requires several degrees of innovation.
I would argue that the first winning feat of innovation in interfacing with AI was the "CHAT BOX". And it works well enough for 40% of use cases. And there is another 20% of uses that WE THE PEOPLE can use our imagination (prompt engineering) to manipulate the chat box to solve. On this topic, there was an article/opinion that said complex LLMs are unnecessary because 90% of people don't need them. Yeah. Because the chat box cannot do much of what would require heavier LLMs.
Complex AI and large data sets need nicer presentation and graphics, more actionable interfaces, and more refined activity concepts, as well as metadata that gives information on the reliability or usability of generated information.
Things like edit sections of an article, enhance articles, simplify articles, add relevant images, compress text to fit in a limited space, generate sql data from these reports, refine patterns found in a page with supplied examples, remove objects, add objects, etc.
Some innovation has to happen in MS Office interfaces. Some innovations have to happen in photoshop-like interfaces.
The author is complaining about utopian systems being incompatible with AI. I would argue AI is a utopian system being used in a dystopian world where we are lacking rich usable interfaces.
Last week I had to caution a junior engineer on my team to only use an LLM for the first pass, and never rely on the output unmoderated.
They're fine as glorified autocomplete, fuzzy search, or other applications where accuracy isn't required. But to rely on them in any situation where accuracy is important is professional negligence.
Does anyone seriously think that results of any current approaches would suddenly turn into godlike, super-intelligent AGI if only we threw an arbitrary number of GPUs at them? I guess I assumed everyone believed this was a stepping stone at best, but were happy that it turned out to have some utility.
I'm not convinced AI is as hamstrung as people seem to think. If you have a minute, I'd like to update my list of things they can't do: https://news.ycombinator.com/item?id=42523273
Somehow, fallible humans create robust systems. Look to "AI" to do the same, at a far higher speed. The "AI" doesn't need to recite the Fibonacci sequence; it can write (and test) a program that does so. Speed is power.
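A trivial illustration of that point (my example, not actual model output): instead of reciting the sequence, emit a small program plus a check for it.

```python
def fib(n: int) -> int:
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

def test_fib():
    assert [fib(i) for i in range(8)] == [0, 1, 1, 2, 3, 5, 8, 13]

test_fib()
print([fib(i) for i in range(10)])  # [0, 1, 1, 2, 3, 5, 8, 13, 21, 34]
```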
Why is it that LLMs are ‘stochastic’, shouldn’t the same input lead to the same output? Is the LLM somehow modifying itself in production? Or is it just flipping bits caused by cosmic radiation?
For Mixture of Experts models (which GPTs are), they can produce different results for an input sequence if that sequence is retried together with a different set of sequences in its inference batch, because the model ("expert") routing depends on the batch, not on the single sequence: https://152334h.github.io/blog/non-determinism-in-gpt-4/
And in general, binary floating point arithmetic cannot guarantee associativity, i.e. `(a + b) + c` might not be the same as `a + (b + c)`. That in turn can lead to the model picking another token in rare cases (and, as an auto-regressive consequence, the entire remainder of the generated sequence might differ): https://www.ingonyama.com/blog/solving-reproducibility-chall...
Edit: Of course, my answer assumes you are asking about the case when the model lets you set its token generation temperature (stochasticity) to exactly zero. With default parameter settings, all LLMs I know of randomly pick among the best tokens.
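The associativity point above is easy to see in plain Python: the same three values summed in a different order give different IEEE-754 results, which is one way identical inputs can diverge across differently parallelized runs.

```python
a, b, c = 0.1, 0.2, 0.3
print((a + b) + c)                  # 0.6000000000000001
print(a + (b + c))                  # 0.6
print((a + b) + c == a + (b + c))   # False
```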
They always return the same output for the same input. That is how tests are done for llama.cpp, for example.
To get variety, you give each person a different seed. That way each user gets consistent answers but different than each other. You can add some randomness in each call if you don’t want the same person getting the same output for the same input.
It would be impossible to test and benchmark llama.cpp et al otherwise!
By the time you get to a UI someone has made these decisions for you.
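A toy sketch of the seeding behaviour described above (not llama.cpp itself): with the model's output held fixed, the same seed reproduces the same tokens, and different seeds give different but individually consistent outputs.

```python
import numpy as np

def generate(prompt: str, seed: int, n_tokens: int = 5) -> list[int]:
    rng = np.random.default_rng(seed)      # per-user seed
    vocab = 50
    p = np.full(vocab, 1.0 / vocab)        # stand-in for the model's fixed output
    return [int(rng.choice(vocab, p=p)) for _ in range(n_tokens)]

print(generate("hi", seed=1) == generate("hi", seed=1))  # True: reproducible
print(generate("hi", seed=1) == generate("hi", seed=2))  # almost surely False
```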
Author here. Disagree on not slowing down. Indeed "dead end" isn't really the focus of the argument, editor's choice of title, not mine. Focus of argument is these systems are foundationally unreliable so can't be used as critical applications [though maybe as intensively managed components of such].
I hope so because I'm extraordinarily sick of the technology. I can't really ask a question at work without some jackass posting an LLM answer in there. The answers almost never amount to anything useful, but no one can tell since it looks clearly written. They're "participating" but haven't actually done anything worthwhile.
I hope so, but for different reasons. Agreed they spit out plenty of gibberish at the moment, but they’ve also progressed so far so fast it’s pretty scary. If we get to a legitimate artificial general super intelligence, I’m about 95% sure that will be terrible for the vast, vast majority of humans, we'll be obsolete. Crossing my fingers that the current AI surge stops well short of that, and the push that eventually does get there is way, way off into the future.
I believe (most) people direct their ambitions toward nurturing safe, peaceful, friend-filled communities. AGI won’t obsolete those human desires. Hopefully we weather the turbulence that comes with change and come out the other side with new tools that enable our pursuits. In the macro, that’s been the case. I am grateful to live in a time of literacy, antibiotics, sanitation, electricity… and am optimistic that if AGI emerges, it joins that list of human-empowering creations.
Gotta wonder if Google has used code from internal systems to train Gemini? Probably not, but at what point will companies start forking over source code for LLM training for money?
It seems much cheaper, safer legally and more easily scalable to simply synthesize programs. Most code out there is shit anyway, and the code you can get by the GB especially so.
I would assume that internal code at Google is of higher quality than random code you find on Github. Commit messages, issue descriptions and code review is probably more useful too.
If and only if something like high-paying UBI comes along, and people are freed to pursue their passions and as a consequence, benefit the world much more intensely.
Inflation is a lack of goods for a given demand, though. I.e., if we can flood the world with cheap goods, then inflation won't happen. That would make practical UBI possible. To some extent it has already happened.
My intuition, based on what I know of economics, is that a UBI policy would have results something like the following:
* Inflation, things get more expensive. People attempt to consume more, especially people with low income.
* People can't consume more than is produced, so prices go up.
* People who are above the break-even line (when you factor in the taxes) consume a bit less, or stay the same and just save less or reduce investments.
* Producers, seeing higher prices, are incentivized to produce more. Increases in production tend to be concentrated toward the things that people who were previously very income-limited want to buy. I'd expect a good bit of that to be basic essentials, but of course it would include lots of different things.
* The system reaches a new equilibrium, with the allocation of produced goods being a bit more aimed toward the things regular people want, and a bit less toward luxury goods for the wealthy.
* Some people quit work to take care of their kids full-time. The change in wages of those who stay working depends heavily on how competitive their skills are -- some earn less, but with the UBI still win out. Some may actually get paid more even without counting the UBI, if a lot of workers in their industry have quit due to the UBI, and there's increased demand for the products.
* Prices have risen, but not enough to cancel out one's additional UBI income entirely. It's very hard to say how much would be eaten up by inflation, but I'd expect it's not 10% or 90%, probably somewhere in between. Getting an accurate figure for that would take a lot of research and modeling.
Basically, I think it's complicated, with all the second and third-order effects, but I can't imagine a situation where so much of the UBI is captured by inflation that it makes it pointless. I do think that as a society, we should be morally responsible for people who can't earn a living for whatever reason, and I think UBI is a better system than a patchwork of various services with onerous requirements that people have to put a lot of effort into navigating, and where finding gainful employment will cause you to lose benefits.
The idea that AI will ever remove all struggle, even if it reaches AGI, is absurd. AI by itself can't give you a hug, for example--and even if advances in robotics make it possible for an AI-controlled robot to do that, there are dozens of unsolved problems beyond that to make that something that most people would even want.
AI enthusiasm really is reaching a religious level of ridiculous beliefs at this point.
I doubt ai will remove all struggle. I suspect we wouldn't see great extents of human passion in a world where everyone is fed, clothed, housed, etc without needing to exert themselves at all.
And AI isn't going to feed, clothe, or house people either.
AGI, at best, would provide ideas for how to do those things. And the current AI, which is not AGI, can only remix ideas humans have already given it--ideas which haven't fed, clothed, or housed us all yet.
That requires achieving post-scarcity to work in practice and be fair, though. If achievable, it’s not clear how it relates to AGI. I mean, there’s plenty of intelligence on this planet already, and resources are still limited - and it’s not like AGI would somehow change that.
One thing I thought recently, is that a large amount of work is currently monitoring and correcting human activity. Corporate law, accounting, HR and services etc. If we have AGI that is forced to be compliant, then all these businesses disappear. Large companies are suddenly made redundant, regardless of whether they replace their staff with AI or not.
I agree that if true AGI happens (current systems still cannot reason at all, only pretend to do so) and if it comes out cheaper to deploy and maintain, that would mean a lot of professions could be automated away.
However, I believe this had already happened quite a few times in history - industries becoming obsolete with technological advances isn’t anything new. This creates some unrest as society needs to transition, but those people are always learning a different profession. Or retire if they can. Or try to survive some other way (which is bad, of course).
It would be nice, of course, if everyone won’t have to work unless they feel the need and desire to do so. But in our reality, where the resources are scarce and their distribution in a way that everyone will be happy is a super hard unsolved problem (and AGI won’t help here - it’s not some Deus ex Machina coming to solve world problems, it’s just a thinking computer), I don’t see a realistic and fair way to achieve this.
Put simply, all the reasons we cannot implement UBI now will still remain in place - AGI simply won’t help with this.
I guess the point I am trying to make is that, paradoxically, the more an AI company's products are integrated into the economy, the less value they can extract from the economy, as a large amount of the world's economic output is just dealing with the human factor.
For the vast majority of people, getting rid of necessary work will usher in an unprecedented crisis of meaning. Most people aren't the type to pursue creative ends if they didn't have to work. They would veg out or engage in degenerate activities. Many people have their identity wrapped up in the work they do, or in being a provider. Taking this away without having something to replace it will be devastating.
Good. Finally they’ll realize the meaninglessness of their work and how they’ve been exploited in the most insidious way. To the point of forgetting to answer the question of what it is they most want to do in life.
The brain does saturate eventually and gets bored. Then the crisis of meaning. Then something meaningful emerges.
We’re all gonna die. Let’s just enjoy life to the fullest.
>I do expect the next comment would be something like "work is a path to godliness"
And you think these kinds of maxims formed out of vacuums? They are the kinds of sayings that are formed through experience reinforced over generations. We can't just completely reject all the historical knowledge encoded in our cultural maxims and expect everything to work out just fine. Yes, it is true that most people not having productive work will fill the time with frivolous or destructive ends. Modernity does not mean we've somehow transcended our historical past.
> They are the kinds of sayings that are formed through experience reinforced over generations.
Sure, but the whole point is that the conditions that led to those sayings would no longer be there.
Put a different way: those sayings and attitudes were necessary in the first place because society needed people to work in order to sustain itself. In a system where individual human work is no longer necessary, of what use is that cultural attitude?
It wasn't just about getting people to work, but about keeping people from degenerate and/or anti-social behavior. Probably the single biggest factor in the success of a society is channeling young adult male behavior towards productive ends. Getting them to work is part of it, but so is keeping them from destructive behavior. In a world where basic needs are provided for automatically, status-seeking behavior doesn't evaporate; it just no longer has a productive direction that anyone can make use of. Now we have idle young men at the peak of their status-seeking behavior with few productive avenues available to them. It's not hard to predict this doesn't end well.
Beyond the issues of young males, there's many other ways for degenerate behavior to cause problems. Drinking, gambling, drugs, being a general nuisance, all these things will skyrocket if people have endless time to fill. Just during the pandemic, we saw the growth of roving gangs riding ATVs in some cities causing a serious disturbance. Some cities now have a culture of teenagers hijacking cars. What happens to these people who are on the brink when they no longer see the need to go to school because their basic needs are met? Nothing good, that's for sure.
What exactly do you think would happen? Usually wars are about resources. When resource distribution stops being a problem (i.e, anyone can live like a king just by existing), where exactly does a problem manifest?
All the "degenerate activities" you mentioned are a problem in the first place because in a scarcity-based society they slow down/prevent people from working, therefore society is worse off. That logic makes no sense in a world where people don't need to put a single drop of effort for society to function well.
>All the "degenerate activities" you mentioned are a problem in the first place because in a scarcity-based society they slow down/prevent people from working
This is a weird take. Families are worse off if a parent has an addiction because it potentially makes their lives a living hell. Everyone is worse off if people feel unsafe because of a degenerate sub-culture that glorifies things like hijacking cars. People who don't behave in predictable ways create low-trust environments which impacts everyone.
I would say that those attitudes are 99% caused by resource-related issues. There's a reason why drug abuse (and antisocial behavior generally) is mostly found among the lower classes.
If I could pick between the world we are in now and one where all the problems societies face that are related, directly or indirectly, to the distribution of resources are eliminated, I would pick the latter in a heartbeat. The "price to pay" in the form of a possible uptick in "degeneracy" during the first few months/years is worth it, not to mention that I doubt that problem would arise at all.
It's a dangerous fantasy to think that all societal problems are caused by uneven distribution of wealth and that they will be solved by redistribution. No, some people just aren't psychologically suited to the modern world, whether that involves delaying gratification or rejecting low effort, high dopamine stimulation. The structure involved in necessary work and the social structures that lead people down productive paths are one way we collectively cope with the incongruence between our society and our psychology. Take away these structures and the results have the potential to be massively destabilizing.
You're just saying it's desirable that some people be at the bottom even in a scenario where the opposite could be feasibly achieved. All on some theory that the human mind (or at least some instances of it in the population) simply... won't be able to take it without going insane?
We should need a much, much higher standard of proof for what could result in unnecessary pain and suffering for years. Especially when this:
> some people just aren't psychologically suited to the modern world, whether that involves delaying gratification or rejecting low effort, high dopamine stimulation.
...is not a proven fact, and is, with respect to social media, highly contested and inconclusive.
>You're just saying it's desirable that some people be at the bottom even in a scenario where the opposite could be feasibly achieved.
What's wrong with having people at the relative bottom? Trying to force equality onto society does not have a good track record. We can raise the absolute bottom past the point of poverty while also not upending social structures that have served us well for centuries.
>All on some theory that the human mind... simply... won't be able to take it without going insane?
I'm saying transformative change across the whole of society shouldn't be undertaken lightly. I don't need to prove that a world where human labor is obsolete would be damaging to the human psyche. Those who want to rush ahead just assume things will be just fine. They have the burden of proof. We've seen how bad things can get when the social engineers get it wrong. We're at a local peak in human flourishing for a large part of humanity. Why should we pull the lever on the unknown in hopes that we will come out ahead?
> And you think these kinds of maxims formed out of vacuums?
No, they formed in societies where it WAS necessary for most people to work in order to support the community. We needed a lot of labor to survive, so it was important to incentivize people to work hard, so our cultures developed values around work ethics.
As we move more and more towards a world where we actually don’t need everyone to work, those moral values become more and more outdated.
This is just like old religious rules around eating certain foods; in the past, we were at risk from a lot of diseases and avoiding certain foods was important for our health. Now, we don’t face those same risks so many people have moved on from those rules.
>those moral values become more and more outdated.
Do you think there was ever a time in human societies where the vast majority of people didn't have to "work" in some capacity, at least since the rise of psychologically modern humans? If not, why think humanity as a whole can thrive in such an environment?
Our environment today is completely different from what it was even 100 years ago. Yes, you have to ask this question for every part of modern society (fast travel, photographs, video, computers, antibiotics, vaccines, etc), so I am not sure why work is different.
Part of the problem is that we don't ask these questions when we should be. Social media, for example, represents a unique assault on our psychological makeup that we just uncritically unleashed on the world. We're about to do it again, likely with even worse consequences.
What would "asking these questions" entail? Would you have a committee that decides what new things we would allow? Popular vote? I get the idea, I just know see how you could ever actually do anything about this issue unless you completely outlawed anything new.
I don't think it's plausible to have a committee approve all new technology. But it is plausible to have a committee empowered to place limits on technology that we can predict will cause a social upheaval the likes of which we've never seen in modern times. It's not like we haven't done the equivalent of this before, with e.g. nuclear and bioengineering technology. The difficulty is that the speed at which AI is being developed means government bureaucracies are necessarily playing catchup. But it can be done. We just need to accept that we're not powerless to shape our collective futures. We are not at the mercy of technology and the few accelerationists who stand to be the new aristocracy in the new world.
I find this comment to be completely shortsighted.
We now have western societies with a growing population of homeless people, that despite having access to tons of resources at their disposal, still can't get their shit together. A great majority are doing drugs and smoking/abusing alcohol.
And it's enough to have 20 crackheads to destroy a neighborhood of 10000 hard-working, peaceful people.
The way most of the world is setup we will need to first address the unprecedented crisis of financing our day to day lives. We figure that out and I’m sure people will find other sources of meaning in their lives.
The people that truly enjoy their work and obtain meaning from it are vastly over represented here on HN.
Very few would be scared of AI if they had a financial stake in its implementation.
“We should do away with the absolutely specious notion that everybody has to earn a living. It is a fact today that one in ten thousand of us can make a technological breakthrough capable of supporting all the rest. The youth of today are absolutely right in recognizing this nonsense of earning a living. We keep inventing jobs because of this false idea that everybody has to be employed at some kind of drudgery because, according to Malthusian Darwinian theory he must justify his right to exist. So we have inspectors of inspectors and people making instruments for inspectors to inspect inspectors. The true business of people should be to go back to school and think about whatever it was they were thinking about before somebody came along and told them they had to earn a living.” — Buckminster Fuller
It may be impossible in this world to expect a form of donation, but it is certainly not impossible to expect forms of investment.
One idea I had is everyone is paid a thriving wage, and in exchange, if they in the future develop their passion into something that can make a profit, they pay back 20% of their profits they make up to some capped amount.
This allows for extreme generality. It truly frees people to pursue whatever they fancy every day until they catch lightning in a bottle.
There would be zero obligation as to what to do and when to pay back the money. But of course it would have to be open only to honest people, so that neither side is exploiting the other.
Both sides need a sense of gratitude, and wanting to give back. A philanthropic 'flair' "If it doesn't work out, it's okay", and a gratitude and wanting to give back someday on the side of the receiver, as they continue working on probably the most resilient thing they could ever work on (the safest investment), their lifelong passion.
I think of ChatGPT as a faster Google or Stackoverflow and all of my colleagues are using it almost exclusively in this way. That is still quite impressive but it isn’t what Altman set out to achieve (and he admits this quite candidly).
What would make me change my mind? If ChatGPT could take the lead on designing a robot through all the steps: design, contract the parts and assembly, market it, and sell it that would really be something.
I assume for something like this to happen it would need all source code and design docs from Boston Dynamics in the training set. It seems unlikely it could independently make the same discoveries on its own.
> I assume for something like this to happen it would need all source code and design docs from Boston Dynamics in the training set. It seems unlikely it could independently make the same discoveries on its own.
No, to do this it would need to be able to independently reason, if it could do that, then the training data stops mattering. Training data is a crutch that makes these algos appear more intelligent than they are. If they were truly intelligent they would be able to learn independently and find information on their own.
> I’m about 95% sure that will be terrible for the vast, vast majority of humans, we'll be obsolete.
This isn't a criticism of you, but this is a very stupid idea that we have. The economy is meant to serve us. If it can't, we need to completely re-organize it, because the old model has become invalid. We shouldn't exist to serve the economy. That's an absolutely absurd idea that needs to be killed in every single one of us.
> we need to completely re-organize it because the old model has become invalid
that's called social revolution, and those who benefit from the old model (currently that would be the holders of capital, and more so as AI grows in its capabilities and increasingly supplants human labor) will do everything in their power to prevent that re-organization
Nevertheless the modern economy has been deliberately designed. Emergent behaviors within it at the highest levels are actively monitored and culled when deemed not cost effective or straight out harmful.
The problem is no one is talking about this. We’re clearly headed towards such a world, and it’s irrelevant whether this incarnation will completely achieve that.
And anyone who poo poos ChatGPT needs to remember we went from “this isn’t going to happen in the next 20 years” to “this is happening tomorrow” overnight. It’s pretty obvious I’m going to be installing Microsoft Employee Service Pack 2 in my lifetime.
Very true but the question, as always, is by what means we can enact this change? The economy may well continue to serve the owner class even if all workers are replaced with robots.
I think the options are pretty clear. A negotiation of gradual escalation: Democracy, protests, civil disobedience, strikes, sabotage and if all else fails then at some point, warfare.
Great theory. In reality, the vast majority of us serves only the economy without getting anything truly valuable in return. We serve it, without noticing it, and grow into shells that are less human and more merely individual. Machines of the Economy.
This doesn't engage with the problem of coordinating everyone around some proposed solution and so is useless. Yes, if we could all just magically decide on a better system of government, everything would be great!
Identifying the problem is never useless. We need the right understanding if we're going to move forward. Believing we serve the economy and not the other way around hinders any progress on that front and so inverting it is a solid first step.
Until you try and you find that all the arable land is already occupied by industrial agriculture, the ADMs/Cargills of the world, using capital intensive brute force uniformity to extract more value from the land than you can compete with, while somehow simultaneously treating the earth destructively and inefficiently.
This is both a metaphor for AGI and not a metaphor at all.
Sure, if you can survive the period between the obsolescence of human labor and the achievement of post-scarcity. Do you really think that period of time is zero, or that the first version of a post-scarcity economy will be able to carry the current population? No, such a transition implies a brutish end for most.
Sorry, I was being too subtle. When nobody has a job anymore and the economy is crashing, I'm looking forward to moving into the country and becoming self sufficient.
We'll be very very poor, and it will be really hard work, but I'm looking forward to the challenge.
Human labour will never be obsolete because you can always work for yourself.
Post scarcity will never happen unless some benevolent AI god chooses to give it to us like in a Banks novel.
Think more deeply: who benefits from superintelligence? In the end it is a game of what humans naturally desire. AI has no incentives and is not controlled by hormones.
It's already impacting some of us. I hope it never appears until human civilization undergoes a profound change. But I'm afraid many rich people want that to happen.
LLMs still completely won't admit that they're wrong, that they don't have enough information, or that the information could have changed - asking anything about Svelte 5 is an incredible experience currently.
At the end of the day it's a tool currently, with surface-level information it's incredibly helpful in my opinion - Getting an overview of a subject or even coding smaller functions.
What's interesting in my opinion is "agents" though... not in the current "let's slap an LLM into some workflow", but as a concept that is at least an order of magnitude away from what is possible today.
Working with Svelte 5 and LLMs is a real nightmare.
AI agents are really interesting. Fundamentally they may represent a step toward the autonomization of capital, potentially disrupting "traditional legal definitions of personhood, agency, and property" [0] and leading to the need to recognize "capital self-ownership" [1].
It's fairly easy to prompt an LLM in a way where they're encouraged to say they don't know. Doesn't work 100% but cuts down the hallucinations A LOT. Alternatively, follow up with "please double check..."
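As a minimal sketch of what that kind of prompting can look like: the `call_llm` helper below is a hypothetical stand-in for whichever chat API you actually use, not a real library call, and the wording of the prompts is just one plausible variant.

```python
# Hedged sketch: `call_llm` is a hypothetical placeholder for your chat API of choice.
def call_llm(messages):
    """Placeholder: send the message list to an LLM and return its reply text."""
    raise NotImplementedError("wire this up to your provider's chat endpoint")

def ask_with_escape_hatch(question):
    # Explicitly give the model permission to say it doesn't know.
    system = ("Answer only if you are confident. "
              "If you are unsure, or the information may have changed since your "
              "training data, say 'I don't know' instead of guessing.")
    messages = [{"role": "system", "content": system},
                {"role": "user", "content": question}]
    answer = call_llm(messages)

    # The "please double check..." follow-up mentioned above.
    messages += [{"role": "assistant", "content": answer},
                 {"role": "user", "content": "Please double-check that answer and "
                                             "point out anything you are unsure about."}]
    return call_llm(messages)
```

This doesn't make the output trustworthy, it just lowers the rate of confident nonsense, which matches the "doesn't work 100%" caveat above.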
I have never personally met any malicious actor who knowingly dumps unverified shit straight from GPT. However, I have met people IRL who gave way too much authority to those quantized model weights and got genuinely confused when the generated text didn't agree with human-written technical information.
To them, chatgpt IS the verification.
I am not optimistic about the future. But perhaps some amazing people will deal with the error for the rest of us, like how most people don't go and worry about floating point error, and I'm just not smart enough to see what that looks like.
Reminds me of the stories about people slavishly following Apple or Google maps navigation when driving, despite the obvious signs that the suggested route is bonkers, like say trying to take you across a runway[1].
This comment reads like a culture problem not an LLM problem.
Imagine for a moment that you work as a developer, encounter a weird bug, and post your problem into your company’s Slack. Other devs then send a bunch of StackOverflow links that have nothing to do with your problem or don’t address your central issue. Is this a problem with StackOverflow or with coworkers posting links uncritically?
I develop sophisticated LLM programs every day at a small YC startup — extracting insights from thousands of documents a day.
These LLM programs are very different than naive one-shot questions asked of ChatGPT, resembling o1/3 thinking that integrates human domain knowledge to produce great answers that would have been cost-prohibitive for humans to do manually.
Naive use of LLMs by non-technical users is annoying, but is also a straw-man argument against the technology. Smart usage of LLMs in o1/3 style of emulated reasoning unlocks entirely new realms of functionality.
LLMs are analogous to a new programming platform, such as iPhones and VR. New platforms unlock new functionality along with various tradeoffs. We need time to explore what makes sense to build on top of this platform, and what things don’t make sense.
What we shouldn’t do is give blanket approval or disapproval. Like any other technology, we should use the right tool for the job and utilize said tool correctly and effectively.
There is nothing to build on top of this AI platform, as you call it. AI is nothing but an autocorrect program; AI is not innovating anything anywhere. It surprises me how much even the smartest people are deceived by simple trickery and continue to fall for every illusion.
>Naive use of LLMs by non-technical users is annoying, but is also a straw-man argument against the technology. Smart usage of LLMs in o1/3 style of emulated reasoning unlocks entirely new realms of functionality.
I agree in principle, but disagree in practice. With LLMs available to everyone, the uses we're seeing currently will only proliferate. Is that strictly a technology problem? No, but it's cold comfort given how LLM usage is actually playing out day-to-day. Social media is a useful metaphor here: it could potentially be a strictly useful technology, but in practice it's used to quite deleterious effect.
Pretty much. It should be considered rude to send AI output to others without fact checking and editing. Anyone asking a person for help isn’t looking for an answer straight from Google or ChatGPT.
This may be the "cell phones in public" stage, but society has completely failed to adapt well to ubiquitous cell phone usage. There are many new psychological and behavioral issues associated with cell phone usage.
I find this kind of argument comes up a lot and it seems fundamentally flawed to me.
1. You can set a bar wherever you want for a level of "seriousness" and huge swathes of real world work will fall below it, and are therefore attractive to tackle with these systems.
2. We build critical large scale systems out of humans, which are fallible and unverifiable. That's not to say current LLMs are human or equivalent, but "we can't verify X works all the time" doesn't stop us doing exactly that a lot. We deal with this by learning how humans make mistakes, and why, and build systems of checks around that (see the sketch after this comment). There is nothing in my mind that stops us doing the same with other AI systems.
3. Software is written by, checked by and verified by humans at least at some critical point - so even verified software still has this same problem.
We've also been doing this kind of thing with ML models for ages, and we use buggy systems for an enormous amount of work worldwide. You can argue we shouldn't and should have fully formally verified systems for everything, but you can't deny that right now we have large serious systems without that.
And if your goal is "replace a human" then I just don't think you can reasonably say that it requires verifiable software.
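For what it's worth, here is a rough sketch of what "systems of checks" around a fallible component can look like in code. The `generate` and `validate` functions are hypothetical placeholders (not any particular library's API); the point is only the structure: never accept unreviewed output.

```python
# Hedged sketch: treat the model like a fallible worker inside a system of checks.
def generate(task: str) -> str:
    """Hypothetical: ask the unreliable component (LLM, human, ...) for an answer."""
    raise NotImplementedError

def validate(task: str, answer: str) -> bool:
    """Hypothetical: an independent check - tests, schema validation, a second reviewer."""
    raise NotImplementedError

def checked_answer(task: str, max_attempts: int = 3) -> str:
    for attempt in range(max_attempts):
        answer = generate(task)
        if validate(task, answer):   # only accept output that passes the independent check
            return answer
    raise RuntimeError("no answer passed validation; escalate to a human")
```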
> Systems are not explainable, as they have no model of knowledge and no representation of any ‘reasoning’.
Neither of those statements is true, is it? There are internal models, and recent models are designed around having a representation of reasoning before replying.
> current generative AI systems represent a dead end, where exponential increases of training data and effort will give us modest increases in impressive plausibility but no foundational increase in reliability
And yet reliability is something we see improve as LLMs get better and we get better at training them.
Honestly, I think it's nothing special to say that certain technologies have an end point.
We had lots of advancements in single core CPUs but eventually more than that was necessary, now the same is happening with monolithic chips vs chiplet designs.
Same for something like HTTP/1.1 and HTTP/2 and now HTTP/3.
Same for traditional rendering vs something like raytracing and other approaches.
I assume it's the same for typical spell checking and writing assistants vs LLM based ones.
That it's the same for typical autocomplete solutions vs LLM based ones.
It does seem that there weren't prior technological solutions for images/animations/models etc. (maybe the likes of Mixamo and animation retargeting, but not much for replacing a concept artist at shops that can't afford one).
Each technology, including the various forms of AI, has its limitations, no matter how much money has been spent on training the likes of the models behind ChatGPT. Nothing wrong with that; I'll use LLMs for what they're good for and look for something else once new technologies become available.
>...developing software to align with the principle that impactful software systems need to be trustworthy, which implies their development needs to be managed, transparent and accountable.
The author severely discounts the value of opacity and unaccountability in modern software systems. Large organizations previously had to mitigate moral hazard with unreliable and burdened-with-conscience labor. LLM-style software is superior on every axis in this application.
“In my mind, all this puts even state-of-the-art current AI systems in a position where professional responsibility dictates the avoidance of them in any serious application.”
And yet here we are with what we all think of as serious and seriously useful applications.
“My first 20 years of research were in formal methods, where mathematics and logic are used to ensure systems operate according to precise formal specifications, or at least to support verification of implemented systems.”
I think recommending avoiding building anything serious in the field until your outdated verification methodology catches up is unreasonably cynical, but also naive because it discards the true nature of our global society and assumes a lab environment where this kind of control is possible.
Author here. Unfair point - the article makes clear I don't expect "outdated verification methodology" to catch up, and I even link that pessimistic expectation with the issue of emergence that fatally undermines NNs.
Point about lab vs reality is fair.
Anyone making big bold claims about what LLMs definitely CAN or CANNOT do is FULL OF SHIT. Not even the world's top experts are certain where the limits of these technologies are, and we are already connecting them to tools, making them agentic, etc., so the era of 'pure' LLM chatbots is already dead imo.
The perspective of the article seems to be of a person who has not worked in cookie-cutter software engineering environments, which is what 99% of software engineering is. Here formal methods and verification are irrelevant, nobody has heard of it, and nobody cares. It's about churning out some crappy internal webapp or mobile app or data eng/science pipeline at the lowest cost by some soulless bigco (or startup). LLMs are already super useful for this.
Also, the argument about LLMs being black-box is a miss, because LLMs are writing software, they are not the software; programmers writing software are also black boxes. Also, there's nothing in the way of running formal methods on the software produced by an LLM, it will fail the same as it fails on software written by humans, since most formal verification doesn't even make sense for 99% of software.
Also, anybody who has used LLMs as aids for writing simple/smaller chunks of code (and other documents) knows that they're super useful (sometimes magic). It's like Steve Ballmer saying the iphone is a joke in 2007.
Author here. Fair enough on my industry experience. But I hope components, unit testing, regression testing, etc aren't as easily dismissed in real SE environments - no trouble believing formal methods and verification are off the radar.
The article is not about using AI to write code (which may work to some level of satisfaction for some people) but about using AI as code.
Thanks for your reply! Hope my comment wasn't offensive.
I definitely think components, unit testing, and regression testing are good things and are done at good software houses. In my experience, however, most of these things are cargo-culted at best in many other environments.
When I wrote my comment I was wondering about the "AI to write code" vs "AI as code" point. In my vocabulary, "AI as code" would be "Data Science models", like a ranking engine for ads in a newsfeed? I certainly understand the idea of having an AI "emulate" an application like Word.exe or Doom.exe, and there's been research into this direction, but as far as I can tell that is not the general direction the industry is headed in --- rather it's the "AI to write code" direction.
Thanks. In a broader sense, with "AI as code" I mean any situation where we ask an AI model for answers or decisions where we otherwise might have written a program to solve it. See also "LLM functionalism" in the article. Particularly where we need to rely on the outcome - so not "predictions", "suggestions", or "recommendations" all of which we expect to have limited reliability which we mitigate through modifying or ignoring.
An actual "thinking machine" would be constantly running computations on its accumulated experience in order to improve its future output and/or further compress its sensory history.
An LLM is doing exactly nothing while waiting for the next prompt.
Self-prompting via chain-of-thought and tree-of-thought can be used in combination with an updating memory of knowledge graphs, cognitive architectures like SOAR, and a continuous stream of new information and sensory data, with an LLM at the heart of that system, and it will be exactly a "thinking machine". The problem is that it's currently very expensive to run inference continuously, and all the engineering around memory storage (like RAG patterns) and cognitive architecture design is a work in progress. It's coming soon though.
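To make the loop being described concrete, here is a toy sketch of self-prompting with an external memory. `call_llm` and the plain-list memory are hypothetical placeholders standing in for a real model call and a real knowledge graph or vector store; this is an illustration of the shape of the idea, not any existing framework.

```python
# Hedged sketch of a self-prompting loop with external memory.
def call_llm(prompt: str) -> str:
    """Hypothetical stand-in for a single LLM call."""
    raise NotImplementedError

memory: list[str] = []          # crude stand-in for a knowledge graph / vector store

def agent_step(observation: str) -> str:
    context = "\n".join(memory[-20:])               # retrieve recent memories
    thought = call_llm(f"Context:\n{context}\n\nObservation: {observation}\n"
                       "Think step by step, then state one action.")
    memory.append(f"obs: {observation} | thought: {thought}")   # update memory
    return thought

# In a full system this loop would run continuously over incoming sensory data,
# which is part of why continuous inference is so expensive in practice.
```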
If it's coalescing learning in realtime across all users/sessions, that's more constant than you're maybe giving it credit for. I'm not sure if GPT-4o and friends are actually built that way though.
We are thinking machines, and we keep thinking because we have one goal, which is to survive; machines have no such true goals. I mean "true" because our biology forces that on us.
There is a limited amount of computation that you can usefully do in the absence of new input (like an LLM between prompts). If you do as much computation as you usefully can (within your current algorithmic limits) in a burst immediately when you receive a prompt, output, and then go into a sleep state, that seems obviously better than receiving a prompt, outputting, and then doing some of the computation you could usefully do after your output.
I see people say this all the time and it sounds like a pretty cosmetic distinction. Like, you could wire up an LLM to a systemd service or cron job and then it wouldn’t be “waiting”, it could be constantly processing new inputs. And some of the more advanced models already have ways of compressing the older parts of their context window to achieve extremely long context lengths.
Not only does a training pass take more time and memory than an inference pass, but if you remember the Microsoft Tay incident, it should be self-explanatory why this is a bad idea without a new architecture.
LLMs are a valuable tool for augmenting productivity. Used properly, they do give you a competitive advantage over someone who isn't using them.
The "dead end" is in them being some magical replacement for skilled employees. The levels of delusion pumping out of SV and AI companies desperate to make a buck is unreal. They talk about chat bots like they're already solving humanity's toughest problems (or will be in "just two more weeks"). In reality, they're approximately good at solving certain problems (and they can only ever solve them from the POV of existing human knowledge—they can't create). You still have to hold their hand quite a bit.
This current wave of tech is going to have an identical outcome to the "blockchain all the things" nightmare from a few years back.
Long-term, there's a lot of potential for AI but this is just a significant step forward. We're not "there" yet and won't be for some time.
Seems completely nonsensical. Yes, neural networks themselves are not unit testable, modular, symbolic or verifiable. That’s why we have them produce code artifacts - which possess all those traits and can be reviewed by both humans and other machines. It’s completely analogous to human software engineers, who are unfortunately black boxes as well.
More broadly, I’ve learned to attach 0 credence to any conceptual argument that an approach will not lead somewhere interesting. The hit rate on these negative theories is atrocious, they are often motivated by impure reasons, and the downside is very asymmetric (who cares if you sidestep a boring path? yet how brutal is it to miss an easy and powerful solution?)
So what have you gained in the process, other than wasting significantly higher amounts of energy in the form of heat and other emissions? It is nothing like software engineering; clearly you speak out of ignorance.
And then you say you attach 0 credence to whatever, but you give no reasons why others should buy your points. You don't really seem to have much of a point, anyway.
My argument: the theoretical limitations of NNs (lack of modularity, symbolic reasoning, verifiability) cause no practical problems to usefulness - we can just analyze the code artifacts as we do with human programmers. Do you disagree?
Yes. These limitations are not theoretical at all. The author touches on compositionality --- how a problem/program can be decomposed into smaller, orthogonal problems/programs, reasoned about and tested separately, and then abstracted away behind an interface that hides the implementation details. This is the essence of programming and software engineering at large, whether you're programming in assembly, Java, or Haskell: to divide and conquer so that we can fit an isolated aspect of the program in brain cache and reason about it. This is a fundamental limitation and will not change until the year 40,000 when we have Space Marines.
A neural network, conversely, is a big ball of mud. Impossible to reason about and to test except for whole-system, end-to-end testing, which is impossible to do exhaustively because of the size of the state space. It is, by design, unexplainable and untestable, and therefore unreliable. It's why you use globals in C only judiciously. (I am just rephrasing the article here, not saying anything new.)
And the evidence that it causes practical problems to usefulness is already out there; "hallucinations" are simply errors, just that corporate PR likes to pretend that it's a "feature" and not a bug. This is delusional. A society seeking digitalization should run away from this level of stupidity.
Useless and dead end aren’t synonymous. It’s most certainly a dead end, but it’s also not useless.
There a lot of comments here already conflating these two.
This article is also pretty crap. There’s a decent summary box but other than that it’s all regurgitated half-wisdoms we’ve all already realized: things will change, probably a lot; nobody knows what the end goal is or how far we are from it; the next quantum leap almost certainly depends on a transcendent architecture or new model entirely.
This whole article could’ve been a single paragraph honestly, and a lot of the comments here probably wouldn’t have read that either… just sayin
Betteridge's law of headlines, current AI may absolutely be a dead end, but fortunately technology is evolving and changing - who knows what the future will hold.
> Eerke Boiten, Professor of Cyber Security at De Montfort University Leicester, explains his belief that current AI should not be used for serious applications.
> In my mind, all this puts even state-of-the-art current AI systems in a position where professional responsibility dictates the avoidance of them in any serious application.
> Current AI systems also have a role to play as components of larger systems in limited scopes where their potentially erroneous outputs can be reliably detected and managed, or in contexts such as weather prediction where we had always expected stochastic predictions rather than certainty.
I think it's important to note that:
- Boiten is a security expert, but doesn't have a background working in ML/AI
- He never defines what "serious application" means, but apparently systems that are designed to be tolerant of missed predictions are not "serious".
He seems to want to trust a system at the same level that he trusts a theorem proved with formal methods, etc.
I think the frustrating part of this article is that from a security perspective, he's probably right about his recommendations, but he seems off-base in the analysis that gets him there.
> Current AI systems have no internal structure that relates meaningfully to their functionality. They cannot be developed, or reused, as components.
Obviously AI systems do have internal structure, and there are re-usable components both at the system level (e.g. we pick an embedding, we populate some vector DB with contents using that embedding, and create a retrieval system that can be used in multiple ways). The architecture of models themselves also has components which are reused, and we make choices about when to keep them frozen versus when to retrain them. Any look at architecture diagrams in ML papers shows one level of these components.
> exponential increases of training data and effort will give us modest increases in impressive plausibility but no foundational increase in reliability.
I think really the problem is that we're fixated on mostly-solving an ever broader set of problems rather than solving the existing problems more reliably. There are plenty of results about ensembling and learning theory that give us a direction to increase reliability (by paying for more models of the same size), but we seem far more interested in seeing if we can solve problems at a higher level of sophistication most of the time. That's a choice that we're making. Similarly, Boiten mentions the possibility of models with explicit confidences -- and there's been plenty of work on that, but because there's a tradeoff with model size (i.e. do you want to spend your resources on a bigger model, or on explicitly representing variance around a smaller number of parameters?) people seem mostly uninterested.
I think there are real reasons to be concerned about the specific path we're on, but these aren't the good ones.
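As a concrete (if simplified) illustration of the ensembling point above - buying reliability with more compute at the same model size - here is a self-consistency style majority vote over repeated samples. `sample_answer` is a hypothetical stand-in for one stochastic call to a fixed-size model, not a real API.

```python
from collections import Counter

def sample_answer(question: str) -> str:
    """Hypothetical: one stochastic call to a fixed-size model."""
    raise NotImplementedError

def majority_vote(question: str, n: int = 11) -> str:
    # Pay for n samples of the same model instead of one call to a bigger model,
    # then keep the most common answer. Reliability tends to improve with n when
    # individual samples are better than chance and their errors aren't perfectly correlated.
    votes = Counter(sample_answer(question) for _ in range(n))
    return votes.most_common(1)[0][0]
```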
Author here. Serious systems undefined, fine - I'd view them as systems that take decisions, not just make predictions (see final quoted line!).
"Want to trust a system at the same level as ... a theorem" - straw man, explicitly denied in the article. Components .previously used and tested in diverse ways, testing coverage of some sort, would already be really good. Yes AI systems have architectural components such as embeddings, but they relate to types of functionalities and applications, not to functionalities.
The point about solving more problems at a low reliability level rather than solving problems at a higher reliability level is highly interesting though!
"Some" reason is Santa Claus, right? Unlike the eniac, energy consumption only goes up, exponentially. Climate change is no joke, people are dying so that you can churn out more fake pics and SEO spam.
I’m surprised this article merits 700+ comments. Why y’all engage with such drivel?
It’s well established that disruptive technologies don’t appear to have any serious applications, at first. But they get better and better, and eventually they take over.
PG talks about how new technologies seem like toys at first, the whole Innovator Dilemma is about this…so well established within this community.
Just ignore it and figure out where the puck is moving toward.
I am a simple man. In 2022 I glanced through Attention Is All You Need and forgot about it. A lot of people made money. A lot of people believed that the end of programmers and designers was absolute. Some people on stage announced the death of coding. Others bravely explored a future in which people are not needed for creative work.
Aside from the anger that this public stupidity produced in me, I always knew that this day would come.
Maybe next time someone will have the balls not to call a text generator with inherent hallucinations "Intelligence"? Who knows. Miracles can happen. :)
Pushing something to the limit requires a lot of funding; if the public never got overexcited about some tech, many really cool things would never have been tried. Also, LLMs are pretty useful even as is. They sure made me more productive.
I just imagine a world in which an industry defined by determinism and facts has the bravery to call a spade a spade. LLMs have a function. Machine learning also. But calling LLMs "Intelligence" and pushing the hype into overdrive?
The launch of ChatGPT had an amount of hype that was downright confusing for someone who had previously downloaded and fine-tuned GPT-2. Everyone who hadn't used a language model said it was revolutionary, but it was obviously evolutionary,
and I'm not sure the progress is linear; it might be logarithmic.
GenAI in its current state has some uses, but I fear that mostly ChatGPT is hallucinating false information of all kinds into the minds of uninformed people who think GPT is actually intelligent.
Everyone who actually works on this stuff, and didn't have ulterior motives in hyping it up to (over)sell it, has been identifying themselves as such and providing context for the hype since the beginning.
The furthest they got before the hype machine took over was introducing the term "stochastic parrot" to popular discourse.
If you mean "exactly as architected currently", then yes, current Transformer-based generative models can't possibly be anything other than a dead end. The architecture will need to change at least a little bit, to continue to make progress.
---
1. No matter how smart they get, current models are "only" pre-trained. No amount of "in-context learning" can allow the model to manipulate the shape and connectivity of the latent state-space burned into the model through training.
What is "in-context learning", if not real learning? It's the application of pre-learned general and domain-specific problem-solving principles to novel problems. "Fluid intelligence", you might call it. The context that "teaches" a model to solve a specific problem, is just 1. reminding the model that it has certain general skills; and then 2. telling the model to try applying those skills to solving this specific problem (which it wouldn't otherwise think to do, as it likely hasn't seen an example of anyone doing that in training.)
Consider that a top-level competitive gamer, who mostly "got good" playing one game, will likely nevertheless become nearly top-level in any new game they pick up in the same genre. How? Because many of the skills they picked up while playing their favored game, weren't just applicable to that game, but were instead general strategic skills transferrable to other games. This is their "fluid intelligence."
Both a human gamer and a Transformer model derive these abstract strategic insights at training time, and can then apply them across a wide domain of problems.
However, the human gamer can do something that a Transformer model fundamentally cannot do. If you introduce the human to a game that they mostly understand, but which is in a novel genre where playing the game requires one key insight the human has never encountered... then you will expect that the human will learn that insight during play. They'll see the evidence of it, and they'll derive it, and start using it. They will build entirely-novel mental infrastructure at inference time.
A feed-forward network cannot do this.
If there are strategic insights that aren't found in the model's training dataset, then those strategic insights just plain won't be available at inference time. Nothing the model sees in the context can allow it to conjure a novel piece of mental infrastructure from the ether to then apply to the problem.
Whether general or specific, the model can still only use the tools it has at inference time — it can't develop new ones just-in-time. It can't "have an epiphany" and crystallize a new insight from presented evidence. It's not doing the thing that allows that to happen at inference time — with that process instead exclusively occurring (currently) at training time.
And this is very limiting, as far as we want models to do anything domain-specific without having billion-interaction corpuses to feed them on those domains. We want models to work like people, training-wise: to "learn on the job."
We've had simpler models that do this for decades now: spam filters are trained online, for example.
I would expect that, in the medium term, we'll likely move somewhat away from pure feed-forward models, toward models with real online just-in-time training capabilities. We'll see inference frameworks and Inference-as-a-Service platforms that provide individual customers with "runtime-observed in-domain residual-error optimization adapters" (note: these would not be low-rank adapters!) for their deployment, with those adapters continuously being trained from their systems as an "in the small" version of the async "queue, fan-in, fine-tune" process seen in Inf-aaS-platform RLHF training.
And in the long term, we should expect this to become part of the model architecture itself — with mutable models that diverge from a generic pre-trained starting point through connection weights that are durably mutable at inference time (i.e. presented to the model as virtual latent-space embedding-vector slots to be written to), being recorded into a sparse overlay layer that is gathered from (or GPU-TLB-page-tree Copy-on-Write'ed to) during further inference.
---
2. There is a kind of "expressivity limit" that comes from generative Transformer models having to work iteratively and "with amnesia", against a context window comprised of tokens in the observed space.
Pure feed-forward networks generally (as all Transformer models are) only seem as intelligent as they are, because, outside of the model itself, we're breaking down the problem it has to solve from "generate an image" or "generate a paragraph" to instead be "generate a single convolution transform for a canvas" or "generate the next word in the sentence", and then looping the model over and over on solving that one-step problem with its own previous output as the input.
Now, this approach — using a pure feed-forward model (i.e. one that has constant-bounded processing time per output token, with no ability to "think longer" about anything), and feeding it the entire context (input + output-so-far) on each step, then having it infer one new "next" token at a time rather than entire output sequences at a time — isn't fundamentally limiting.
After all, models could just amortize any kind of superlinear-in-compute-time processing across the inference of several tokens. (And if this was how we architected our models, then we'd expect them to behave a lot like humans: they'd be "gradually thinking the problem through" while saying something - and then would sometimes stop themselves mid-sentence and walk back what they said, because their asynchronous long-thinking process arrived at a conclusion that invalidated previous outputs of their surface-level predict-the-next-word process.)
There's nothing that says that a pure feed-forward model needs to be stateless between steps. "Feed-forward" just means that, unlike in a Recurrent Neural Network, there's no step where data is passed "upstream" to be processed again by nodes of the network that have already done work. Each vertex of a feed-forward network is only visited (at most) once per inference step.
But there's nothing stopping you from designing a feed-forward network that, say, keeps an additional embedding vector between each latent layer, that isn't overwritten or dropped between layer activations, but instead persists outside the inference step, getting reused by the same layer in the next inference step, where the outputs of layer N-1 from inference-step T-1 are combined with the outputs of layer N-1 from inference-step T to form (part of) the input to layer N at inference-step T. (To have a model learn to do something with this "tool", you just need to ensure its training is measuring predictive error over multi-token sequences generated using this multi-step working-memory persistence.)
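A toy, PyTorch-flavoured sketch of that idea: a feed-forward block that keeps one persistent vector per layer across inference steps and mixes it into the current activations. This is my own illustrative reading of the paragraph above, not an existing architecture, and the layer and buffer names are made up.

```python
import torch
import torch.nn as nn

class LayerWithWorkingMemory(nn.Module):
    """Toy sketch: a feed-forward block that persists one vector between inference steps."""
    def __init__(self, d_model: int):
        super().__init__()
        self.ff = nn.Linear(2 * d_model, d_model)
        # Persistent per-layer state, kept outside any single inference step.
        self.register_buffer("memory", torch.zeros(d_model))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Combine this step's layer input with the state retained from the previous step.
        h = self.ff(torch.cat([x, self.memory.expand_as(x)], dim=-1))
        # Retain a summary of this step's output for the next inference step.
        # (detach() keeps this sketch inference-only; learning to use the memory would
        # require training on multi-token sequences, as noted above.)
        self.memory = h.detach().mean(dim=0)
        return h

# Still feed-forward within a single step (no recurrence inside one pass),
# but no longer amnesiac between steps.
```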
...but we aren't currently allowing models to do that. Models currently "have amnesia" between steps. In order to do any kind of asynchronous multi-step thinking, everything they know about "what they're currently thinking about" has to somehow be encoded — compressed — into the observed-space sequence, so that it can be recovered and reverse-engineered into latent context on the next step. And that compression is very lossy.
And this is why ChatGPT isn't automatically a better WolframAlpha. It can tell you how all the "mental algorithms" involved in higher-level maths work — and it can try to follow them itself — but it has nowhere to keep the large amount of "deep" [i.e. latent-space-level] working-memory context required to "carry forward" these multi-step processes between inference steps.
You can get a model (e.g. o1) to limp along by dedicating much of the context to "showing its work" in incredibly-minute detail — essentially trying to force serialization of the most "surprising" output in the latent layers as the predicted token — but this fights against the model's nature, especially as the model still needs to dedicate many of the feed-forward layers to deciding how to encode the chosen "surprising" embedding into the same observed-space vocabulary used to communicate the final output product to the user.
Given even linear context-window-size costs, the cost of this approach to working-memory serialization is superlinear vs achieved intelligence. It's untenable as a long-term strategy.
Obviously, my prediction here is that we'll build models with real inference-framework-level working memory.
---
At that point, if you're adding mutable weights and working memory, why not just admit defeat with Transformer architecture and go back to RNNs?
Predictability, mostly.
The "constant-bounded compute per output token" property of Transformer models, is the key guarantee that has enabled "AI" to be a commercial product right now, rather than a toy in a lab. Any further advancements must preserve that guarantee.
Write-once-per-layer long-term-durable mutable weights preserve that guarantee. Write-once-per-layer retained-between-inference-steps session memory cells preserve that guarantee. But anything with real recurrence, does not preserve that guarantee. Allowing recurrence in a neural network, is like allowing backward-branching jumps in a CPU program: it moves you from the domain of guaranteed-to-halt co-programs to the domain of unbounded Turing-machine software.
If you expect the AI to do independent work, yes, it is a dead end.
These LLM AIs need to be treated and handled as what they are: idiot savants with vast and unreliable intelligence.
What does any advanced organization do when they hire a new PhD, let them loose in the company or pair them with experienced staff? When paired with experienced staff, they use the new person for their knowledge but do not let them change things on their own until much later, when confidence is established and the new staffer has been exposed to how things work "around here".
The big difference with LLM AIs is they never graduate to an experienced staffer, they are always the idiot savant that is really dang smart but also clueless and needs to be observed. That means the path forward with this current state of LLM AIs is to pair them with people, personalized to their needs, and treat them as very smart idiot savants great for strategy and problem solving discussion, where the human users are driving the situation, using the LLM AIs like a smart assistant that requires validation - just like a real new hire.
There is an interactive state that can be achieved with these LLM AIs, like being in a conversation with experts, where they advise, they augment and amplify individual persons. A group of individuals adept with use of such an idiot savant enhanced environment would be incredibly capable. They'd be a force unseen in human civilization before today.
Criticisms like this are levied against an excessively narrow (obsolete?) characterisation of what is happening in the AI space currently.
After reading about o3's performance on ARC-AGI, I strongly suspect people will not be so flippantly dismissive of the inherent limits of these technologies by this time next year. I'm genuinely surprised at how myopic HN commentary is on this topic in general. Maybe because the implications are almost unthinkably profound.
Anyway, OpenAI, Anthropic, Meta, and everyone else are well aware of these types of criticisms, and are making significant, measurable progress towards architecturally solving the deficiencies.
> I strongly suspect people will not be so flippantly dismissive of the inherent limits of these technologies by this time next year.
People are flippantly dismissive of the inherent limits because there ARE inherent limitations of the technology.
> Maybe because the implications are almost unthinkably profound.
Maybe because the stuff you're pointing to are just benchmarks and the definitions around things like AGI are flawed (and the goalposts are constantly moving, just like the definition of autonomous driving). I use LLMs roughly 20-30x a day - they're an absolutely wonderful tool and work like magic, but they are flawed for some very fundamental reasons.
Humans are not machines; they have both rights that machines do not have and also responsibilities and consequences that machines will not have. For example, bad driving will cost you money, injury, prison time or even death.
Therefore AI has to be much better than humans at the task to be considered ready to be a replacement.
——
Today robot taxis can only work in fair weather conditions in locations that are planned cities. No autonomous driving system will be able to drive in Nigeria or India, or even in many European cities that were never designed for cars, any time soon.
Working in very specific scenarios is useful, but hardly a measure of their intelligence or a case for replacing humans at the task.
I hear people say this kind of thing but it confuses me.
1. What does "inherent limitations" mean?
2. How do we know something is an inherent limitation?
3. Is it a problem if arguments for a particular inherent limitation also apply to humans?
From what I've seen, people will often say things like "AI can't be creative because it's just a statistical machine", but humans are also "just" statistical machines. People might mean something like humans are more grounded because humans react not just to how the world already works but to how the world reacts to actions they take, but this difference misunderstands how LLMs are trained. Like humans, LLMs get most of their training from observing the world, but LLMs are also trained with reinforcement learning, and this will surely remain an active area of research.
One of many, but this is a simple one - LLMs are limited to knowledge that is publicly available on the internet. This is "inherent" because that's how LLMs are essentially taught the information they retrieve today.
> The question of whether a computer can think is no more interesting than the question of whether a submarine can swim. ~ Edsger W. Dijkstra
LLMs / generative models can have a profound societal and economic impact without being intelligent. The obsession with intelligence only makes their use haphazard and dangerous.
It is a good thing courts of law have established precedent that organizations deploying LLM chatbots are responsible for their output (e.g., the Air Canada LLM chatbot promising a non-existent discount being Air Canada's responsibility).
Also most automation has been happening without LLMs/Generative Models. Things like better vision systems have had an enormous impact with industrial automation and QA.
The conclusion of the article admits that in areas where stochastic outputs are expected these AI models will continue to be useful.
It’s in area where we demand correctness and determinism that they will not be suitable.
I think the thrust of this article is hard to see unless you have some experience with formal methods and verification, or else accept the author's explanations as truth.
But o3 is just a slightly less stupid idiot savant...it still has to brute force solutions. Don't get me wrong, it's cool to see how far that technique can get you on a specific benchmark.
But the point still stands that these systems can't be treated as deterministic (i.e. reliable or trustworthy) for the purposes of carrying out tasks that you can't allow "brute forced attempts" for (e.g. anything where the desired outcome is a positive subjective experience for a human).
A new architecture is going to be needed that actually does something closer to our inherently heuristic based learning and reasoning. We'll still have the stochastic problem but we'll be moving further away from the idiot savant problem.
All of this being said, I think there's plenty of usefulness with current LLMs. We're just expecting the wrong things from them and therefore creating suboptimal solutions. (Not everyone is, but the most common solutions are, IMO.)
The best solutions need to be rethinking how we typically use software since software has been hinged upon being able to expect (and therefore test) dertiministic outputs from a limited set of user inputs.
I work for an AI company that's been around for a minute (make our own models and everything). I think we're both in an AI hype bubble while simultaneously underestimating the benefits of current AI capabilities. I think the most interesting and potentially useful solutions are inherently going to be so domain specific that we're all still too new at realizing we need to reimagine how to build with this new tech in mind. It reminds me of the beginning of mobile apps. It took awhile for most us to "get it".
> After reading about o3's performance on ARC-AGI, I strongly suspect people will not be so flippantly dismissive of the inherent limits of these technologies by this time next year.
If I wasn't so slammed with work, I'd have half a mind to go dredge up at least a dozen posts that said the same thing last year, and the year before. Even OpenAI has been moving the goalposts here.
Nah, the trick with o3 solving IQ tests seems to be that they bruteforce solutions and then pick the best option. That's why calls that are trivial for humans end up costing a lot.
It still can't think and it won't think.
A LANGUAGE model (keyword: language) is just that, a language model; it should be paired with a reasoning engine that translates the inner thought of the machine into human language. It should not be the source of decisions, because it sucks at making them, even though the network can exhibit some intelligence.
We will never have AGI with just a language model. That said, most jobs people do are still at risk, even with ChatGPT-3.5-level models (especially outside of knowledge work, where difficult decisions need to be taken). So we'll see the problems of AGI and the job market way earlier than AGI itself, as soon as we apply robotics and vision models plus ChatGPT-3.5-level intelligence. Goodbye baristas, goodbye people working in factories.
Let's start working on a reasoning engine so we can replace those pesky knowledge workers too.
We’ve had coffee machines that can make a perfect coffee with a touch of a button for at least a decade. How does GPT3.5 remove baristas given they could have already been removed?
Reading the o1 announcement you could have been saying the same thing a year ago yet it's worse than Claude in practice and if it was all that's available - I wouldn't even use it if it was free - it's that bad.
If OpenAI has demonstrated one thing, it is that they are a hype production machine, and they are probably getting ready for the next round of investment. I wouldn't be surprised if this model was equally useless as o1 when you factor in performance and price.
At this point they are completely untrustworthy, and until something lands publicly for me to test, it's safe to ignore their PR as complete BS.
For most tasks - but not all. I normally paste my prompt in both and while Claude is generally superior in most aspects, there are tasks at which o1 performed slightly better.
It doesn’t really matter. “It works and is cost/resource-effective at being an AGI” is a fundamentally uninteresting proposition because we’re done at that point. It’s like debating how we’re going to deal with the demise of our star; we won’t, because we can’t.
”If intelligence lies in the process of acquiring new skills, there is no task X that solving X proves intelligence”
IMO it especially applies to things like solving a new IQ puzzle, especially when the model is pretrained for that particular task type, like was done with ARC-AGI.
For sure, it's very good research to figure out what kinds of tasks are easy for humans and difficult for ML, and then solve them. The jump in accuracy was surprising. But in practice the models are still unbelievably stupid and lacking in common sense.
My personal (moving) goalpost for "AGI" is now set to whether a robot can keep my house clean automatically. It's not general intelligence if it can't do the dishes. And before physical robots, being less of a turd at producing working code would be a nice start. I'm not yet convinced general-purpose LLMs will lead to cost-effective solutions to either vs humans. A specifically built dishwasher, however…
You remember when Google was scared to release LLMs? You remember that Googler that got fired because he thought the LLM was sentient?
There are likely a couple of surprises still left in LLMs, but no one should think that any present technology in its current state or architecture will get us to AGI or anything that remotely resembles it.
> Maybe because the implications are almost unthinkably profound.
laundering stolen IP from actual human artists and researchers, extinguishing jobs, deflecting responsibility for disasters. yeah, I can't wait for these "profound implications" to come to fruition!
It's even worse. AI is like a really smart but inexperienced person who also lies frequently. Because AI is not accountable to anything, it'll always come up with a reasonable-sounding answer to any question, whether it is correct or not.
To put it in other words: it is not clear when and how they hallucinate. With a person, their competence could be understood, and also their limits. But an LLM can happily give different answers based on trivial changes in the question, with no warning.
In a conversation (conversation and attached pictures at https://bsky.app/profile/liotier.bsky.social/post/3ldxvutf76...), I delete a spurious "de" ("Produce de two-dimensional chart [..]" to "Produce two-dimensional [..]") and ChatGPT generates a new version of the graph, illustrating a different function although nothing else has changed and there was a whole conversation to suggest that ChatGPT held a firm model of the problem. Confirmed my current doctrine: use LLM to give me concepts from a huge messy corpus, then check those against sources from said corpus.
LLMs are non-deterministic: they'll happily give different answers to the same prompt based on nothing at all. This is actually great if you want to use them for "creative" content generation tasks, which is IMHO what they're best at. (Along with processing of natural language input.)
Expecting them to do non-trivial amounts of technical or mathematical reasoning, or even something as simple as code generation (other than "translate these complex natural-language requirements into a first sketch of viable computer code") is a total dead end; these will always be language systems first and foremost.
This confuses me. You have your model, you have your tokens.
If the tokens are bit-for-bit-identical, where does the non-determinism come in?
If the tokens are only roughly-the-same-thing-to-a-human, sure I guess, but convergence on roughly the same output for roughly the same input should be inherently a goal of LLM development.
Most any LLM has a "temperature" setting: randomness added at the sampling step (the weights themselves stay fixed) to intentionally cause exactly this nondeterministic behavior. Good for creative tasks, bad for repeatability. If you're running one of the open models, set the temperature down to 0 and it suddenly becomes perfectly consistent.
The model outputs probabilities, which you have to sample randomly. Choosing the "highest" probability every time leads to poor results in practice, such as the model tending to repeat itself. It's a sort of Monte-Carlo approach.
The trained model is just a bunch of statistics. To use those statistics to generate text you need to "sample" from the model. If you always sampled by taking the model's #1 token prediction that would be deterministic, but more commonly a random top-K or top-p token selection is made, which is where the randomness comes in.
It is technically possible to make it fully deterministic if you have a complete control over the model, quantization and sampling processes. The GP probably meant to say that most commercially available LLM services don't usually give such control.
> If the tokens are bit-for-bit-identical, where does the non-determinism come in?
By design, most LLMs have a randomization factor in their sampling. Some use the concept of "temperature", which makes them randomly choose the 2nd or 3rd highest-ranked next token; the higher the temperature, the more often they pick a non-best next token. OpenAI described this in their papers around the GPT-2 timeframe IIRC.
Computers are deterministic. LLMs run on computers. If you use the same seed for the random number generator you’ll see that it will produce the same output given an input.
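Pulling the last few comments together, here is a small numpy sketch of the sampling step being described: the model's logits for a prompt are fixed, and the nondeterminism comes from how you sample from them. With temperature 0 (greedy) or a fixed seed the output is repeatable; otherwise it is not. The logits here are made-up numbers purely for illustration, not from any real model.

```python
import numpy as np

def sample_next_token(logits, temperature=1.0, top_k=None, rng=None):
    """Sample one next-token id from raw logits (illustrative, not any specific model)."""
    logits = np.asarray(logits, dtype=np.float64)
    if temperature == 0:                       # greedy: fully deterministic
        return int(np.argmax(logits))
    probs = np.exp(logits / temperature)       # temperature rescales the distribution
    probs /= probs.sum()
    if top_k is not None:                      # keep only the k most likely tokens
        cutoff = np.sort(probs)[-top_k]
        probs = np.where(probs >= cutoff, probs, 0.0)
        probs /= probs.sum()
    rng = rng or np.random.default_rng()       # a fixed seed makes this repeatable
    return int(rng.choice(len(probs), p=probs))

logits = [2.0, 1.5, 0.3, -1.0]                 # made-up scores for 4 candidate tokens
print(sample_next_token(logits, temperature=0))                      # always token 0
print(sample_next_token(logits, temperature=0.8, top_k=3,
                        rng=np.random.default_rng(seed=42)))         # repeatable given the seed
```

Hosted LLM services usually expose the temperature knob but not full control over seeds, quantization, and batching, which is why in practice they behave non-deterministically even at low temperature.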
There's no need for there to be changes to the question. LLMs have a rng factor built in to the algorithm. It can happily give you the right answer and then the wrong one.
I love how those changes are often just a different seed in the randomness... just chance.
I ran some repeated tests with "deeper than surface knowledge" questions on some niche subjects and was impressed that it gave the right answer... about 20% of the time.
"AI is a really smart but inexperienced person who also lies frequently."
Careful. Here "smart" means "amazing at pattern-matching and incredibly well-read, but has zero understanding of the material."
Sure, we're also pattern matching, but additionally (among other things):
1) We're continually learning so we can update our predictions when our pattern matching is wrong
2) We're autonomous - continually interacting with the environment, and learning how it responds to our interactions
3) We have built in biases such as curiosity and boredom that drive us to experiment, gain new knowledge, and succeed in cases where "pre-training to date" would have failed us
For one, a brain can’t do anything without irreversibly changing itself in the process; our reasoning is not a pure function.
For a person to truly understand something they will have a well-refined (as defined by usefulness and correctness), malleable internal model of a system that can be tested against reality, and they must be aware of the limits of the knowledge this model can provide.
Alone, our language-oriented mental circuits are a thin, faulty conduit to our mental capacities; we make sense of words as they relate to mutable mental models, and not simply in latent concept-space. These models can exist in dedicated but still mutable circuitry such as the cerebellum, or they can exist as webs of association between sense-objects (which can be of the physical senses or of concepts, sense-objects produced by conscious thought).
So if we are pattern-matching, it is not simply of words, or of their meanings in relation to the whole text, or even of their meanings relative to all language ever produced. We translate words into problems, and match problems to models, and then we evaluate these internal models to produce perhaps competing solutions, and then we are challenged with verbalizing these solutions. If we were only reasoning in latent-space, there would be no significant difficulty in this last task.
At the end of the day, we're machines, too. I wrote a piece a few months ago with an intentionally provocative title, questioning whether we're truly on a different cognitive level.
I asked ChatGPT to help out: -----------------------------
"The distinction between AI and humans often comes down to the concept of understanding. You’re right to point out that both humans and AI engage in pattern matching to some extent, but the depth and nature of that process differ significantly."
"AI, like the model you're chatting with, is highly skilled at recognizing patterns in data, generating text, and predicting what comes next in a sequence based on the data it has seen. However, AI lacks a true understanding of the content it processes. Its "knowledge" is a result of statistical relationships between words, phrases, and concepts, not an awareness of their meaning or context"
Yeah, it's just the fact that you pasted in an AI answer, regardless of how on point it is. I don't think people want this site to turn into an AI chat session.
I didn't downvote, I'm just saying why I think you were downvoted.
That's reasonable. I cut back the text. On the other hand I'm hoping downvoters have read enough to see that the AI-generated comment (and your response) are completely on-topic in this thread.
I use llms as tools to learn about things I don't know and it works quite well in that domain.
But so far I haven't found that it helps advance my understanding of topics I'm an expert in.
I'm sure this will improve over time. But for now, I like that there are forums like HN where I may stumble upon an actual expert saying something insightful.
I think that the value of such forums will be diminished once they get flooded with AI generated texts.
Of course the AI's comment was not insightful. How could it be? It's autocomplete.
That was the point. If you back up to the comment I was responding to, you can see the claim was: "maybe people are doing the same thing LLMs are doing". Yet, for whatever reason, many users seemed to be able to pick out the LLM comment pretty easily. If I were to guess, I might say those users did not find the LLM output to be human-quality.
That was exactly the topic under discussion. Some folks seem to have expressed their agreement by downvoting. Ok.
I think human brains are a combination of many things. Some part of what we do looks quite a lot like an autocomplete from our previous knowledge.
Other parts of what we do looks more as a search through the space of possibilities.
And then we act and collaborate and test the ideas that stand against scrutiny.
All of that is in principle doable by machines. The things we currently have and we call LLMs seem to currently mostly address the autocomplete part although they begin to be augmented with various extensions that allow them to take baby steps in other fronts. Will they still be called large language models once they will have so many other mechanisms beyond the mere token prediction?
We don't care what LLMs have to say; whether you cut back some of it or not, it's a low-effort waste of space on the page.
This is a forum for humans.
You regurgitating something you had no contribution in producing, which we can prompt for ourselves, provides no value here, we can all spam LLM slop in the replies if we wanted, but that would make this site worthless.
It's actually even worse than that: the current trend of AI is transformer-based deep learning models that use self-attention mechanisms to generate token probabilities, predicting sequences based on training data.
If only it was something which we could ontologically map onto existing categories like servants or liars...
Don't count it out yet as being problematic for software engineering, but not in the way you probably intend with your comment.
Where I see software companies using it most is as a replacement for interns and junior devs. That replacement means we're not training up the next generation to be the senior or expert engineers with real world experience. The industry will feel that badly at some point unless it gets turned around.
It’s also already becoming an issue for open-source projects that are being flooded with low-quality (= anything from “correct but pointless” to “actually introduces functional issues that weren’t there before”) LLM-generated PRs and even security reports; for examples see Daniel Stenberg’s recent writing on this.
Agree. I think we are already seeing a hollowing out effect on tech hiring at the lower end. They’ve always been squeezed a bit, but it seems much worse now.
Hallucinations can be mostly eliminated with RAG and tools. I use NotebookLM all of the time to research through our internal artifacts, it includes citations/references from your documents.
Even with ChatGPT you can ask it to find web citations and if it uses the Python runtime to find answers, you can look at the code.
And to prevent the typical responses: my company uses GSuite so Google already has our IP, NotebookLM is specifically approved by my company, and no, Google doesn't train on your documents.
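For readers who haven't seen it, here is a minimal sketch of the retrieval-with-citations pattern being described. It uses TF-IDF purely as a stand-in for a real embedding model, the document contents are invented examples, and the actual LLM call is left out; the point is that the answer is grounded in named sources you can check.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

documents = {                                      # toy stand-ins for internal artifacts
    "design_doc.md": "The billing service retries failed charges three times.",
    "runbook.md": "On-call engineers restart the billing worker via systemctl.",
}

def retrieve(question: str, k: int = 2):
    names, texts = list(documents), list(documents.values())
    vec = TfidfVectorizer().fit(texts + [question])
    sims = cosine_similarity(vec.transform([question]), vec.transform(texts))[0]
    return sorted(zip(names, texts, sims), key=lambda t: -t[2])[:k]

def build_prompt(question: str) -> str:
    # Asking the model to answer only from the cited sources is what makes
    # the output checkable against the documents it names.
    sources = "\n".join(f"[{name}] {text}" for name, text, _ in retrieve(question))
    return (f"Answer using only these sources and cite them by name:\n{sources}\n\n"
            f"Question: {question}")

print(build_prompt("How many times does billing retry a failed charge?"))
```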
Facts can be checked with RAG, but the real value of AI isn't as a search replacement, but for reasoning/problem-solving where the answer isn't out there.
How do you, in general, fact check a chain of reasoning?
I can’t tell a search engine to summarize text for a technical audience and then another summary for a non technical audience.
I recently came into the middle of a cloud consulting project where a lot of artifacts, transcripts of discovery sessions, requirement docs, etc had already been created.
I asked NotebookLM all of the questions I would have asked a customer at the beginning of a project.
What it couldn’t answer, I then went back and asked the customer.
I was even able to get it to create a project plan with work streams and epics. Yes it wouldn’t have been effective if I didn’t already know project management, AWS and two decades+ of development experience.
Despite what people think, LLMs can also do a pretty good job at coding when well trained on the APIs. Fortunately, ChatGPT is well trained on the AWS CLI, SDKs in various languages and you can ask it to verify the SDK functions on the web.
I’ve been deep into AWS-based development since LLMs have been a thing. My opinion may change if I get back into more traditional development.
> I can’t tell a search engine to summarize text for a technical audience and then another summary for a non technical audience.
No, but, as amazing as that is, don't put too much trust in those summaries!
It's not summarizing based on grokking the key points of the text, but rather based on text vs summary examples found in the training set. The summary may pass a surface level comparison to the source material, while failing to capture/emphasize the key points that would come from having actually understood it.
I write the original content or I was in the meeting where I’m giving it the transcript. I know what points I need to get across to both audiences.
Just like I’m not randomly depending on it to do an Amazon style PRFAQ (I was indoctrinated as an Amazon employee for 3.5 years), create a project plan, etc, without being a subject matter expert in the areas. It’s a tool for an experienced writer, halfway decent project manager, AWS cloud application architect and developer.
If I had a senior member of the team who was incredibly knowledgeable but occasionally lied, in a predictable way, I would still find that valuable. Talking to people is a very quick and easy way to get information about a specific subject in a specific context, so I could ask them targeted questions that are easy to verify; the worst thing that happens is I 'waste' a conversation with them.
Sure, but LLMs don't lie in a predictable way. It's just their nature that they output statistical sentence continuations, with a complete disregard for the truth. Everything they output is suspect, especially the potentially useful stuff where you don't know whether it's true or false.
They do lie in a predictable way: if you ask them for a widely available fact, you have a very high probability of getting the correct answer; if you ask them for something novel, you have a very high probability of getting something made up.
If I'm trying to use some tool that just got released or just got a big update, I won't use AI; if I want to check the syntax of a for loop in a language I don't know, I will. Whenever you ask it a question you should have an idea in your mind of how likely you are to get a good answer back.
I suppose, but they can still be wrong on common facts, like the number of R's in strawberry, that are counter-intuitive.
I saw an interesting example yesterday of the type "I have 3 apples, my dad has 2 more than me ..." where, of the top 10 predicted tokens, about half led to the correct answer and about half didn't. It wasn't the most confident predictions that led to the right answer - it was pretty much random.
The trouble with LLMs vs humans is that humans learn to predict facts (as reflected in feedback from the environment, and checked by experimentation, etc), whereas LLMs only learn to predict sentence soup (training set) word statistics. It's amazing that LLM outputs are coherent as often as they are, but entirely unsurprising that they are often just "sounds good" flow-based BS.
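You can reproduce that kind of top-10 observation yourself by asking the API for per-token alternatives. A rough sketch with the openai Python client; the model name is a placeholder, and whether a given model exposes logprobs is an assumption worth checking.

    import math
    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name
        messages=[{"role": "user",
                   "content": "I have 3 apples, my dad has 2 more than me. "
                              "How many do we have in total? Reply with just a number."}],
        logprobs=True,
        top_logprobs=10,  # ask for the 10 most likely candidates per output token
        max_tokens=5,
    )

    # Look at the candidates the model weighed for its first output token.
    first = resp.choices[0].logprobs.content[0]
    for cand in first.top_logprobs:
        print(f"{cand.token!r}: p ~ {math.exp(cand.logprob):.3f}")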
I think maybe this is where the polarisation between those who find ChatGPT useful and those who don't comes from. In this context, the number of r's in strawberry is not a fact: it's a calculation. I would expect AI to be able to spell a common word 100% of the time, but not to be able to count letters. I don't think in the summary of human knowledge that has been digitised there are that many people asking 'how many r's are there in strawberry', and if there are, I think the common reply would be '2', since the context is the second r. (People confuse strawbery and strawberry, not strrawberry and strawberry.)
Your apples question is the same: it's not knowledge, it's a calculation, it's intelligence. The only time you're going to get intelligence from AI at the moment is by asking a question that a significantly large number of people have already answered.
True, but that just goes to show how brittle these models are - how shallow the dividing line is between primary facts present (hopefully consistently so) in the training set, and derived facts that are potentially more suspect.
To make things worse, I don't think we can even assume that primary facts are always going to be represented in abstract semantic terms independent of source text. The model may have been trained on a fact but still fail to reliably recall/predict it because of "lookup failure" (model fails to reduce query text to necessary abstract lookup key).
Lying means stating things as facts despite knowing or believing that they are false. I don’t think this accurately characterizes LLMs. It’s more like a fever dream where you might fabulate stuff that appears plausibly factual in your dream world.
That sounds mostly like an incentives problem. If OpenAI, Anthropic, etc decide their LLMs need to be accurate they will find some way of better catching hallucinations. It probably will end up (already is?) being yet another LLM acting as a control structure trying to fact check responses before they are sent to users though, so who knows if it will work well.
Right now there's no incentive though. People keep paying good money to use these tools despite their hallucinations, aka lies/gaslighting/fake information. As long as users don't stop paying and LLM companies don't have business pressure to lean on accuracy as a market differentiator, no one is going to bother fixing it.
Believe me, if they could use another LLM to audit an LLM, they would have done that already.
It's inherent to transformers that they predict the next most likely token; it's not possible to change that behavior without making them useless at generalizing tasks (overfitting).
LLMs run on statistics, not logic. There is no fact checking, period. There is just the next most likely token based on the context provided.
Yes, most people who disagree with this have no clear understanding of how an LLM works. It is just a prediction mechanism for the next token. The implementation is very fancy and takes a lot of training, but it's not doing anything more than next-token prediction. That's why it is incapable of doing any reasoning.
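To make the "statistics, not logic" point concrete: a generation step boils down to roughly the loop below - score every candidate token, turn scores into probabilities, sample. Nothing in it consults a notion of truth. The vocabulary and scores here are toy numbers, not a real model.

    import math, random

    def softmax(logits, temperature=1.0):
        # Turn raw scores into a probability distribution over tokens.
        exps = [math.exp(x / temperature) for x in logits]
        total = sum(exps)
        return [e / total for e in exps]

    def sample_next_token(vocab, logits, temperature=0.8):
        # Pick the next token at random, weighted by probability.
        # There is no fact-checking step anywhere in here.
        return random.choices(vocab, weights=softmax(logits, temperature), k=1)[0]

    # Toy scores a model might assign after "The capital of France is"
    vocab  = [" Paris", " Lyon", " a", " the"]
    logits = [4.2, 1.1, 0.3, 0.2]
    print(sample_next_token(vocab, logits))  # usually " Paris", but not guaranteed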
Yeah, it's an interesting question, and I'm a little surprised I got downvoted here.
I wouldn't expect them to add an additional LLM layer unless hallucinations from the underlying LLM aren't acceptable, and in this case that means it is unacceptable enough to cost them users and money.
Adding a check/audit layer, even if it would work, is expensive both financially and computationally. I'm not sold that it would actually work, but I just don't think they've had enough reason to really give it a solid effort yet either.
Edit: as far as fact checking, I'm not sure why it would be impossible. An LLM wouldn't likely be able to run a check against a pre-trained model of "truth," but that isn't the only option. An LLM should be able to mimic what a human would do, interpret the response and search a live dataset of sources considered believable. Throw a budget of resources at processing the search results and have the LLM decide if the original response isn't backed up, or contradicts the source entirely.
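For what it's worth, the check layer described above isn't hard to sketch; whether it works well enough to justify the cost is the open question. Everything here is hypothetical: the ask_llm and web_search callables stand in for whatever model call and search backend you'd actually use.

    from typing import Callable

    def audited_answer(question: str,
                       ask_llm: Callable[[str], str],
                       web_search: Callable[[str], list[str]]) -> str:
        # First pass: the usual draft answer.
        draft = ask_llm(question)

        # Second pass: pull out the individual checkable claims.
        claims = ask_llm("List the factual claims in this answer, one per line:\n"
                         + draft).splitlines()

        unsupported = []
        for claim in claims:
            sources = web_search(claim)  # search only sources you consider believable
            verdict = ask_llm("Do these sources support, contradict, or not mention "
                              "the claim? Answer with one word.\n"
                              f"Claim: {claim}\nSources:\n" + "\n".join(sources))
            if "support" not in verdict.lower():
                unsupported.append(claim)

        # Flag (or regenerate) anything the sources didn't back up.
        if unsupported:
            return draft + "\n\n[Unverified: " + "; ".join(unsupported) + "]"
        return draft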
The problem is it's still a computer. And that's okay.
I can ask the computer "hey I know this thing exists in your training data, tell me what it is and cite your sources." This is awesome. Seriously.
But what that means is you can ask it for sample code, or to answer a legal question, but fundamentally you're getting a search engine reading something back to you. It is not a programmer and it is not a lawyer.
The hype train really wants to exaggerate this to "we're going to steal all the jobs" because that makes the stock price go up.
They would be far less excited about that if they read a little history.
It won't steal them all, but it will have a major impact by stealing the lower level jobs which are more routine in nature -- but the problem is that those lower level jobs are necessary to gain the experience needed to get to the higher level jobs.
It also won't eliminate jobs completely, but it will greatly reduce the number of people needed for a particular job. So the impact that it will have on certain trades -- translators, paralegals, journalists, etc. -- is significant.
I find it fascinating that I can achieve about 85-90% of what I need for simple coding projects in my homelab using AI. These projects often involve tasks like scraping data from the web and automating form submissions.
My workflow typically starts with asking ChatGPT to analyze a webpage where I need to authenticate. I guide it to identify the username and password fields, and it accurately detects the credential inputs. I then inform it about the presence of a session cookie that maintains login persistence. Next, I show it an example page with links—often paginated with numbered navigation at the bottom—and ask it to recognize the pattern for traversing pages. It does so effectively.
I further highlight the layout pattern of the content, such as magnet links or other relevant data presented by the CMS. From there, I instruct it to generate a Python script that spiders through each page sequentially, navigates to every item on those pages, and pushes magnet links directly into Transmission. I can also specify filters, such as only targeting items with specific media content, by providing a sample page for the AI to analyze before generating the script.
This process demonstrates how effortlessly AI enables coding without requiring prior knowledge of libraries like beautifulsoup4 or transmission_rpc. It not only builds the algorithm but also allows for rapid iteration. Through this exercise, I assume the role of a manager, focusing solely on explaining my requirements to the AI and conducting a code review.
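For the curious, the kind of script that workflow ends up producing looks roughly like this. The site URL, cookie name, page range, and CSS selector are made up for illustration; beautifulsoup4 and transmission_rpc are the libraries mentioned above, so double-check their APIs against your installed versions.

    import requests
    from bs4 import BeautifulSoup
    from transmission_rpc import Client

    BASE_URL = "https://example-tracker.local"     # made-up site
    COOKIES = {"session_id": "PASTE_YOUR_COOKIE"}  # the cookie that keeps you logged in

    transmission = Client(host="localhost", port=9091)

    def magnet_links_on_page(page_number):
        # Fetch one listing page and pull out every magnet link on it.
        resp = requests.get(f"{BASE_URL}/browse?page={page_number}",
                            cookies=COOKIES, timeout=30)
        resp.raise_for_status()
        soup = BeautifulSoup(resp.text, "html.parser")
        return [a["href"] for a in soup.select('a[href^="magnet:"]')]

    for page in range(1, 11):  # spider listing pages 1..10
        for magnet in magnet_links_on_page(page):
            transmission.add_torrent(magnet)  # push straight into Transmission
            print("queued:", magnet[:60])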
The thing that makes the smarter search use case interesting is how LLMs are doing their search result calculations: dynamically and at metadata scales previously impossible.
LLM-as-search is essentially the hand-tuned expert systems AI vs deep learning AI battle all over again.
Between natural language understanding and multiple correlations, it's going to scale a lot further than previous search approaches.
After using them for a long time I am convinced they have no true intelligence beyond what is latent in training data. In other words I think we are kind of fooling ourselves.
That being said they are very useful. I mostly use them as a far superior alternative to web search and as a kind of junior research assistant. Anything they find must be checked of course.
I think we have invented the sci-fi trope of the AI librarian of the galactic archive. It can’t solve problems but it can rifle through the totality of human knowledge and rapidly find things.
And a plagiarism machine. It's like a high school student who thinks that if they change a couple of words and make sure it's grammatically correct, it's not plagiarism because it's not an exact quote... either that or it just completely makes things up. I think LLMs will be revolutionary, but just not in the way people think. It may be similar to the Gutenberg press. Before the printing press, words were precious and closely held resources. The Gutenberg press made words cheap and abundant. Not everyone thought it was a good thing at the time, but it changed everything.
It seems to me predicting things in general is a pretty good way to bootstrap intelligence. If you are competing for resources, predicting how to avoid danger and catch food is about the most basic way to reinforce good behavior.
This would fall down into a semantic debate over what is meant by intelligence.
There is a well known phenomenon known as the AI effect: when something works we start calling it something else, not AI. Heuristics and complex reasoning trees were once called AI. Fuzzy logic with control systems was once called AI. Clustering was once called AI. And so on…
This certainly has one root in human or carbon-based-life chauvinism, but I think there’s something essential happening too. With each innovation we see its limits, and it causes us to go back and realize that what we colloquially call intelligence was more than we thought it was.
Intelligence predicts, but is prediction intelligence?
Again, here by intelligence I mean what complex living organisms and humans do.
I still believe there are things going on here not modeled by any CS system and not well understood. Not magic, just not solved yet. We are reverse engineering billions of years of evolution folks. We won’t figure it all out in a few decades.
Demonstrably, humans do think, and arguably demonstrably, early life would go down a path of simple predictions (in the form of stimulus -> response). And demonstrably, evolution did lead to human level intelligence.
So I don’t think there needs to be a semantic debate over where in the process intelligence started. The early responses to stimulus is a form of prediction, but not one that requires thinking.
There can be much disagreement about whether prediction is at the core of intelligence, or whether optimizing the ability to predict leads to intelligence. But from the established facts, it is the case that higher forms of life were bootstrapped from lower ones, and that our biochemistry does have reward functions. Successfully triggering those rewards will generally hinge on making successful predictions. Take from that what you will.
Prediction is a huge part of what intelligence does. I was questioning “prediction maximalism.”
Intelligence is also very good at pattern recognition. Did people once argue for pattern recognition maximalism?
Biological (including human) intelligence is clearly multi-modal and I strongly believe there are aspects that are barely understood if at all.
The history of CS and AI is a history of us learning how to make machines that are unbelievably good at some useful but strictly bounded subset of what intelligence can do: logic, math, pattern recognition, and now prediction.
I think we may still be far from general intelligence and I’m not even sure we can define the problem.
I've convinced myself I'm a multi-millionaire, but all other evidence easily contradicts that. Some people put a bit too much stock in "putting it out there" and "making your own reality".
I mean, it’s known that there’s no intelligence if you simply look at how it works on a technical level - it’s a prediction of the next token. That wasn’t really ever in question as to whether they have “intelligence”
To you & me that's true. But especially for the masses, it's not. It seems like at least once a day I either talk to someone or hear someone via TV/radio/etc who does not understand this.
An example that amused me recently was a radio talk show host who had a long segment describing how he & a colleague had a long argument with ChatGPT to correct a factual inaccuracy about their radio show. And that they finally convinced ChatGPT that they were correct due to their careful use of evidence & reasoning. And the part they were most happy about was how it had now learned, and going forward ChatGPT would not spread these inaccuracies.
That anecdote is how the public at large sees these tools.
Ironically if you explain to those talk show hosts how they are wrong about how ChatGPT learns (or doesn't learn) and use all the right arguments and proofs so that they finally concede, chances are that they too won't quite learn from that and keep repeating their previous bias next time.
To people who really understand them and are grounded, I think you're right. There has been a lot of hype among people who don't understand them as much, a lot of hype among the public, and a lot of schlock about "superintelligence" and "hard takeoff" etc. among smart but un-grounded people.
The latter kind of fear mongering hype has been exploited by companies like ClosedAI in a bid for regulatory capture.
A little humility would do us good regardless, because we don't know what intelligence is or what consciousness is; we can't truly define them, nor do we understand what makes humans conscious and sentient/sapient.
Categorically ruling out intelligence because "it's just a token predictor" puts us at the opposite of the spectrum, and that's not necessarily a better place to be.
EDITED: My ASCII art pyramid did not work. So imagine a pyramid with DATA at the bottom, INFORMATION on top of the data, KNOWLEDGE sitting on top of the INFORMATION, and WISDOM at the top.
And then try to guess where AI is. Some people say that Information is knowing the what, Knowledge the how, and Wisdom the why.
In general conversation, “intelligence”, “knowledge”, “smartness”, “expertise”, etc are used mostly interchangeably.
If we want to get pedantic, I would point out that “knowledge” is formally defined as “justified true belief”, and I doubt we want to get into the quagmire of whether LLM’s actually have beliefs.
I took OP’s point in the casual meaning, i.e. that LLMs are like what I would call an “intelligent coworker”, or how one might call a Jeopardy game show contestant as intelligent.
I would say "knowledge" rather than "intelligence"
The key feature of LLMs is the vast amounts of information and data they have access to, and their ability to quickly process and summarize, using well-written prose, that information based on pattern matching.
I was so stupid when GPT3 came out. I knew so little about token prediction, I argued with folks on here that it was capable of so many things that I now understand just aren't compatible with the tech.
Over the past couple of years of educating myself a bit, whilst I am no expert I have been anticipating a dead end. You can throw as much training at these things as you like, but all you'll get is more of the same with diminishing returns. Indeed in some research the quality of responses gets worse as you train it with more data.
I am yet to see anything transformative come out of LLMs other than demos that prompt engineers worked night and day to make impressive. Those Sora videos took forever to put together and cost huge amounts of compute. No one is going to make a whole production-quality movie with an LLM and disrupt Hollywood.
I agree, an LLM is like an idiot savant, and whilst it's fantastic for everyone to have access to a savant, it doesn't change the world like the internet, or internal combustion engine did.
OpenAI is heading toward some difficult decisions, they either admit their consumer business model is dead and go into competing with Amazon for API business (good luck), become a research lab (give up on being a billion dollar company), or get acquired and move on.
One of the core tenets of technology is that it makes a job less consuming of a person’s resources (time, strength, …). While I’ve read a lot of claims, I’ve yet to see someone make a proper argument for how LLMs can be such a tool.
> A group of individuals adept with use of such an idiot savant enhanced environment would be incredibly capable. They'd be a force unseen in human civilization before today
More than the people who landed someone on the moon?
They would be capable of landing someone on the moon, if they chose to pursue that goal, and had the finances to do so. And they'd do so with fewer people too.
It would have to be trained on 100% of all potential scenarios. Any scenario that happens for which it isn't trained equals certain disaster, unlike a human, who can adapt and improvise based on things AI does not have: feelings, emotions, creativity.
You're still operating under the assumption that the AI is doing independent work; it is not, it is advising the people doing the work. That is why people are the ones being augmented and enhanced, and not the other way around: people have the capacity to handle unforeseen scenarios, and with AI as a strategy advisor they'll do so with more confidence.
I have witnessed no evidence that would support this claim. The only contribution of LLMs to mathematics is in being useful to Terry Tao: they're not capable of solving novel orbital mechanics problems (except through brute-force search, constrained sufficiently that you could chuck a uniform distribution in and get similar outputs). That's before you get into any of the engineering problems.
https://deepmind.google/discover/blog/funsearch-making-new-d... seems to be a way. The LLM is the creative side, coming up with ideas - and in that case the “mutation” caused by hallucinations may be useful - combined with an evaluator to protect against the bad outputs.
Pretty close to the idea of human brainstorming, and it has worked. Could it do orbital math? Maybe not today, but the approach seems as feasible as the work Mattingly did for Apollo 13.
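The FunSearch-style loop described above is essentially: generate a candidate with the LLM, score it with a domain-specific evaluator, keep the best, and feed them back in as examples. A minimal sketch; llm_propose is a hypothetical stand-in for the model call, and evaluate is whatever scoring function your problem admits.

    from typing import Callable

    def funsearch_style_loop(llm_propose: Callable[[list[str]], str],
                             evaluate: Callable[[str], float],
                             iterations: int = 100,
                             pool_size: int = 5) -> str:
        pool: list[tuple[float, str]] = []
        for _ in range(iterations):
            # The LLM is the "creative" mutation step: it sees a few good
            # candidates and proposes a new one. Hallucinated variations are
            # tolerable because the evaluator rejects anything that scores badly.
            examples = [program for _, program in pool]
            candidate = llm_propose(examples)
            pool.append((evaluate(candidate), candidate))
            pool = sorted(pool, key=lambda item: item[0], reverse=True)[:pool_size]
        best_score, best_program = pool[0]
        return best_program

The design choice that matters is the evaluator: without a trustworthy scoring function, the loop just accumulates confident-sounding junk.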
You do not have them solving such problems, but you do have them in the conversation as the human experts knowledgeable in that area work to solve the problem. This is not the LLM doing independent work; it is the LLM interactively working with the human who is capable of solving the problem (it is their career), and the AI just makes them better at it, not by doing their work but by advising them as they work.
But they aren't useful for that. Terry Tao uses them to improve his ability to use poorly-documented boilerplatey things like Lean and matplotlib, but receiving advice from them‽ Frankly, if a chatbot is giving you much better advice than a rubber duck, you're either a Jack-of-all-Trades (in which case, I'd recommend better tools) or a https://ploum.net/2024-12-23-julius-en.html Julius (in which case, I'd recommend staying away from anything important).
> With o1, you can kind of do this. I gave it a problem I knew how to solve, and I tried to guide the model. First I gave it a hint, and it ignored the hint and did something else, which didn’t work. When I explained this, it apologized and said, “Okay, I’ll do it your way.” And then it carried out my instructions reasonably well, and then it got stuck again, and I had to correct it again. The model never figured out the most clever steps. It could do all the routine things, but it was very unimaginative.
I agree with his overall vision, but transformer-based chatbots will not be the AI algorithm that supports it. Highly-automated proof assistants like Isabelle's Sledgehammer are closer (and even those are really, really crude, compared to what we could have).
> The big difference with LLM AIs is they never graduate to an experienced staffer, they are always the idiot savant that is really dang smart but also clueless and needs to be observed.
Basically this. They already have vastly better-than-human ability at finding syntax errors within code, which on its own is quite useful; think of how many people have probably dropped out of CS as a major after staying up all night and failing to find a missing semicolon.
… One odd thing I’ve noticed about the people who are very enthusiastic about the use of LLMs in programming is that they appear to be unaware of any _other_ programming tools. Like, this is a solved problem, more or less; code-aware editors have been a thing since the 90s (maybe before?)
> code-aware editors have been a thing since the 90s
These will do things like highlight places where you're trying to call a method that isn't defined on the object, but they don't understand the intent of what you're trying to do. The latter is actually important in terms of being able to point you toward the correct solution.
True... in the past few days I used my time off to work on my hobby video game. Writing the game logic required me to consider problems that are quite self-contained, domain-specific, and probably globally unique (if not particularly complex).
I started out in Cursor, but I quickly realized Claude's erudite knowledge of AWS would not help me here; what I needed was to refactor the code quickly and often, so that I'd finally find the perfect structure.
For that, IDE tools were much more appropriate than AI wizardry.
Try being a TA to freshmen CS majors; a good 1/3 change majors because they can't handle the syntax strictness coupled with their generally untrained logical minds. They convince themselves it is "too hard" and their buddies over in the business school are having a heck of a lot of fun throwing parties...
Sounds like CS is not for them, and they find something else to do which is more applicable to their skills and interest. This is good. I don't think you should see a high drop out rate from a course as necessarily indicating a problem.
Losing potentially good talent because they don't know how or where to look for mistakes yet is foolhardy. I'm happy for them to throw in the towel if the field is truly not for them, but I would wager that a not-insignificant portion of that crowd would be able to meaningfully progress once they get past the immediate hurdles in front of them.
Giving them an LLM to help with syntax errors, at this stage of the tech, is deeply unhelpful to their development.
The foundation of a computer science education is a rigorous understanding of what the steps of an algorithm mean. If the students don't develop that, then I don't think they're doing computer science anymore.
The use of a LLM in this case is to show them where the problem is so that they can continue on. They can't develop an understanding of the algorithm they're studying if they can't get their program to compile at all.
> Giving them an LLM to help with syntax errors, at this stage of the tech, is deeply unhelpful to their development.
I mean if the alternative is quitting entirely because they can't see that they've mixed tabs with spaces, then yes, it's very very helpful to their development.
I dropped out of CS half because I didn't enjoy the coding: they dropped us into C++ and I found the error messages so confusing.
I discovered python five years later and discovered I loved coding.
(The other half of the reason is we spent two weeks designing an ATM at a very abstract level and I thought the whole profession would be that boring.)
Compilers can detect errors in the grammar, but they cannot infer what your desired intent was. Even the best compilers in the diagnostics business (rustc, etc) aren't mind-readers. A LLM isn't perfect, but it's much more capable of figuring out what you wanted to do and what went wrong than a compiler is.
I spent 8 hours trying to fix a bug once because notepad used smart quotation marks (really showing my age here - and now I'm pretty annoyed that the instructor was telling us to use notepad, but it was 2001 and I didn't know any better).
I did something like that once too, a long time ago. And because of that I now see syntax errors of that ilk within seconds, having learned the hard way.
This was about a million years ago. I had just installed a pirated copy of Windows XP (FCKGW-RHQQ2...) and was in the first quarter of my physics degree, taking a class in C. Different times....
This all sounds plausible, but personally I find being paired with a new idiot-savant hire who never learns anything from the interaction incredibly exhausting. It can augment and amplify one’s own capabilities, but it’s also continuously frustrating and cumbersome.
I think you’ve exactly captured the two disparate views we see on HN:
1. LLMs have little value, are totally unreliable, and will never amount to much because they don’t learn and grow and mature like people do, so they cannot replace a person like me who is well advanced in a career.
2. LLMs are incredibly useful and will change the world because they excel at entry-level work and can replace swaths of relatively undifferentiated information workers. LLM flaws are not that different from those workers’ flaws.
I’m in camp 2, but I appreciate and agree with the articulation of why they will not replace every information worker.
You really shouldn't say LLMs "never graduate" to experienced staff - rather that they haven't yet. But there are recent and continuing improvements in the ability of the LLMs, and in time, perhaps a small amount of time, this situation may flip.
I'm talking about the current SOTA. In the future, all bets are off. For today, they are very capable when paired with a capable person, and that is how one uses them successfully today. Tomorrow will be different, of course.
> A group of individuals adept with use of such an idiot savant enhanced environment would be incredibly capable. They'd be a force unseen in human civilization before today.
I'm sorry, but your comment is a good example of the logical shell game many people play with AI when applying it to general problem solving. Your LLM AI is both an idiot and an expert somehow? Where is this expertise derived from, and why should you trust it? If LLMs were truly as revolutionary as all the grifters would have you believe, then why do we not see "forces unseen in human civilization before today" from humans who employ armies of interns? That these supposed ubermensch do not presently exist is firm evidence, in my opinion, that current AI is a dead end.
Humans are infinitely more capable than current AI, the limiting factor is time and money. Not capability!
What I find curious is that the people who sell AI as the holy grail that will make many jobs obsolete in a few years at the same time claim that there's a huge talent shortage, and even engage in feuds over immigration and spend capital to influence immigration policies.
Apparently they don't believe that AI is about to revolutionize things that much. This makes me believe that a significant part of the AI investment is just FOMO driven, so no real revolution is around the corner.
Although we keep seeing claims that AI has achieved PhD level this and Olympiad level that, the people who actually own these systems keep demanding immigration policy changes to bring actual humans from overseas for years to come.
Have you maybe confused the time periods in the different discussions? I think the AI making jobs obsolete part is in the next few years, whereas the talent shortage issue is right now - although as usual, it's a wage issue, not a talent issue. Pay enough and the right people will turn up.
Who knows about the future, right? I'm just trying to read the expectations of the people who have control over the AI, the capital, and the politics, and they don't strike me as optimistic about AI actually doing much in the near future.
And that might be FOMO, or they can simply exit with a profit as long as they can keep fanning the hype. Or, of course, they may be hoping to have it in the long term.
They are not replacing their workers despite claiming that AI is currently as good as a PhD, and they certainly don't go to AI medical doctors despite claiming that their tool is better than most doctors.
Is that so? I'm not in the US, so I don't have a good idea of what's going on there. But wasn't there relatively high unemployment among developers after all these Big Tech layoffs post pandemic? Shouldn't companies there have an easy time finding local talent?
Sorry for the potentially silly question. I just spent some time trying to research it and came up with nothing concrete.
But at the same time there's an ongoing infighting among Trump supporters because tech elites came up as pro - skilled immigration where the MAGA camp turned against them. The tech elites claim that there's a talent shortage. Here's a short rundown that Elon Musk agrees with: https://x.com/AutismCapital/status/1872408010653589799
Every software firm, notable and small, has had layoffs over the past two years, but somehow there's still a "STEM shortage" and companies are "starving for talent" or some such nonsense?
> I would call ‘LLM-functionalism’: the idea that a natural language description of the required functionality fed to an LLM, possibly with some prompt engineering, establishes a meaningful implementation of the functionality.
My boy. More people need common sense like this talked into them.
The reliance on large datasets for training AI models introduces biases present in the data, which can perpetuate or even exacerbate societal inequalities. It's essential to approach AI development with caution, ensuring robust ethical guidelines and comprehensive testing are in place before integrating AI into sensitive areas.
As we continue to innovate, a focus on explainability, fairness, and accountability in AI systems will be paramount to harnessing their potential without compromising societal values.
"Today’s most advanced AI models have many flaws, but decades from now, they will be recognized as the first true examples of artificial general intelligence."
Norvig seems to be using a loose technical definition of AGI, roughly "AI with some degree of generality", which is hard to argue with, although by that measure older GOFAI systems like SOAR might also qualify.
Certainly "deep learning" in general (connectionist vs symbolic, self-learnt representations) was a step in the right direction, and LLMs a second step, but it seems we're still a half dozen MAJOR steps away from anything similar to animal intelligence, with one critical step being moving beyond full dataset pre-training to new continuous learning algorithms.
I agree. The systems in place already solve generalized problems not directly represented in the training set or algorithm. That was, up until the last few years, the off-the-shelf definition of AGI.
And the systems in place do so at scales and breadths that no human could achieve.
That doesn’t change the fact that it’s effectively triple-PhD Uncle Jim: slightly unreliable and prone to bullshitting its way through questions, despite having a breathtaking depth and breadth of knowledge.
What we are making is not software in any normal sense of the word, but rather an engine to navigate the entire pool of human knowledge, including all of the stupidity, bias, and idiosyncrasies of humanity, all rolled up into a big sticky glob.
It’s an incredibly powerful tool, but it’s a fundamentally different class of tool. We cannot expect to apply conventional software processes and paradigms to LLM based tools any more than we could apply those paradigms to politics or child rearing and expect useful results.
> The systems in place already solve generalized problems not directly represented in the training set or algorithm
Tell me a problem that an LLM can solve that is not directly represented in the training set or algorithm. I would argue that 99% of what commercial LLMs get prompted about is stuff that already existed in the training set. And they still hallucinate half-lies about that. When your training data is most of the internet, it is hard to find problems it hasn't encountered before.
o3 solved a quarter of the challenging novel problems on the FrontierMath benchmark, a set of problems "often requiring multiple hours of effort from expert mathematicians to solve".
I’m having a hard time taking this comment seriously, since solving novel problems is precisely what LLMs are valuable for. Sure, most problems are in some way similar in pattern to some other, known one, but that describes 99.9 percent of what 99.9 percent of people do. De novo conceptual synthesis is vanishingly rare and I’m not even sure it exists at all.