You are mixing up knowledge and reasoning skills. And I've definitely met high schoolers who were smarter than PhD student colleagues, so even there your point falls apart. When you lump together all forms of intelligence without any clear definition, you'll never get meaningful answers. For example, is your friend unintelligent because he's not a world-elite chess player? Sure, to those elite players he might appear dumb, but that doesn't mean he has no useful skills at all.

That's also what Turing realised back then: you can't test for something as ambiguous as "intelligence" per se, but you can test for practical, real-life applications of it. Turing was also convinced that the arguments against computers being "intelligent" (many of which you see repeated over and over on HN) were fundamentally flawed. He thought the idea that machines couldn't think like humans said more about flaws in our understanding of our own minds than about any technological problem.

Without a meaningful definition of true intelligence, we may have to live with the fact that the answer to the question "Is this thing intelligent?" must come from the outcome of practical tests like Turing's, not from dogmatic beliefs about how humans might have solved the test differently.
While these definitions are qualitative and contextual, and probably defined slightly differently even within in-groups, the classification is essentially "I know it when I see it".
We are not dealing with an evaluation of intelligence, but rather with a classification problem. We have a classifier that adapts to a closing gap between the things it is intended to classify. Tests often get updated to match the evolving problem they are testing; nothing new here.
>the classification is essentially "I know it when I see it".
I already see it when it comes to the latest version of ChatGPT. It seems intelligent to me. Does this mean it is? It also seems conscious ("I am a large language model"). Does that mean it is?
The question is not whether you consider a thing intelligent, but rather whether you can tell meatbag intelligence and electrified sand intelligence apart.
You seem to get the Turing test backwards. The Turing test does not classify entities into intelligent and non-intelligent; rather, it takes a preexisting ontological classification of natural versus artificial intelligence and tries to label each correctly.