
I have been reading arguments about "things like alignment faking" for years, while simultaneously holding that "it's just autocomplete".

The alignment-faking arguments are still terrifying to the extent that they're plausible. In the hypothetical where I'm wrong about it being "just autocomplete" (and fundamentally, inescapably so), the risk is far greater than can be justified by the potential benefits.

But that's itself a large part of why I believe those arguments are false. If I gave them credit and they turned out to be false, then I figure I'd have succumbed to a form of Pascal's Mugging. If I don't give them credit and it turns out that a hostile, agentive AGI has been pretending to be aligned, I don't expect anyone (including myself) to survive long enough to rub it in my face.

Honestly, I sometimes worry that we'll doom ourselves by taking AI too seriously even if it's indeed "just autocomplete". We've already had people commit suicide. The sheer amount of plausible-sounding text that can now be generated proposing harmful actions is worrying, even if it doesn't reflect the intent of any agent to convince others to take those actions. (See also e.g. Elsagate.)




> But that's itself a large part of why I believe those arguments are false. If I gave them credit and they turned out to be false, then I figure I'd have succumbed to a form of Pascal's Mugging. If I don't give them credit and it turns out that a hostile, agentive AGI has been pretending to be aligned, I don't expect anyone (including myself) to survive long enough to rub it in my face.

I'm sorry, but this is a crazy reason to believe something is false. Things are either true or they aren't, and the fact that the world would be nicer to live in if thing X were false does not actually bear on whether thing X is false.


It's not "the world would be nicer to live in if thing X were false".

It's "the world would cease to exist if thing X were true".



