Hacker News

> Do we observe misaligned behavior of LLMs?

Grok? :P

That said, we don't know how many other things, besides being trained to write malicious code, also lead to general misalignment.

Humanity is currently, in essence, running psychological experiments on a mind that almost nobody outside of research labs had seen or toyed with 4 years ago, and trying to work out what "a good upbringing" means for it.


