And it can, in fact, be encoded as a linear token stream (you just need special ...

int_19h 30 days ago | parent | context | favorite | on: Alignment faking in large language models

And it can, in fact, be encoded as a linear token stream (you just need special tokens to indicate edits to previously output tokens).