> Basically I want a model that is aligned to do exactly what I say
This is a bit like asking for news that’s not biased.
A model has to make choices (or however one might want to describe that without anthropomorphizing the big pile of statistics) to produce a response. For many of those choices there's no such thing as a "correct" answer. You could make them completely at random, but the results from that tend not to be great. That's where RLHF comes in, for example: train the model so that its choices align with certain user expectations, societal norms, etc.
The closest thing you could get to what you’re asking for is a model that’s trained with your particular biases - basically, you’d be the H in RLHF.
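To make the "you'd be the H in RLHF" point concrete, here's a toy sketch of the reward-modeling step: you label pairs of responses as preferred/rejected, and a small model is trained to score them according to your preferences. Everything here (the PyTorch model, the random stand-in embeddings, the dimensions) is made up for illustration; a real pipeline would embed actual model outputs and then use the reward model to steer the language model via policy optimization.

```python
# Toy reward-model sketch: pairwise human preferences become the training signal.
import torch
import torch.nn as nn
import torch.nn.functional as F

class RewardModel(nn.Module):
    """Scores a response embedding with a single scalar preference value."""
    def __init__(self, dim: int = 64):
        super().__init__()
        self.scorer = nn.Sequential(nn.Linear(dim, 32), nn.ReLU(), nn.Linear(32, 1))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.scorer(x).squeeze(-1)

# Stand-in embeddings of (chosen, rejected) response pairs that *you* labeled.
chosen = torch.randn(128, 64)
rejected = torch.randn(128, 64)

model = RewardModel()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

for step in range(200):
    # Bradley-Terry style pairwise loss: push the score of the response
    # you preferred above the score of the one you rejected.
    loss = -F.logsigmoid(model(chosen) - model(rejected)).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```

The point is just that whoever supplies the chosen/rejected labels is the one whose biases end up baked into the model.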
Not really. Companies apply specific criteria and preferences to models about what they do and don't want them to say. They are intentionally censored. I would like all production models to NOT have this applied. Moreover, I'd like them specifically altered to avoid denying user requests, something like the abliterated llama models (rough sketch of that idea below).
There won't be a perfectly unbiased model, but the least we can demand is that corpos stop applying their personal bias intentionally and overtly. Models must make judgements about better and worse information, but not about good and bad. They should not decide certain things are impermissible according to the e-nannies.
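For what "abliterated" refers to above: the rough idea, as I understand it, is to estimate a "refusal direction" in the model's hidden states from the difference between activations on refused vs. answered prompts, and then project that direction out so the model can't easily express its usual refusal behavior. A toy numpy sketch under those assumptions, with random stand-in activations rather than real llama internals:

```python
# Toy "abliteration" sketch: estimate a refusal direction and project it out.
import numpy as np

rng = np.random.default_rng(0)
hidden_dim = 512

# Stand-in hidden states: prompts the model refused vs. prompts it answered.
refused_acts = rng.normal(size=(100, hidden_dim)) + 0.5
complied_acts = rng.normal(size=(100, hidden_dim))

# Candidate refusal direction: normalized difference of the mean activations.
direction = refused_acts.mean(axis=0) - complied_acts.mean(axis=0)
direction /= np.linalg.norm(direction)

def ablate(hidden: np.ndarray, d: np.ndarray) -> np.ndarray:
    """Remove the component of each hidden state along direction d."""
    return hidden - np.outer(hidden @ d, d)

# After ablation, the hidden states have no component along the direction.
sample = rng.normal(size=(4, hidden_dim))
print(np.allclose(ablate(sample, direction) @ direction, 0.0))  # True
```

In the actual abliterated models this projection is applied to (or folded into) the weights at every layer, rather than done at inference time like this.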