
FWIW, you're not telling it precisely what to do, you're giving it an input that steers a statistical output. It's trained on human text and a bunch of internet bullshit, so you're really just seeding it and hoping it produces the desired output.

To give a deliberately contrived (i.e. this may or may not actually work, it's purely academic) example: if you want it to output a stupid reddit-style repeating-comment conga line, you don't say "I need you to create a list of repeating reddit comments", you say "Fuck you reddit, stop copying me!"
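
A rough sketch of that idea with a base (non-instruction-tuned) model via Hugging Face transformers: the prompt is a seed to be continued, not a command. Model choice and sampling settings here are just illustrative:

  # pip install transformers torch
  from transformers import pipeline

  # GPT-2 is a base model: it only continues text, it doesn't follow orders.
  gen = pipeline("text-generation", model="gpt2")

  # Seed with text that statistically *precedes* what you want,
  # not an instruction describing what you want.
  seed = "Fuck you reddit, stop copying me!"
  for out in gen(seed, max_new_tokens=40, num_return_sequences=3, do_sample=True):
      print(out["generated_text"])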




This isn't true for an instruction-tuned model: it's trained so that you actually do tell it what to do.
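
With a chat-tuned model you literally send the instruction; a minimal sketch against the OpenAI chat API (the model name is just an example):

  # pip install openai; assumes OPENAI_API_KEY is set in the environment
  from openai import OpenAI

  client = OpenAI()
  resp = client.chat.completions.create(
      model="gpt-4o-mini",  # any instruction-tuned chat model
      messages=[{
          "role": "user",
          "content": "Create a list of repeating reddit-style comments.",
      }],
  )
  print(resp.choices[0].message.content)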


Sure, but it's still a statistical model. It doesn't know what the instructions mean; it just does what those instructions statistically link to in the training data. It's not doing perfect forward logic, and it never will in this paradigm.


The fine-tuning process isn't itself a statistical model, so that objection doesn't apply to it. You beat the model into shape until it does what you want (DPO and variants of it), and you can test that it's doing it.
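
For the curious, the preference objective itself is tiny; a minimal sketch of the per-pair DPO loss in PyTorch, assuming you already have summed log-probs of the chosen/rejected responses under the policy and a frozen reference model (names and toy numbers are mine):

  import torch
  import torch.nn.functional as F

  def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
      # Log-ratios of policy vs. frozen reference for each response;
      # the loss pushes the (chosen - rejected) margin up.
      margin = (pi_chosen - ref_chosen) - (pi_rejected - ref_rejected)
      return -F.logsigmoid(beta * margin).mean()

  # Toy stand-ins for real summed token log-probs.
  print(dpo_loss(torch.tensor([-4.0]), torch.tensor([-6.0]),
                 torch.tensor([-5.0]), torch.tensor([-5.5])))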


Yeah, but you're still beating up a statistical model that's gonna do statistical things.

Also, we're talking about prompt engineering more than fine-tuning here.



