A simple metric on confidence interval could do the trick. As the model grows la...

xkgt on March 25, 2023 | parent | context | favorite | on: GPT-4 performs significantly worse on coding probl...

A simple metric on confidence interval could do the trick. As the model grows larger, it is getting more difficult to understand what is going on, but that doesn't mean that it needs to be a total black box. At least let it throw some proxy metrics. In due course, will learn to interpret those metrics and adjust our internal trust model.

kozikow on March 25, 2023 [–]

You can just ask it to give you confidence in the output on a scale 0 to 1