You can ask the model something like "Is xyz correct? Answer with one word, either Yes or No." The log probs of the two tokens should then reflect how certain it is. However, RLHF-tuned models are apparently worse calibrated at this than base models.
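Rough sketch of what I mean, assuming the OpenAI Python SDK v1.x (the model name and prompt are just placeholders):

```python
import math
from openai import OpenAI

client = OpenAI()

def yes_no_confidence(question: str) -> dict:
    # Constrain the answer to a single token and request the top log probs for it.
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name
        messages=[
            {"role": "system", "content": "Answer with exactly one word: Yes or No."},
            {"role": "user", "content": question},
        ],
        max_tokens=1,
        logprobs=True,
        top_logprobs=5,
    )
    top = response.choices[0].logprobs.content[0].top_logprobs
    # Convert the log probs of the Yes/No tokens into probabilities.
    return {
        entry.token.strip(): math.exp(entry.logprob)
        for entry in top
        if entry.token.strip().lower() in ("yes", "no")
    }

print(yes_no_confidence("Is the Pacific the largest ocean?"))
```

If the calibration claim holds, the gap between the "Yes" and "No" probabilities would be more meaningful on a base model than on an RLHF-tuned one.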
Function calling seems like it could work well to force an active, distinct choice, but I'm still unsure whether the chosen function/parameters would actually reflect the logical, correct answer...
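For comparison, here's a sketch of the function-calling variant under the same SDK assumption: a forced tool call whose single parameter is constrained to an enum, so the model has to commit to a discrete choice (the tool name `record_verdict` is hypothetical):

```python
import json
from openai import OpenAI

client = OpenAI()

# A single tool with one enum-constrained parameter forces a yes/no commitment.
tools = [{
    "type": "function",
    "function": {
        "name": "record_verdict",  # hypothetical tool name
        "description": "Record whether the statement is correct.",
        "parameters": {
            "type": "object",
            "properties": {
                "verdict": {"type": "string", "enum": ["yes", "no"]},
            },
            "required": ["verdict"],
        },
    },
}]

response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    messages=[{"role": "user", "content": "Is 17 a prime number?"}],
    tools=tools,
    # Forcing this specific tool means the model cannot dodge the question.
    tool_choice={"type": "function", "function": {"name": "record_verdict"}},
)

args = json.loads(response.choices[0].message.tool_calls[0].function.arguments)
print(args["verdict"])
```

Note this only forces the format of the answer, not its correctness, which is exactly the open question above.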