Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's pretty well known that the AI companies are heavy users of Amazon mturk for their RLHF post-training.


He meant the opposite, you could be paid to be a worker on MTurk but actually just be feeding it LLM output.


In this case it's Model Distillation. Training a new model with the outputs of a another.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: