
It's Google. Assume the training set contains, as a subset, the entirety of all public digitized information. How would you like them to share it?





If they wanna do research where they claim they did something novel, without showing that they didn't just "look it up" in their massive training set, then yes, they should share what is and what isn't contained within.

As a tar, please.


