
Even if you have the full population in question and thereby avoid sampling issues, there are still plenty of pitfalls. For example, if you just start correlating every variable against every other one and picking out the ones that pass some test of statistical significance as "findings", you run into a range of familiar problems generally grouped under the pejorative term "data dredging". With enough pairwise tests, some will clear a 5% threshold by chance alone.
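To make the problem concrete, here is a rough sketch in Python. It generates columns of pure noise, correlates every column against every other one, and counts how many pairs look "significant". Everything in it is an assumption for illustration: the function names (`pearson_r`, `approx_p_value`, `dredge`), the sizes, the 0.05 cutoff, and the use of the Fisher z-transform with a normal approximation as the significance test.

```python
import itertools
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def approx_p_value(r, n):
    """Approximate two-sided p-value for r (requires |r| < 1).

    Uses the Fisher z-transform and a normal approximation,
    chosen here only to keep the sketch dependency-free.
    """
    z = math.atanh(r) * math.sqrt(n - 3)
    return math.erfc(abs(z) / math.sqrt(2))


def dredge(num_vars=40, num_rows=200, alpha=0.05, seed=0):
    """Correlate every noise column against every other one.

    Returns (number of pairs tested, pairs with p < alpha,
    number of pairs expected to pass by chance).
    """
    rng = random.Random(seed)
    # Independent Gaussian noise: there is no real relationship to find.
    columns = [
        [rng.gauss(0, 1) for _ in range(num_rows)]
        for _ in range(num_vars)
    ]
    pairs = list(itertools.combinations(range(num_vars), 2))
    hits = sum(
        approx_p_value(pearson_r(columns[i], columns[j]), num_rows) < alpha
        for i, j in pairs
    )
    return len(pairs), hits, alpha * len(pairs)


if __name__ == "__main__":
    tested, hits, expected = dredge()
    print(f"{tested} pairs tested, {hits} 'significant', ~{expected:.0f} expected by chance")
```

With the defaults that is 780 pairwise tests, so about 39 of them would be expected to pass the 0.05 cutoff even though every column is noise. Reporting those as "findings" is exactly the dredging problem.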


