Hacker News new | past | comments | ask | show | jobs | submit | google2342's comments login

You shouldn't use the mean when doing benchmarking. Better to use the median or fastest time. Lots of random things can happen on computers (usually in the OS) that can result in some operation taking 1000x longer.


Interesting point. The %timeit functionality actually used to output the mean of the x fastest runs, they seem to have changed that at some point. The docs still explain the old behavior [1].

I assume that they feel your concerns are addressed because they display the stddev together with the mean, so you can see if there were any extreme outliers.

[1] https://ipython.org/ipython-doc/dev/interactive/magics.html#...


If you're testing a single-threaded benchmark, then the test statistics aren't going to be meaningfully different in interpretation, especially if you're only asking the question "is A or B faster?" What's more important is that you capture enough runs to characterize the distribution well; if you have that, you'll get meaningful results no matter which statistic you're actually measuring.


How do you expect someone to justify a statement that's not true? I suppose you could say that it's a fair characterization of a basic decision tree but that doesn't describe modern ML methods.


That's not a fair characterization. These contractors are mostly used for quality assurance. Once you've trained your model (which is not just if statements), you might you send a few thousand examples to raters to judge the true quality of your model.


Don't do that to yourself. Better to take care of yourself than buying a new phone.


New grads can start at 200K+ in the right companies.


I suspect the winners will already be well established in the field.


Right, and 25k is their bi weekly paycheck.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: