Hacker News

The biggest mistake engineers make is in determining sample sizes. It is not trivial to determine the sample size for a trial without prior knowledge of effect sizes. Instead of waiting for a fixed sample size, I would recommend a sequential testing framework: set a stopping condition and perform a test after each new batch of sample units.
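A minimal sketch of that batch-wise loop (the function and variable names here are hypothetical, and the stopping rule plugged in must itself be one that remains valid under optional stopping):

```python
import random

def sequential_trial(sample_batches, stopping_rule):
    """Process batches of units, checking a stopping condition after each.

    `stopping_rule` receives all units seen so far and returns True to
    stop early; otherwise the trial runs until the batches are exhausted.
    """
    seen = []
    for batch in sample_batches:
        seen.extend(batch)
        if stopping_rule(seen):
            return "stopped", len(seen)
    return "exhausted", len(seen)

# Toy run with a trivial rule: stop once 100 units have been observed.
random.seed(0)
batches = ([random.random() for _ in range(25)] for _ in range(10))
decision, n = sequential_trial(batches, lambda xs: len(xs) >= 100)
```

The point of the structure is that the decision to continue or stop is made after every batch, not once at a preplanned horizon.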

This is called optional stopping, and it is not valid with a classic t-test, since its Type I and II error guarantees hold only at a predetermined sample size. However, other tests make it possible: see safe anytime-valid statistics [1, 2] or, more simply, Bayesian testing [3, 4].

[1] https://arxiv.org/abs/2210.01948

[2] https://arxiv.org/abs/2011.03567

[3] https://pubmed.ncbi.nlm.nih.gov/24659049/

[4] http://doingbayesiandataanalysis.blogspot.com/2013/11/option...
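To make this concrete, here is a toy e-value test for a Bernoulli proportion (a generic illustration, not the method of any particular paper above; `p_alt` is an assumed alternative). Under H0 the running likelihood ratio is a nonnegative martingale with mean 1, so by Ville's inequality the probability it ever crosses 1/alpha is at most alpha, which is exactly why peeking after every observation is allowed:

```python
def e_process_bernoulli(xs, p_alt=0.6, p_null=0.5):
    """Running likelihood-ratio e-value for H0: p = p_null.

    Under H0 the product below is a nonnegative martingale with mean 1,
    so rejecting when it exceeds 1/alpha is anytime-valid at level alpha.
    `p_alt` is an assumed point alternative chosen by the analyst.
    """
    e = 1.0
    for x in xs:
        num = p_alt if x else 1 - p_alt
        den = p_null if x else 1 - p_null
        e *= num / den
    return e

# Reject at alpha = 0.05 once the e-value crosses 1/alpha = 20.
data = [1, 1, 0, 1, 1, 1, 0, 1, 1, 1]  # 8 successes in 10 trials
e = e_process_bernoulli(data)  # 1.2**8 * 0.8**2, still below 20
```

Here the evidence is accumulating but has not yet crossed the threshold, so the trial would simply continue with the next batch.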




People often don’t determine sample sizes at all! And doing a power calculation without an idea of the effect size isn’t just hard, it’s impossible: the effect size is one of the inputs to the formula. But at least the calculation is fast, so you can sort of guess and check.
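For intuition, the standard normal-approximation formula shows why the effect size is an indispensable input. This is a sketch of the textbook two-sample calculation, not any particular platform's implementation:

```python
import math
from statistics import NormalDist

def n_per_group(effect_size, alpha=0.05, power=0.8):
    """Approximate per-group n for a two-sample z-test detecting a
    standardized effect size d, via n = 2 * (z_{1-a/2} + z_{power})^2 / d^2.
    """
    z = NormalDist()
    z_a = z.inv_cdf(1 - alpha / 2)  # ~1.96 for alpha = 0.05
    z_b = z.inv_cdf(power)          # ~0.84 for 80% power
    return math.ceil(2 * (z_a + z_b) ** 2 / effect_size ** 2)

# The effect size dominates: halving d quadruples the required n.
n_small = n_per_group(0.10)  # -> 1570 per group
n_tiny = n_per_group(0.05)   # -> 6280 per group
```

Since n scales as 1/d^2, a bad guess at the effect size doesn't just shift the answer, it changes it by multiples, which is why guess-and-check over a range of plausible effects is the pragmatic move.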

Anytime-valid inference helps with this situation, but it doesn’t solve it. If you’re trying to detect a small effect, it’s much better to find out up front that you need a million samples than to discover it because your test, collecting 1,000 samples a day, took three years.

Still, anytime-valid is way better than fixed IMO. Truly fixed horizons almost never exist in practice: every A/B testing platform I’ve seen allows peeking.

I work with the author of the second paper you listed. The math looks advanced, but it’s very easy to implement.


The biggest mistake is engineers owning experimentation. Experiments should be owned by data scientists.

I realize, though, that this is a luxury, but I also see this trend at blue-chip companies.


Did a data scientist write this? You don't need to be a member of a priesthood to run experiments. You just need to know what you're doing.


I agree with both sides here. :) DS should own experimentation, AND engineers should be able to run a majority of experiments independently.

As a data scientist at a "blue chip company", my team owns experimentation, but that doesn't mean we run all the experiments. Our role is to create guidelines, processes, and tooling so that engineers can run their own experiments independently most of the time. Part of that is also helping engineers recognize when they're dealing with a difficult/complex/unusual case where they should bring DS in for more bespoke hands-on support. We probably only look at <10% of experiments (either in the setup or results phase or both), because engineers/PMs are able to set up, run, and draw conclusions from most of the experiments without needing us.


... and by some definition you'd be a data scientist yourself. (Regardless of your job title)



