Reading the article, it looks like they did test on Mac hardware: they ran the initial distributed test on cheaper hardware first, then confirmed each crash individually on the more expensive Mac hardware. Or at least that's how I read OP.
It'd be way more expensive. They might as well just fuzz with WebKitGTK+, since it produces the results they want, and then validate the crashes on Safari.