"You need to take care so the synchronization isn't eating your performance wins though. I know you wrote "in a sensible manner", so you probably already thought of that."
Like I said, I haven't gotten too far into it, but another idea that keeps coming to mind is to have some sort of cost model in the language. One of my other I Have A Dream elements of a new language is the ability to make assertions about blocks of code, like "this code is inlined", or (in a GC'ed language) "this variable is stack allocated", or, in the case of crypto code, "this code is optimized only to constant-time operations". These assertions wouldn't make the asserted thing happen, because experience over the decades shows that that doesn't work (witness the way inline pragmas have gotten consistently weaker over the years, to the point that I believe they're no-ops in many environments). Instead, they'd be signs the compiler checks on the user's behalf: if the constraint is violated, you get a clear message explaining why.
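To make that concrete, here's a rough sketch of what the assertion side might look like, in C++-flavored syntax. The attributes are entirely made up (no compiler implements them); the function bodies are just ordinary code:

    // Hypothetical attributes: the point is that each one is checked,
    // not enforced. If the property doesn't hold after optimization,
    // compilation fails with an explanation of why.

    [[assert::inlined]]            // error if any call site isn't inlined
    inline int dot3(const int* a, const int* b) {
        return a[0] * b[0] + a[1] * b[1] + a[2] * b[2];
    }

    [[assert::constant_time]]      // error if codegen branches on the data
    bool tag_equal(const unsigned char* a, const unsigned char* b, int n) {
        unsigned char diff = 0;
        for (int i = 0; i < n; ++i)
            diff |= a[i] ^ b[i];   // no early exit: same work for any input
        return diff == 0;
    }

The compiler would remain free to say no; the value is in the diagnostic, not in forcing the optimizer's hand.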
The idea could be adapted to something like "this block of code copies the data to the GPU only once", or some variant of that claim (each byte copied at most once).
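Sketching that in the same made-up attribute style (cudaMemcpy is the real CUDA runtime call; the attribute and process_on_gpu are stand-ins I've invented for illustration):

    #include <cuda_runtime.h>

    void process_on_gpu(float* dev, size_t n);  // stand-in kernel wrapper

    void pipeline(const float* host, float* dev, size_t n) {
        // Hypothetical: each byte of `host` may cross to the device at
        // most once within this block.
        [[assert::copies_to_gpu_once(host)]]
        {
            cudaMemcpy(dev, host, n * sizeof(float), cudaMemcpyHostToDevice);
            process_on_gpu(dev, n);  // first stage uses the device copy
            process_on_gpu(dev, n);  // second stage reuses it: fine
            // A second cudaMemcpy from `host` here would be rejected,
            // with a message pointing at both transfer sites.
        }
    }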
I think a similar thing could be used for some interesting network/cloud transparency, too: the language lets you wire up function calls/messages/whatever transparently, but you can use labeling to make compile-time (or possibly start-up-time) assertions about how expensive the result is going to be. I've also tossed around the idea that the function call doesn't have to be the fundamental primitive; maybe something weaker should be the cross-module default, like an Erlang message pass. One of the reasons networks are such a pain to work with in conventional programming languages is the mismatch between function calls and network operations; it would be interesting to expand on Erlang and see whether weakening the function call across modules could address that.
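One way to picture the weaker primitive: a cross-module "call" that desugars into a send plus an explicit wait, so the latency and failure mode are visible rather than hidden inside call syntax. A sketch in standard C++, with std::future standing in for the message-pass machinery (the function name is just for illustration):

    #include <future>
    #include <string>

    // Stand-in for a cross-module boundary: the caller writes what looks
    // like a call, but it lowers to send-then-wait, which is honest about
    // the cost whether the callee is in-process or across a network.
    std::future<std::string> lookup_user(int id) {
        return std::async(std::launch::async, [id] {
            return "user-" + std::to_string(id);  // maybe a network hop
        });
    }

    int main() {
        auto reply = lookup_user(42);     // the "send" half
        std::string name = reply.get();   // the wait is explicit, so a
                                          // cost label could attach here
    }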
Anyhow, like I said, I'll never get to any of this, but I do sort of feel like there's a surprising amount of room out there for a new language, and it doesn't get explored very well because people keep remaking Python, or something like D, over and over with slightly different spelling.
You probably already know of it, but you might find Halide interesting. It aims to separate the algorithm (what you compute) from the schedule (how you compute it: tiling, vectorization, parallelism), without giving up performance.
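For anyone who hasn't seen it, the canonical example from the Halide paper is a 3x3 blur where the two halves are stated separately (reproduced from memory, so treat the details as approximate):

    #include "Halide.h"
    using namespace Halide;

    int main() {
        ImageParam input(UInt(16), 2);
        Func blur_x, blur_y;
        Var x, y, xi, yi;

        // The algorithm: what is computed.
        blur_x(x, y) = (input(x - 1, y) + input(x, y) + input(x + 1, y)) / 3;
        blur_y(x, y) = (blur_x(x, y - 1) + blur_x(x, y) + blur_x(x, y + 1)) / 3;

        // The schedule: how it's computed. Changing these lines changes
        // speed, never the output.
        blur_y.tile(x, y, xi, yi, 256, 32).vectorize(xi, 8).parallel(y);
        blur_x.compute_at(blur_y, x).vectorize(x, 8);
        return 0;
    }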
Also potentially PyCUDA, and the other things Google autocompletes when you type "pyCUDA vs".
None of them are exactly what you're talking about, of course.
This is one of those things on my list to try out one of these days. If I'm not mistaken, it's what Google uses to implement the image algorithms in their Google Camera Android app. Marc Levoy is one of the co-authors of the Halide paper [0] and now works at Google, so it's a natural fit. Do you happen to know how widely it's used in industry?
If you mean me (and I guess others aren't likely to come across this conversation):
I was very impressed by Halide, but I'm not involved and have no idea how widely it is used, sorry. I could imagine that many potential users enjoy hand-tweaking GPU and SIMD code enough, or have enough confidence in their results, that they never give it a serious look.