
Disclaimer: I do ML-based work in Google Cloud, but I am not on the AutoML team.

The post says there is transfer learning involved, which in practice means you need much less data than if you were creating a classifier from scratch. Of course, more (good) data may yield better results, but one of the goals behind this release seems to be giving high-performance, custom image classification (your own labels, not just generic object detection) to those who don't have access to Google-scale training sets.
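AutoML's internals aren't public, but a minimal sketch of the usual transfer-learning recipe looks something like this (the base model, input size, and label count below are illustrative assumptions, not AutoML's actual setup):

    import tensorflow as tf

    # Base network pretrained on ImageNet; its weights stay frozen, so the
    # only thing trained on our small custom dataset is the new head.
    base = tf.keras.applications.MobileNetV2(
        input_shape=(224, 224, 3), include_top=False,
        weights="imagenet", pooling="avg")
    base.trainable = False

    n_classes = 5  # hypothetical number of custom labels
    model = tf.keras.Sequential([
        base,
        tf.keras.layers.Dense(n_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    # model.fit(train_images, train_labels, epochs=5)  # hundreds of images, not millions

Because only the final layer's weights are learned, a few hundred labeled images per class can be enough where training from scratch would need orders of magnitude more.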




What kind of datasets? What size? Is it something practical for a small business to gather?


Figure 4 of the DeCAF paper shows meaningful learning w/ only 10 examples! https://arxiv.org/abs/1310.1531


Transfer learning is hardly a panacea, however much some would like it to be.


Disclosure: I work at Google on Kubeflow

Can you say more? I don't think anyone is saying it's magic pixie dust, but it does dramatically reduce the amount of data you need.


I'd probably phrase it as "can" dramatically reduce the amount of data you need rather than "does". Getting transfer learning to work in any kind of reliable way is still very much open research, and the systems I've seen are heavily dependent on basically every variable involved: the specific data sets, domains, model architectures, etc., with sometimes pretty puzzling failures.

I don't doubt Google has managed to make something useful work, though I'm more skeptical of how general the ML tech is. One advantage of an API like this is that it allows control over many of those variables. I'm not sure if this is what it does, but you could even start out by making a transfer-learning system that's heavily tailored to transferring from one specific fixed model, which, coupled with some Google-level engineering/testing resources, could produce much more reliable performance than in the general case.
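Purely as speculation about what a "fixed upstream model" setup could look like (not a claim about AutoML's actual design): pin the pretrained network, treat it as a feature extractor, and train only a simple, well-understood classifier on its embeddings, which removes most of the architecture/domain variables from the equation:

    import tensorflow as tf
    from sklearn.linear_model import LogisticRegression

    # Fixed, never-updated pretrained model used purely as a feature extractor.
    extractor = tf.keras.applications.MobileNetV2(
        input_shape=(224, 224, 3), include_top=False,
        weights="imagenet", pooling="avg")

    def embed(images):
        # images: array of shape (n, 224, 224, 3), values in [0, 255]
        x = tf.keras.applications.mobilenet_v2.preprocess_input(images)
        return extractor.predict(x, verbose=0)

    # With the upstream model pinned, the only moving part is a linear
    # classifier, whose behavior is much easier to validate and tune.
    # clf = LogisticRegression(max_iter=1000).fit(embed(train_x), train_y)
    # preds = clf.predict(embed(test_x))

Testing and tuning one fixed extractor exhaustively is a much more tractable engineering problem than making transfer work across arbitrary source models.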


Disclosure: I work at Google on Kubeflow

As you can see here[1], we do provide quite a bit of information about the accuracy and training of the underlying model.

Additionally, AutoML already (often) provides better-than-human-level performance[2]. Your comment about a system heavily tailored to transferring from one specific fixed model is basically what it's doing: it takes something domain-specific (vision) and allows you to transfer it to your domain.

[1] https://youtu.be/GbLQE2C181U?t=1m15s

[2] https://static.googleusercontent.com/media/research.google.c...


I was about to type a very similar comment; this captures much of what I had in mind.

I've also seen transfer learning used to justify insufficient validation, resulting in strange generalization failures.


It depends on the domain. It works for images because image statistics are relatively stable across contexts. It doesn't work as well for text, where there's a lot of nuance to language patterns between groups (Yelp vs. Google reviews, for example).


You might want to try www.monkeylearn.com for text.


Um... I don't think anyone here is saying (or even implying) that.


Maybe I was misreading, but I took it as something like: transfer learning is going to be a general solution for not having enough data. That's really not the case.


That's a fair point. There are certainly technical challenges involved in bringing this to other domains.



