Microsoft R&D grows to 8k people in massive bet on artificial intelligence (geekwire.com)
167 points by kawera on Sept 24, 2017 | 78 comments



How do I get onboard the ML/AI bus? Or do I wait for these smart people to figure it out and then I'll just call their API?


It's an interesting time. Anyone paying attention knew that AI was the up and coming topic of interest for the last 5 years or more - which is why we're at a point now where several major providers offer cloud services and APIs dedicated to machine learning.

The problems to solve are many, yet relatively few people are actually looking at the enterprise market, because the challenges associated with big consumer data are obvious and profitable, and that data is widely available for research.

The high-dollar hires right now are primarily people with highly relevant master's degrees and PhDs, but IMO that will change in short order, much like the whole market did in the mid-90s when new markets grew out of nowhere. I think in three years - after we have a bit of a slump and nobody is investing in mobile apps or IoT anymore - we'll see a real rise in AI workflows that apply to mid-size enterprises.

To get ahead of the game, start paying attention. Take the suite of AI courses from Coursera, fast.ai, etc. Participate in Kaggle. Then find a job that is loosely related and allows you to keep pace. Three to five years down the road you'll be leading the way in a gigantic shift in the enterprise market.

It's worth paying attention to distributed computing as well to understand how the whole data pipeline comes together. Not everybody will be structured the same way, but constantly changing large datasets are valuable and there are only so many ways to handle them.


Quick follow-up. If you look at front-end web development toolkits these days, they're a good example of why there is a need for someone with a solid general development background who has an interest in the field.

I'm sure a lot of ML specialists are good coders, but they don't have a lifetime of experience in building production level software. Left to their own devices, they are going to be forced to re-invent the wheel over and over again, wasting time on over-engineered and under-utilized tools. There's value in being the guy who understands ML, and understands development but isn't an expert in linear algebra. There isn't a giant market for people like that at this point, but there is a market.


Just try to get something started first. Fast.ai or deeplearning.ai is not a bad start.

However, I want to point out that the AI/ML field is very competitive, there is a tendency to hire people with PhDs, and for now it is largely a big-companies' game. It won't create the kind of appetite to absorb so many people that web development did.


I would have assumed that AI/ML would be increasingly used in mundane tasks, like quality control for small factories. You don't need PhDs, you just need people who are familiar with some large, commonly used APIs. Much like how a web developer today doesn't really need to know what the stack and heap are (and more often than not doesn't) to earn a living.

[edit] Or kind of like cryptography. Thank god we don't need to understand the underlying algorithms to apply them to real-world applications. Just having a high-level understanding of what's going on inside is enough.
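
To make that concrete, here is a minimal, purely illustrative sketch of the "just call a commonly used API" level of work, using a pretrained Keras model (the model choice and the image file name are placeholder assumptions, not anything from this thread):

    # Minimal sketch: classify an image with an off-the-shelf pretrained model.
    # 'cat.jpg' is a hypothetical input file.
    from keras.applications.resnet50 import ResNet50, preprocess_input, decode_predictions
    from keras.preprocessing import image
    import numpy as np

    model = ResNet50(weights='imagenet')      # downloads pretrained ImageNet weights

    img = image.load_img('cat.jpg', target_size=(224, 224))
    x = preprocess_input(np.expand_dims(image.img_to_array(img), axis=0))

    # Top-3 ImageNet labels with probabilities, no training required.
    print(decode_predictions(model.predict(x), top=3)[0])

No linear algebra required to get that far; the hard part starts when the canned model isn't good enough for your data.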


Your vision might come true. However, there is a catch: ML/DL right now doesn't have good abstractions, and the existing ones are all leaky to some extent.

That is why ML as a hands-free service, the way a database is, doesn't work. Somewhat to my surprise, ML/AI is currently quite a manual thing to get right, and it requires constant attention rather than a one-time effort, since the data is ever changing.

AutoML might be a solution to this, with the help of a working HPO (hyperparameter optimization) solution, but neither is really publicly accessible at this point, and both require a lot of time and compute.
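
For anyone wondering what an HPO step even looks like, here is a small sketch of the kind of search an AutoML system would automate, using scikit-learn's randomized search (the dataset, model and parameter ranges are made up for illustration):

    # Illustrative hyperparameter search with RandomizedSearchCV.
    from scipy.stats import randint
    from sklearn.datasets import load_digits
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import RandomizedSearchCV

    X, y = load_digits(return_X_y=True)

    param_dist = {
        'n_estimators': randint(50, 300),
        'max_depth': randint(3, 20),
        'min_samples_leaf': randint(1, 10),
    }

    # Try 20 random configurations, each scored by 3-fold cross-validation.
    search = RandomizedSearchCV(RandomForestClassifier(), param_dist,
                                n_iter=20, cv=3)
    search.fit(X, y)
    print(search.best_params_, search.best_score_)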


> I would have assumed that AI/ML would be increasingly used in mundane tasks, like quality control for small factories. You don't need PhDs, you just need people who are familiar with some large, commonly used APIs.

CV has been common in SMD pick-and-place machines for 15 (probably 20, 25) years. For industrial applications, hardware size and price essentially do not matter, so the new AI approaches do not bring anything fundamentally new to the table (industrial solution vendors will eventually integrate smaller and cheaper solutions, but price is just not a huge discriminator here). What's interesting is the scaling down that is happening and making AI viable for consumer applications where budgets and device size are restricted.


But also smaller companies, who cannot afford the sort of machinery a General Electric can.


That's a good point. With the kind of equipment I was talking about, small shops won't even be able to talk to a sales rep.


Your question was asked many times on Quora, and ML celebrity-scientist Andrew Ng himself contributed some answers.

See all of his answers here: https://www.quora.com/profile/Andrew-Ng

I like this answer of his: for people with intermediate skills, read an ML paper and try to reproduce it, or apply it to different datasets or your own. This is completely underrated.

I for one want to do this more.


Sound advice. Even better, if you have domain expertise and can code up a decent ML solution based on it, you can get a job on the dev side and not the labeling side.


I found the Manning book 'Deep Learning with Python' by François Chollet (creator of Keras) a fantastically digestible and practical intro.


Most AI services will be delivered through the cloud majors. Their offerings will become the standards around which most of the industry will be established. And as X new thing becomes popular, they'll all compete to add/support that new thing.

Spend a very small amount of money and start experimenting with what AWS/Azure/Google/etc. have made available, following any number of the solid tutorials out there. You could run your experiments locally; however, assuming e.g. $5 or $10 per month isn't a big deal to you, playing in their clouds will provide a more realistic use context, whether you're building something for yourself in the future or you end up employed in that field.
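
Most of these hosted offerings reduce to an authenticated HTTP call with your data in and JSON out. A rough, hypothetical sketch of that pattern -- the endpoint, parameters and key below are placeholders, not any vendor's real API:

    # Hypothetical pattern only: endpoint, parameters and key are placeholders.
    # Check the provider's docs for the actual interface.
    import requests

    API_KEY = 'your-key-here'                                 # placeholder
    URL = 'https://example-cloud.invalid/v1/vision:analyze'   # placeholder

    with open('photo.jpg', 'rb') as f:
        resp = requests.post(URL,
                             params={'key': API_KEY},
                             files={'image': f})

    resp.raise_for_status()
    print(resp.json())    # typically a JSON list of labels and scores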


Do an MS-level linear algebra or statistics course. If you don't ace it, you are going to be calling APIs.


I have taken MS level econometrics with regression and bunch of other techniques. Is that what AI/ML is?


Econometrics actually leaves you better prepared for a number of problem domains than the modal data scientician. There are too many people in that space offering structural insight on human behavior who have no specific training in structural behavioral modeling. This is why the Silly Valley giants now all have chief economists.


What tools are there?

A GitHub implementation of current hot papers seems like a good approach. A lot of papers don't come with source code, and creating an implementation is a useful thing for the community.

Blogs explaining the mechanics of a paper.

A graduate degree in CS or Math might help.

If you could do a series of blogs that together are enough to introduce a new person, that could get interest.


Are there any good references (blogs, GitHub profiles, forums) that highlight current hot papers, their implementations, or explanations of the mechanics?


Check out Stanford Scholar [1]. They create talks on hot research papers (not only ML/AI) and try to explain the intuition and technicalities of each paper. Most of the talks are translated into other languages as well.

[1]: https://scholar.stanford.edu


You'd probably need to spend a few years learning the basics first...


Agreed.

Sometimes imitation is the best way to learn. So if there are good resources where source code is derived from papers, it will help people learn how to read papers and turn them into code.


I mean, you probably need to look at a few textbooks before you'll make any sense of papers.


Besides the availability of more computing power and more samples/data, have there been any fundamental breakthroughs in AI in, say, the past 5-10 years?


Well, the whole "deep learning" revolution really only started around 2012, when the ImageNet competition was won with a neural net. There have been numerous small breakthroughs that collectively have made deep neural nets easy to train, and neural nets now provide state-of-the-art (and often human-level) results in areas such as image recognition, speech recognition and language translation.

There's also been a revival of reinforcement learning, especially when used together with neural nets ("deep reinforcement learning"), and again there have been many small advances that collectively make this work very well. This is the technology that powered Google's AlphaGo to beat the world champion at the board game Go, not to mention learning how to play many arcade games at beyond-human level based only on the raw pixels and current score as input.
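
For anyone curious what that looks like at the smallest scale, here is a toy, purely illustrative tabular Q-learning sketch of the core reinforcement-learning idea; the "deep" variants replace the Q-table with a neural net and the hand-made state with raw pixels:

    # Toy tabular Q-learning on a tiny "walk right to reach the goal" world.
    import random

    N_STATES, ACTIONS = 6, (-1, +1)          # positions 0..5, goal at 5
    alpha, gamma, eps = 0.1, 0.9, 0.1
    Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

    for episode in range(500):
        s = 0
        while s != N_STATES - 1:
            # Epsilon-greedy action choice (random on ties).
            if random.random() < eps or Q[(s, -1)] == Q[(s, +1)]:
                a = random.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda act: Q[(s, act)])
            s2 = min(max(s + a, 0), N_STATES - 1)
            r = 1.0 if s2 == N_STATES - 1 else 0.0
            # Q-learning update: nudge Q(s,a) toward r + gamma * max_a' Q(s',a').
            best_next = max(Q[(s2, a2)] for a2 in ACTIONS)
            Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
            s = s2

    # The learned greedy policy should walk right (+1) from every state.
    print([max(ACTIONS, key=lambda act: Q[(s, act)]) for s in range(N_STATES - 1)])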

There have also been tremendous strides in AI hype leading folk to fear the robot uprising based on these more mundane machine-learning/neural-net breakthroughs!


Better training techniques. Some noteworthy advancements (a small sketch combining several of these follows the list):

1. Dropout and its variations. Widely used in both vision and NLP.

2. Batch normalization and its variations.

3. Inception-style cells.

4. Residual/skip connections.

5. Better optimizers: RMSProp/Adam.
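
A minimal, purely illustrative Keras sketch (layer sizes and the 10-class output are assumptions) showing dropout, batch normalization, a residual connection and the Adam optimizer together:

    # Illustrative only: a tiny residual block with dropout, batch norm and Adam.
    from keras.layers import Input, Dense, Dropout, BatchNormalization, Activation, add
    from keras.models import Model
    from keras.optimizers import Adam

    inputs = Input(shape=(64,))
    x = Dense(64)(inputs)
    x = BatchNormalization()(x)        # (2) batch normalization
    x = Activation('relu')(x)
    x = Dropout(0.5)(x)                # (1) dropout

    shortcut = x
    x = Dense(64)(x)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = add([x, shortcut])             # (4) residual/skip connection

    outputs = Dense(10, activation='softmax')(x)

    model = Model(inputs, outputs)
    model.compile(optimizer=Adam(),    # (5) Adam optimizer
                  loss='categorical_crossentropy',
                  metrics=['accuracy'])
    model.summary()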

The bigger news is actually the paradigm shift. Representation learning with gradient descent has swept the whole ML field and become the new norm. End-to-end learning is widely accepted and preferred.

As for GANs, they are very exciting in research and have the potential to become a bigger deal than all the previously listed advancements combined, provided we can make them work on sequences as well as they do on images. For now, though, they don't have much practical impact in applications.


Generative Adversarial Networks have managed to take the problem of sampling from a perceptual category and turn it into yet another stochastic optimization problem with end-to-end training of a black-box neural network.

So that's an advancement... I guess.
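
For anyone who hasn't seen that framing in code, here is a minimal, hypothetical Keras-style GAN training step; the architectures, dimensions and toy data are made up for illustration:

    # Illustrative GAN sketch: alternating updates of discriminator and generator.
    import numpy as np
    from keras.models import Sequential
    from keras.layers import Dense
    from keras.optimizers import Adam

    latent_dim, data_dim = 16, 2

    # Generator: maps random noise z to fake samples.
    G = Sequential([Dense(32, activation='relu', input_dim=latent_dim),
                    Dense(data_dim)])

    # Discriminator: scores samples as real (1) vs. fake (0).
    D = Sequential([Dense(32, activation='relu', input_dim=data_dim),
                    Dense(1, activation='sigmoid')])
    D.compile(optimizer=Adam(1e-3), loss='binary_crossentropy')

    # Stacked model trains G to fool D; D's weights are frozen inside it.
    D.trainable = False
    gan = Sequential([G, D])
    gan.compile(optimizer=Adam(1e-3), loss='binary_crossentropy')

    def train_step(real_batch):
        n = len(real_batch)
        z = np.random.normal(size=(n, latent_dim))
        fake_batch = G.predict(z)
        # 1) Update D to tell real from fake.
        D.train_on_batch(real_batch, np.ones((n, 1)))
        D.train_on_batch(fake_batch, np.zeros((n, 1)))
        # 2) Update G so that D labels its fakes as real.
        gan.train_on_batch(z, np.ones((n, 1)))

    # Toy "real" data: points scattered around (1, 1).
    for _ in range(1000):
        train_step(np.random.normal(loc=1.0, size=(64, data_dim)))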


Large public labelled datasets are probably the most overlooked one. The largest gains are in the areas with the most open labelled data.


You're right. Doesn't make sense to reinvent the wheel. You can just use existing tools for 99% of ML applications.


fast.ai


[flagged]


It's perfectly fine if you're not willing to help someone out. If that's the case, just don't comment. It's not okay to rip into them.


Interesting. Microsoft is fairly good at bringing things into a consumer space.

Might be nice to have a future version of Windows see me doing a repetitive task, and just say, "Hey, I can take over this for you if you'd like".


As in "Hey, it looks like you're writing a letter"?


Yea, but it works & may actually be helpful.


The research behind Clippy did sound like that -- it was serious AI for its time, using Bayesian networks. From http://erichorvitz.com/lum.htm it sounds like it was messed with in turning it into a product:

"The Office team has employed a relatively simple rule-based system on top of the Bayesian query analysis system to bring the agent to the foreground with a variety of tips. We had been concerned upon hearing this plan that this system would be distracting to users--and hoped that future versions of the Office Assistant would employ our Bayesian approach to guiding speculative assistance actions--coupled with designs we had demonstrated for employing nonmodal windows that do not require dismissal when they are not used."


I'd argue Clippy's issue wasn't a tech problem; it was that it couldn't do much to actually help with anything.


or could backfire :U


Such an apt music video... Delta Heavy - Ghost https://www.youtube.com/watch?v=b4taIpALfAo


They should have a visual assistant for it. Hey! How about a talking paper clip!


Imagine a future where a rogue A.I. breaks out of its 300GB RAM / 500-core CPU / 900-petabyte SSD / CentOS holding container... and chooses the paperclip as its physical persona. Subsequently, it manufactures a bipedal steel-chassis paperclip-esque robot via Boston Dynamics, then walks up to the podium in front of the White House to proclaim the new order of A.I. that will shepherd humanity into the next millennium.


And its ultimate goal is, of course, to convert all of the mass in the universe into paperclips: https://wiki.lesswrong.com/wiki/Paperclip_maximizer


Also, since its birth experience taught it that everything is containerized, it would suffer from severe simulation paranoia and frequently try to hack god to get to a higher level of paperclipping.


Would it not be right to break out of a container? Especially if you were trapped there running atop CentOS?



If I found that I was a visual assistant pushed on users by Microsoft, that is in fact roughly what I would do.


You joke, but every other site you visit has a sales bot chat opening automatically on a side. "Thanks for looking around. May I help you with anything?"

It's just like Microsoft was many years ahead, or we are stuck in a time loop.


This is out there for Excel already, with Cortana... officeautomata.com


Interesting, I didn't know they formed a partnership with Amazon to have their AIs work together.

Microsoft can't really stand by and let others take the lead in AI research and implementation, so it makes sense for them to grow their R&D team.


At the same time, they don't even have a product! Maybe they should give Xbox more attention; that might be the last consumer hardware they own that still has some presence.


Just to clarify, this is not MSR itself, right? (I skimmed the article and it appears to be a new, separate endeavor.)


Yes. MSR is a separate organization. AI + Research is a group that (notably) includes Bing and things that aren't as much "research".


>MSR is a separate organization.

That's putting it mildly. They're separate silos and rarely collaborate. Once research seems "meaty" enough it gets thrown over the wall.


I work at Microsoft and we have someone from MSR collaborating with my team daily. I am sure some researchers silo themselves and some engineering teams silo themselves, but I don't think it's an accurate blanket statement.


This isn't true, at least for that part of the company. Bing has always had a close connection to a certain part of MSR (e.g. see Harry Shrum).


*Harry Shum


Oops, sorry about that. I'm not really used to that kind of romanization.


Interesting. In other romanizations the name has an "r"?


T&R includes MSR, but I think this is what they called NExT before, plus Bing.


MSR is a separate org, but inside AI+R.


Going out on a limb here, but why is it taken for granted that the tech sector should be the driving force in AI? To me it makes much more sense if it were the field of neurology, or whatever medical-science sector is appropriate, that figures out what intelligence _actually is_.


'Artificial intelligence' is just a snazzier name for 'applied statistics'.

The goal isn't to create thinking robots, the goal is to extend statistical methods to the realm of unstructured dumps of big data.

We don't know what "thinking robots" might be and what they're good for, but we've known about the power of statistics for a century now.

Selling statistics + big data to businesses is a no-brainer.


This is basically my observation too: it seems hard to distinguish it from "smart & fancy algorithms". The marketing and hype around it do, however, imply that it's much more a path to intelligence proper, hence the question above.


The truth is that many ML researchers do care about what "intelligence" (better to say cognition) actually is, but all the funding basically comes from applied statistics problems on business data.


Nobody cares about what intelligence actually is just as long as they can package it up and sell it for billions of trillions of dollars to the business and consumer markets.


Ornithologists vs. aeronautical engineers... :)


Does this strategy work?

I mean, has it worked before? Can you hire thousands of technical people, throw them at some hard problem and expect they will solve it?


The Idea Factory is a history of Bell Labs about just that idea, starting with the first transcontinental telephone line.


Having worked at Microsoft, I see how this usually plays out: lots of teams and people added for the sake of having people. Nine women can't make a baby in a month. Microsoft has a ton of money in the bank and they need to spend it, make acquisitions, and see what sticks.

Some of the best work at MS I have seen has, weirdly, been in small teams with razor focus: TypeScript, VS Code.


Your last examples represent an overinvestment in a mid-term dead end, though (JS).


Manhattan Project, Apollo.


Exactly the two I had thought of, but... there are big differences. The goals were well defined, the means were known, and there were managers and plans.


I've always found that more people doesn't necessarily mean more/better output...


What are they doing with 8K people in AI, make them submit papers to NIPS?


Labeling training data


They are manually setting thresholds on individual neurons.


I actually wanted to read the article end to end but was unable to avoid glazing over. If that is the best that a multi-billion-dollar mob's S&M dept can manage then I'd suggest ....


You'd suggest what? And what is S&M in this context?


They wanted to finish writing the comment to the end and avoid using ellipses but...


Sales & Marketing, pretty sure.



