Hacker News new | past | comments | ask | show | jobs | submit login
Voice Is the Next Big Platform, and Alexa Will Own It (backchannel.com)
162 points by steven on Dec 19, 2016 | hide | past | favorite | 287 comments



God I hope not.

Voice input is a terrible UX, because as a user, you have no idea what the system can and can't do. Every single Voice UX I've used for input (Siri, Alexa, etc) is a fumbling around experience where inevitably I can find out the weather, turn on some music, and then hear some half-funny replies from the device because it doesn't actually know what to do. It can work 99% of the time, but if that 1% it breaks down and I have to revert to a device with a screen, it kind of ruins the point.

Maybe Amazon is addressing this with their supposed Alexa with a 7" screen in the future.

What I want is improved Voice Output UX. I think Apple is trending towards this with the AirPods, where you can leave them in your ears all day and have Siri just talking to you, which is the only practical use of any of these things unless you're stoked about annoying everyone around you with your robot assistant awkwardly talking out loud all day.


Voice input is what we humans use when we speak to each other.

Things said in our home all the time:

"Do you know what the weather is going to be like?"

"Is it going to rain today?"

"We need to buy some milk."

"The roast need one and a half hour."

We speak all the time with each other and there is nothing weird about that.

I bought a Google Home and it's literally transformed our lives in the little details for the better.

Before asking for the weather was something which required one of us to find the phone, open it find the weather app.

It sounds banal but if you have to do it every day because you take your kids to school believe me it's a godsend.

My wife was very sceptic about this and is certainly no tech fanatic, she now loves it because as she said she can now turn listen to radio just like in the old days when all you had to do was turn on the radio.

My kids love talking with it.

And it's basically just a very practical assistant which is revolutionary in the little details not some sci-fi scenario.

It's not disrupting anything it's improving the quality of tech by allowing us to interface with technology in one of the ways which removes yet another layer of abstraction.

You don't need to know what the system can do as long as it can do enough Google Home certainly can and it's just getting started.


Voice input is what we humans use when we speak to each other.

Except that humans have an average IQ of 100. They understand context, they get what I'm trying to say. Try getting Siri to turn off all repeating alarms, just for today. Or turn off all alarms until 12pm.

"But that's not how Siri is supposed to work!"

Sure, fine. It's just a ux to a limited api. But then you can't also compare it to how we use the same input for humans. Robots aren't humans. Not by a long shot. They have an api, a very specific and limited set of capabilities. They are not flexible. They don't get context.

Voice interfaces have their place. Because once you know the api, the possibilities, voice may well be most practical. But not because we happen to talk to humans.


Siri and Google Voice are in completely different leagues. I find Siri completely useless; I use Google's voice interface all the time; it even understands context.


The product you're talking about I think is "Google Now". Google Voice is Google's VoIP-ish service, precursor to Google Fi


and completely abandoned.


Abandoned but atleast it still works. Can make a new phone number and I can still call US numbers abroad for free.


but sending an sms via the app fails 80% of the time


Heh :)

Thats like complaining about a commodore 64 not being able to run the same games as a Playstation.

The discussion is not whether Alexa, Siri and Google Home are perfect AI's but whether they are good enough for some very basic things like those I described.

And no it's not because we talk to humans that they are good, it's because we interact with other humans through speech.


If you really understand context, a lot of the time you don't need voice at all.

Consider a voice operated light switch. Why do I need to tell the switch I want the lights to turn on when I enter I room? The house AI should know the ambient light level, the time of day, my location in the house, and my default lighting preferences, and lighting should "just work" unless I want to change something - which is when I can ask for it.

Alexa can't do any of this yet. I can give Alexa commands to turn lights on or off, but currently there isn't even a context for the current light state. So I can't say "Alexa, lights" and have Alexa work out whether that means "turn the lights on" or "turn the lights off" for the room I'm in.

IFFT may eventually be able to do this, but it's far from transparent and straightforward.

It's about affordances. I have a reasonable idea what a human's affordances are. The current voice UI (vUI?) equivalent is a handful of dots of implemented functionality surrounded by huge areas of not-working-yet space.

Not only is there no map, there's no way to guess what might be on the map.

It's also about cognitive load. I'm typing this in a bedroom with a couple of 433MHz switches controlled by a remote - one for a light, one for a heater.

Using the remote takes no conscious load at all. When I had the switches controlled by Alexa, formulating a command took effort.


I disagree. My habits would be so hard for a computer to learn, mostly because they already frustrate humans who are much smarter. When the sun goes down, I don't want all the lights to turn on. I don't even necessarily want all the lights on in the room I'm sitting in. Many times I sit in a lit-up room just because I can't turn the sun off. Once it goes down, it's nice to sit in the dark watching TV.

That is unless I'm reading a book, in which case I need a light on. But not just any light, I'd probably just lamp next to me to shine on my book instead of the one that lights the whole room. I don't need the whole room lit, just my book. So now the contextual system needs to have cameras to see if I'm reading or just watching TV or playing on my phone or napping.

And when I walk into the bathroom in the middle of the night, I'm fine with the nightlight above the toilet. Turning on all the lights will wake me up and ruin my night vision. But my wife likes the light on. So now we need facial recognition to tell who is walking into the room.

Right now I have to get up and turn on a light if I'm on the couch reading. If we're talking effort, that's a hell of a lot more effort than saying "Alexa, turn on my reading lamp". And don't get me started on the effort it takes to try to find the missing remote...


All what you are saying is true but it's missing the point I was trying to make originally which was that what it can do now is more than enough to be very useful for many of us.

I don't think you are appreciating the things that used to be which got lost with digitalization and now can get back.

It's not just about what the AI can do, it's how it gets expressed.


I'm with you on this most of the way.

However, the voice ux projects at big tech companies are inevitably about collecting training data.

Five years down the road that limited API is going to be a lot less limited.


The general issue I have today is that, for a lot of things, voice can feel like an exercise in coming up with the right wizard's incantation--especially as you add Skills, etc.

But that's not really an inherent issue with voice. It's just that today's systems are mostly stuck with requiring relatively strict adherence to specific syntax. This will certainly improve over time.

I don't find Alexa as transformational as some people I know do but I find it useful enough.


Google Assistant is slowly getting better with stuff like this. While usually I stick with the commands and syntax I _know_ works, occasionally I'll try getting fancy and throw it a curveball by just speaking 100% naturally and Assistant will surprise me by actually doing what I intended.

For example, a while back I was using Assistant to look up showtimes for a movie at a specific theater, and just to see what would happen I said "show me a map" without giving any explicit context. I was expecting Assistant to come back with a search query or maybe pictures of generic maps in response, but to my surprise it actually understood that I was talking about a map of the theater and gave me that.

Of course, that specific response in that exact situation isn't in and of itself all that impressive, and for every time Assistant succeeds in figuring out what I meant in a situation like that there's maybe one or two other times when it has no clue, but the idea of being able to speak 100% naturally to a virtual assistant and having it "just work" is unbelievably cool.


Just to elaborate for those of us without kids...

As a parent in New England, you have to choose between 3 jackets and when you are trying to get socks on two kids and can ask "alexa, what's the weather today" and know the answer without having to pause and get a phone and let one of the kids escape, that's really nice.


Thank you for that. I was thinking; wait people actually care about the weather but this makes sense. As someone without kids, we have not checked the weather for over 10 years and so using that as an example 'what people talk about' sounded weird to me.


It is interesting how little we now need to think on the weather.

With modern heating systems (specifically thermostatic radiator valves) we no longer need to ensure that the heating is prepared at the right time, so we don't need to know the weather.

With the prevalence of tumble driers washing doesn't need to hang out, so we don't need to know the weather.

Most modern clothing is suitable for a very large range of temperatures and good waterproofs can be comparatively small and lightweight, allowing them to be brought everywhere without burden, so we don't need to know the weather.

Cars and roads have improved significantly, the impact of the weather on a small or long journey is often negligible, so we don't need to know the weather.

More jobs are performed indoors now and where they aren't special machines and tooling have been made that make the job easier alongside significantly reducing the risk and discomfort that weather can cause, so we don't need to know the weather.

There remain some situations where knowing the weather is very important, but very few of them affect 'the general populace' is a consistent enough manner to cause them to waste time following it.


Conversely, many jobs need to be performed on the schedule that was agreed upon beforehand regardless of the weather. The weather is rarely accepted as a legitimate source of everyday disruption and misery, except in extreme events.


Case in point, my area got 11" of snow overnight recently and my wife got yelled at for being late to work because subcompact cars don't do too well in the deep snow. So people end up buying SUVs that they drive year-round just for the 1% of the time when they're really needed.


So looking at the price difference... I'd buy a smaller car and tell my boss that its not worth it (unless I'm being paid the big bucks).

Then again I'm British, and an American I've recently befriended has shown me how different attitudes to the employer employee relationship can be


Idk.. sounds very weird to me because when I was a kid in the Northeast I had one (1) winter jacket. If it was between September and March I left the house with the jacket else I didn't. Simple. I can't imagine my parents ever checking the weather and I can't imagine a child owning several jackets. Very odd. People make things much too complicated.


> People make things much too complicated.

Pretty much the story of modern civilization. Just step outside and check the temperature, and look up at the clouds and guess if it's gonna rain later. Wear a sweat shirt or waterproof windbreaker if in doubt. Or you can just not care about getting mildly cold or wet. I say this as someone who spent 25 years in Minnesota.


Honestly there has been a few times in my entire life I've checked the weather report for guidance on how to dress. Every single time I've been burned and ended up dressing inappropriately. I do much better using my own senses. However, I don't really use an umbrella because I never had one growing up so I guess if you wanted to make sure you weren't caught in the rain without one you'd have to check the weather report.


And then you have a kid who you have to pick up from school, you are going to be at work until you pick them up. It might look fine in the morning but might rain in the afternoon.

But sure, you don't need umbrella either and we could all be driving open cars :)


Meh, if my kid is anything like me, they won't care about getting wet. If they do care then they can buy a Google Home with their allowance and check the weather to their heart's content. Plus, no plans to send my future children to school or to be working a 9-5 job.


My kids don't care about getting wet either. I do.

Don't get one if you don't want to, the discussion is whether it's useful and it is. Not just for the weather not just for rain. For many little things in life.


This winter so far the temp ranged from 2f to 60f... In 48 hours... So in those cases you know if it is single jacket, double jacket, or sweatshirt.

That's one example. Schedules, playing music, guitar tuner, uke tuner, metric conversion when hands are dirty from cooking, and plenty of other examples are abundant in our house


Yes, I am extremely familiar with that temperature fluctuation. I still owned 1 winter jacket and if I got hot I unzipped it or took it off. We didn't have money for multiple fancy jackets or sweaters or whatever. You'd outgrow your multiple jackets before you even got to wear them.


So it doesn't rain in the summer? It doesn't rain in the fall? It doesn't rain in the spring?

Nothing complicated about it, it's just a reality for many people.


Well, in my case, I didn't own an umbrella as a kid and neither did any other kid I knew, so I guess it didn't matter.


And 200 years ago we didn't have umbrellas. Sure, none of it matters. You don't need a television either. You don't need running shoes to run.

Not sure what you think you are proving here.


I'm simply answering a question you, yourself, asked. Sorry for having the "wrong" answer.

I didn't really think of rain in my original comment because when I was growing up we literally just walked in the rain.


you didn't use computers either, or cellphones, or visited this forum.

Again whats your point?


+this. You just described every morning at my house (including asking Google home what the weather is), and we only have one kid. It's also very useful that everyone in the room gets to hear the answer. Same for setting timers. "you have five minutes left of play time. " It makes the process very transparent. We have a 20 minute walk to school, rain, snow, or shine, and getting the munchkin dressed appropriately (and willingly) is key.

Both Alexa and Google home have been great with a kid.


People used to have passive displays of the outdoor weather, that could be checked handsfree.[0] No voice-recognition spy machine necessary.

[0] http://www.leevalley.com/en/images/item/Gardening/ab804s1.jp...


Personally, I devote the upper 1/8th of my phone's homescreen to a widget that shows a weather graph (temp, rain/snow, wind) for the next 36 hours. The forecast updates every 2 hours.

But then again, we live a few hours north of the location where they filmed the Battle of Hoth. Asking anyone, let alone a machine, "what is the weather outside" gives you little information about the weather in six hours.


Must be interesting to live that far north. FWIW further south people configure their phones likewise because the question of whether there will be rain is when rather than if - or at least during the rainier season(s).


Or you can buy a bit more advanced but disconnected from IoT botnet:

https://www.amazon.com/AcuRite-02027A1-Forecast-Temperature-...


Yes good point, my granddad had barometer etc.

There is something about the tangibility of information we use a lot. Instead of it being hidden behind login, app launch etc we just access it directly.


Voice is great to have conversations. To give short commands, other interfaces are good as well.

For example, I can check the weather for today on my phone with minimal fuzz, clicking 2 or 3 times and getting much richer information (a full graph with short term forecast, a few hours in advance, etc) in a single screen shot. Another example, ordering a pizza. I prefer a GUI over calling and taking to some human. The options are clearer, and I can review easily what I'm ordering.

Not to say that it's impractical, but voice doesnt carry a lot of info for a lot of services. We'd prefer a phone call over an specific tactile interface for everything, and that's not the case...


I a not sure I understand what you are getting at.

I gave some pretty specific examples of what it's great for.

I am not saying it's great enough yet for much more complex things no disagreement there. But unlike Seamless web or mobile interface I could potentially just with the command, "order the usual from Joe's Pizza" skip a whole suite of interactions needed to order food today.


For some tasks, yes. But even in the case of human conversation, the spoken word is often a poor medium of information exchange.

Telling Alexa "send me some paper towels" only works if you have established a protocol in writing with Amazon (i.e. I want a 8 pack of jumbo bounty).

Your use case "how is the weather" is useful, but is it useful enough to be "the next big thing"? It may be big, but not smartphone big.


It's funny - we've been remodeling our house, and as part of it, added a bunch of z-wave compatible switches. I was moving some stuff back in (we've been out of it), walked into the house at night, and after living with an Echo and a Google Home for the last year and a half, had a very strong urge to say "Hey, Google, turn on all the lights." -- unfortunately, I hadn't moved the Home in yet! :) I was carrying a large box and it would have been fantastic.

My experience with both devices is that there's no one Big Thing, except maybe music. There's a lot of stuff that just works surprisingly well by voice. Quoting my daughter: "Hey, Google, play Puff the Magic Dragon" - "What's the weather today?" - "Turn on/off the lights" - "Set a timer for 5 minutes" - "What's 17 tablespoons in cups?" - "How long does it take light to get to Mars?"

It's probably not smartphone big. But I think that in the long term, we're going to find that some kind of voice-based interactive device like this will become very common.


That was not my only use case though.

I cook a lot, timing is important.

I have fun playing games with my kids teaching them trivia, Google Home does that while we are free to do other things.

We listen to a lot of different music. Google at home allow me to stream from Spotify simply by saying the type i want.

My wife likes to listen to radio, she just tell Google at home to start a radio station.

Listening to the news, google does a pretty good job assembling what is most important during the day.

You can ask it to play the latest "Startups for the rest of us" podcast simply by asking it to play that.

You can ask it when you next flight is.

How long it's going to take to get to somewhere.

You can add things to a shopping list which is important when you are family.

And these are just things that it does out of the box. I don't care about switching on lights which it also can do and turn the heat up or down.

What I think most of you realize is that just like the touchscreen removed a layer of abstraction to access technology because it allow for things humans do naturally. So does the voice. It's extremely intuitive as there is nothing to learn. As voice gets better and better it's going to change a lot of things.

The sales numbers of Alexa is pretty telling too and this is from all sorts of people like my parents in law who now have one they use and understand.

I don't care whether it's going to be the next big thing, but to claim as the parent did that voice is a bad interface is simply as wrong as it can be and to claim there is no value in what it can besides a few things, well it turns out at least for my family those few things are pretty useful.


Building a huge complex and expensive system to save you 3 second to check the weather is not what I would call revolutionary.


Then it's a good thing that thats not the only thing it does and that it's never been the claim from anyone.


Right now it's pretty much what it does. Small little things that takes 2 seconds without it. And it's less reliable. I've yet to witness a revolutionary use of IoT.


I use it for a hell of a lot more than just the weather. Google Home is reliable. Siri isn't.


My phone is always with me, either on a countertop or in my pocket.

"Ok Google, is it going to rain today?" is something I can (and do) ask at any point. No need to have a new device in every room of my house and no need to learn a different interface when driving in my car etc.


So you are agreeing with what I say :) Voice is easier than opening your phone and finding the app to launch it.


> My wife was very sceptic about this and is certainly no tech fanatic, she now loves it because as she said she can now turn listen to radio just like in the old days when all you had to do was turn on the radio.

Oh god please. I can't understand how complicated radio is these days. Fuck me, I'm forever returning or rescanning or some bullshit just to listen to something.


Is there something weird about radio in the US? Radio is still the same old AM and FM stations here, you just "turn on the radio" to listen to it like you always have. Am I missing something?


That's because our Inteligent Designer only gave us low fidelity analog I/O ports. We can do way better with keyboards and digital signals nowadays.


> just like in the old days when all you had to do was turn on the radio

Weather and traffic reports are still a staple of radio broadcasts


Lol your entire family is being data-mined, and your kids will grow up to feel like this is normal, good job.

Hope it was worth the convenience.


You are being datamined just being on this website. True but trivial.


Well of course. But that is certainly different than having a Google-powered microphone that is always listening.


Well a microphone cant hear what website you are visiting. So not sure what your point is.


I'm not sure what your point is?

As if knowledge of me visiting a public site is comparable to Google having an always-on microphone that probably is riddled with 0days and has constant internet activity that can easily mask a malicious connection?

it only takes approximately 10 minutes of recorded audio to interpret individual keystrokes with a ~96% accuracy. [1]

Google / NSA / foreign state hackers / your next door neighbor could potentially find or pay for an exploit to the device and have all of your passwords down within a few days. All of your children's passwords.

It can reconstruct every message you've typed in range of the microphone. Ergo unless you rely on bookmarks and not the address bar, it can in fact track which websites you visit. Furthermore, it could interact with ultrasonic sounds emitted by your computer speakers to track your ad experience, and if you live in a malevolent state, to locate you. [2]

If your kids are a problem to the police state, they will be identified and marked before they even know they want to be political activists.

This is the world you aren't only allowing, but are defending as well.

[1] http://rakesh.agrawal-family.com/papers/ssp04kba.pdf, https://people.eecs.berkeley.edu/~tygar/papers/Keyboard_Acou...

[2] http://arstechnica.com/tech-policy/2015/11/beware-of-ads-tha...


You would be surprised what you visiting public sites can be used for. Insurance purposes for instance. You should be much more worried about that.

And then there are the ISPs you use to access the internet. Or the cellphone tracking your wereabouts or your camera and your microphone on your laptop. Or your router.

I could go on. If that's what you are afraid of that game is already lost.


Why do you insist on thinking I'm also not concerned about those forms of unlawful surveillance?

You're trying to invalidate my legitimate concern over a device's security implications by telling me other devices have security implications as well. No shit.


There are certain times when a voice interface is dramatically faster, simpler and more intuitive than a visual UI. One example is setting a timer or alarm. Another is asking for conversions (how many teaspoons in a tablespoon).

The Amazon Dot is my favorite kitchen tool, we use it more than we do spoons at this point. The ability to set multiple timers, convert units and measures, and remind you to do things all while playing your favorite tunes is amazing.


I tried using Google Now for timers for a while, the amount of times it didn't start properly or you'd say:

"Google Now, Set a timer for..[slight pause]...<How long would>three<you like>minutes<a timer for?> [Pause, pause, pause] Three minutes. Three minutes! Three Fuck<Setting timer for>ing Minutes<three minutes>.

I am now manually inputting timers again.


Yeah, well, don't assume Siri will help you with that. <Long press Home> "What's the..."

"I'm sorry, mikestew, I didn't catch that."

Maybe if you waited more than 2.6 milliseconds for me start flapping my pie hole, you'd have something to catch.


I presume on your phone? I set echo timers/alarms nearly every day and don't think I have ever had to repeat myself. The far-field mics make a huge difference in recognition accuracy when you aren't talking directly into the device.


I like Alexa in the kitchen for shopping lists and some other tasks such as, as you say, converting measurements.

I don't actually care that much for it as a time though. The other timers I can easily set in the kitchen tell me at a glance how much time is left. Alexa I have to ask. These are small things but it means I don't default to using Alexa.


If you only have one timer going and you're watching the clock then it is a bit of a pain.

However when you're preparing a meal or making cookies and you're constantly moving and not watching the clock, just responding to it, it becomes handy. It's also nice to not have to stop and wash your hands to set an egg timer.


I was going to suggest the initial use cases for voice control are summed up as "when your hands are busy not on a keyboard."

HN isn't the best sample group for this, because for most people that's a pretty large amount of the time.


I use the timer feature all the time for kids time outs. I've got four kids, and often need multiple timers. It's perfect because I can set it from the other room while resolving whatever issue sent them away. I do wish it had an "Add 5 minutes" feature though, for every time they come out early.


Billion dollar kitchen timers


You laugh but it is a seriously good thing. I mentioned the 'side project' I had submitted that Google legal had rejected, it was combining Siri with timers and cooking facts as the ultimate kitchen timer. As the GP points out, there are a bazillion factoids you might consult while cooking and a huge win for cooking is having everything come out at the same time (co-ordinating timers) so as an app on your phone following you around in your pocket it can be pretty killer.


"I see you're baking chicken, Dave. 98% of recipes agree your requested baking time and temperature will result in an overly dry bird. I'm afraid I can't allow you to set your own smart oven parameters anymore..."


Given how badly people screw chicken up on a regular basis, I'm not even seeing the problem here, and I'm usually the one to shout Skynet during an otherwise reasonable discussion on AI.

Fucking up food is a sin.


Dave needs to get a food thermometer.


No. A billion dollar system that is so accessible that it's not even insane to connect a simple kitchen timer to it.

Sci-fi got it wrong with their humorous stupid robots providing comic relief while smart robots do the work. All the robots are the same smart, and getting better every day.


I could see Alexa married to a Kindle with a kickstand being a killer kitchen tool.


Same here. Dot in the kitchen connected to 10" subwoofer+satellites. Spotify/Amazon Music (kids' favourite), setting timers, listening to news in the morning. That's about it. Jokes and calculator sometimes.

I'd like it if the knowledge graph was better, for answering questions. Just wikipedia queries is a bit lame. I think Google Home will win on that one.

Oh, and I'd really, really like to be able to voice call/skype through it (presuming the array mic + software will be able to isolate my voice/remove reverberation well enough).

IM (with TTS) could be fun too


Home automation tasks also tend to lend themselves much better to voice interfaces than touch because voice commands are always available regardless of where you are or what devices you have (or don't have) with you.

I was a huge skeptic of voice assistants before the Echo came along, but being able to turn my lights on and off with a single voice command alone has been more than worth the price of entry for the Echo Dot + TP-Link Smart Switch combo I got for $60.


The same criticism can be levied against keyboard-driven tools aimed at power users. That doesn't make the tool less useful, it just shifts its target audience.


Sure, but it does contradict any assertion calling it the "next big platform".


Exactly. The use cases and target audience are different. I use my echo dot for playing songs, switch off lights and add random alarm/timer etc. The future not might be voice only, but it will definitely voice intensive.


"Big" only works if someone can make big money. It can't have advertising without becoming useless. So, Voice activated lights are "The Clapper 2.0" which was not a big deal last time around.

Sure, there is a niche for reordering products easily, but commodity voice tech is going to rapidly hit zero margins. Like those 10$ TV dongles that used to take a full PC.


The clapper was a huge product. Millions of units shipped, mostly for $30 1980s dollars at a high margin.


"Millions" is not such a big deal. 10 Million of units shipped * 30$ ~= 300 million retail.

Google makes 330 million in profit every week.


Why does every business have to reach Google profit levels, though? A business that turns a healthy profit with low margins is a successful business.


Sure... but how much does Google Home make?


I have this exact criticism with modern GUIs. Hide the scroll bars. Move all the icons into a hamburger menu. Strip chrome from everything.

It's maddening. There's absolutely no way to discover how to use apps like this unless you already know how.


The side swiping menus (like in Snapchat) are so confusing to me as an old guy. How the heck are we expected to know you can swipe left and right on a screen to reveal a completely new screen?!?


It's discovery/referral gating. Teens who get Snapchat upon hearing it from a friend can ask their friends on how to use it, and it stimulates conversation and feature discovery. If you don't have at least one existing friend who is experienced at using Snapchat, you're probably not in their target market (by design).


You see this social gatekeeping in how they distributed their spectacles, too.


There's no good way, and I really dislike this modern trend of un-discoverable UIs.


Slack's iOS app has like two side-swiping menus essentially stacked on top of each other. One of those menus (the first one) is NEVER what I want. The whole thing is so confusing, and I have to relearn it every time I actually need it.


>https://www.google.com/#q=snapchat+swipe+left

Just consult the nearest "X things you need to know/didn't know about Snapchat" article /s


Because you try it once or twice when exploring a new touch interface?

It takes all of < 1 second. I get it is obfuscated but we aren't talking about a crazy level of interface hiding.

People expect to be able to swipe on most interfaces these days.


It does take < 1 second. And then so does swiping right. And left. And up. And pressing. And long-pressing. And double-tapping. And two-finger tapping. And two-finger swiping (in all directions). For every element on the screen. For all screens.

See you in a few hours.


I can't stand how almost every mobile app use icons without labels, so I have to press them to find out what they do. If I'm lucky there's a tooltip available if I long press.


I've seen these complaints, slightly re-worded a number of places, and I remain confused. The changes aren't that big.

"File, Edit, View" etc model is condensed into 3 horizontal lines. The lines I suspect were modeled on the idea of a row, with one option per row after another. Click "View" in a random app, oh look, one after another. Don't like that notion? Think of it as a Start menu in ever app, and you still have to drill down. It's barely a change, since the idea has existed since Windows 95!

Small screens need less chrome to highlight content. It's about the user having the best view of their content, not peddling a catalog of "me to" features and pretty icons in people's faces. Turns out, people actually enjoy seeing more of their content on desktops too, rather than rows of default icons.

Scrollbars? People still stop what they're doing to look for the scrollbar? I stopped dealing with that once MS shipped the Intellimouse 20 years ago. Tap/drag on mobile. You know how to tell if you're at the bottom of a document? It won't scroll further.

"It's maddening. There's absolutely no way to discover how to use apps like this unless you already know how."

But you KNOW how to use them already. They're just slightly modified. And these things have existed long enough now, there's no excuse not to just swipe either side or tap the hamburger to see what's up? I mean, how am I going to know what the new app using classic "File, Edit, View" menus does? By using the input options I have, mouse, keyboard, to dig through them. That hasn't changed on these new desktop apps. On mobile, input options are fingers, tap and swipe about.

This falls into a category of "Personally, I don't want to rethink things ever." type arguments.


This is the Infocom problem: You're trying to present the user with two fundamental illusions. 1) the computer can understand what you're saying, totally and completely, and 2) that you (or it) can do anything.

However, it can't, it can't, and you can't. And when the illusion fails, it's frustrating: Neil Stephenson would call it Metaphor Shear, I (being less fancy) would call it why-didn't-they-give-me-a-better-ui-or-at-least-a-manual-dang-it.

This is ultimately why I prefer unambiguous interfaces over "friendly" ones.


I only have two problems with voice, one of which is UX related:

1. It's slow. Apple AirPods are a good example. Siri controls the volume. Talking is slow. The input and response until the volume is changed is slow. Faster would be to use the long side of the AirPod as a touch control and sliding up and down it changes the volume. One finger slide = volume change. Two finger slide = track change. In any case, touch controls are way faster than voice.

2. It's weird in public, and faulty in loud environments. Talking aloud to yourself is going to get strange looks at you. I tend to do things that don't bring me attention, so I don't use Siri. Also, good luck using Siri at a concert or loud train commute.


> because as a user, you have no idea what the system can and can't do

This is how I thought about search without search operators. Then Google got it right and owned the market.

So I agree with you -- but I also think someone will eventually get it right, and that company will take off.


It's like a CLI- intimidating and not discoverable at first, but hopefully powerful later on. Except I am not sold on how it will be powerful yet: it lacks th "replay" that cli's have with scripts, and it lacks composability right now.


Tab completion, the "history" command, and using the up-arrow weren't in the first shells either. Eventually someone will find a way to create custom macros with voice commands.


I have no idea what that would look like for the end consumer but I remain hopeful!


> It can work 99% of the time, but if that 1% it breaks down and I have to revert to a device with a screen, it kind of ruins the point.

I don't think your statistics makes any sense.

99% of the time, you save time by speaking your intent, vs interfacing with your phone. That time saved is meaningful and outweighs the 1% of the time you have to fall back to your screen device.

99% of the time, you can use your hands/fingers for other tasks while speaking your intent. That allows you to multitask more efficiently.


Voice input is a terrible UX, because as a user, you have no idea what the system can and can't do

Very true -- our Echo can control the lights, but I can never remember what each light is called, while with a GUI, I can see the list and it's obvious that "Livingroom - TV" is the light near the TV that I want to dim, while "Livingroom - TV table" is the one on the table to the right of the TV and I want to turn that one off.


In addition, Voice works well if you are alone in a silent enviroment. It does not work well if kids are playing loudly in the same room and the wife is yelling at you from the next room.

I also don't like the feeling of having a dedicated microphone set up that records everything, waiting for a command and is completly intransparent about what it sends back to Amazon/Google/Apple.


It's great for certain interactions. I use it to turn off the lights and set alarms before I go to bed, get the weather every morning, make grocery lists, etc. Nothing huge, but the reduced friction is great.

Also, for the $100 price it frequently goes on sale at, it's a great wireless speaker with Spotify integration built in.


Google Home is an exception to that I think. Partly because you have a good idea which Google searches result in a knowledge graph result. Those are read out by Google Home.

Also you're being very "No wireless. Less space than a nomad. Lame."


I want voice-activated lights... That's about it.

The problem is: The electronic light control modules are expensive, break down after 1-2 years, and then go obsolete.

I'll spend more time rewiring my house than stumbling around in the dark!


> I want voice-activated lights... That's about it.

https://en.wikipedia.org/wiki/The_Clapper


The led wifi light bulbs seem like a good option when you have actual lamps. When they become obsolete in a few years you buy another one?


Maybe this could work like Tesla is trying to get auto driving working:

When you say 'Hey Google / Siri / Alexa / whatever' a connection is set up to a real human who can control your house automation. Then the system could be self learning because the operator is translating your command into an action.

But it would not be for me. I absolutely never felt the need for home automation (don't think it is going to make my life better). And I care for my privacy.


The point about talking out loud is the key one for me. It doesn't matter how good the interface is, I don't want to talk out loud to myself in public. The same way I don't use magic wand like gestures to control phone.

Talking, while potentially more natural, is also simply slower than most other input types.


>Voice input is a terrible UX, because as a user, you have no idea what the system can and can't do.

This is the same as the command line. Just like the command line, voice may not be completely intuitive initially but for regular use, it's probably much faster than a colorful GUI.


much powerful maybe, but not much faster.


Even using Siri, which everyone can agree is terrible, having her set a timer using voice is much, much faster than picking up your phone, unlocking it, navigating to the Clock app, switching to the right tab of the app, then setting a timer.

"Hey Siri, set a timer for fifteen minutes". Done.


That's a bit disingenuous. iOS command center swipe up, timer button and it'll remember your last setting. Works faster than voice for the most common use case.


You aren't including the time it takes to find your phone, pull it out, maybe unlock it, and THEN do the steps you describe.

That's friction, and that's where these frictionless interfaces step in.


>That's a bit disingenuous

People love to misuse that word.

I just tried it with best case scenarios, and it's still a second or so faster using voice (1.5 seconds) vs control center (3 seconds). I mean, voice is literally just yell out "hey siri set a timer for 15 minutes" and that ends the interaction. Done, just like that.

If you're changing the time that is set, it's significantly longer.


I didn't see Google Now in the list of "every single voice UX you've tried", so recommend trying that. It's pretty great actually, and only getting better. I've tried most of these systems, and Siri is actually a joke in comparison.


Thx. I thought i was the only one WHO HATE Voice input. Especially when i have a long day i dont want to speak to a bloody machine just to get things done.

I think Voice Input should be an Additional, not as the ONLY input method.


Voice is a command line style UX and is a tool that is useful in those cases. You have to memorize commands although.


"you have no idea what the system can and can't do" - shouldn't matter.


I agree that the UX is pretty bad now. But Potentially, a system could monitor your on screen behavior and educate you unobtrusively when you do something that might have been easier by voice. Theoretically, the device could monitor your location and background noise to understand if voice commands are reasonable at that time.


I want to see voice assistants become actually useful and usable in the coming years, and agree that massive data-vacuuming like "monitoring background noise" is probably the fastest way to pump enough data through ML and AI techniques to push them forward, but that feels unnervingly intrusive.

I avoid these devices for the privacy concerns and also the generally poor security of IoT devices. I hate to sound like a luddite, but i really abhor these devices and how hard the tech companies are trying to foist them upon us as "the next big thing".


I am not proposing sucking in all the data, I mean an awareness of its environment so it knows when voice is appropriate. For instance if it can detect me driving in my car (due to speed) with no passengers (noise it has heard) maybe it can understand it should use voice.


You do realize that in order for the assistant to know you're in a car, and that you're "at speed" it would likely need access to your location information, right? Aggregated over time (even if only when the assistant is "on" - ie, the iOS "while using" setting), that's a pattern of your movements.

Either Apple/Google/etc and granulize the permissions such that things like "boolean: user is in vehicle at speed" are addressable separately (possible but unlikely) or such knowledge comes with a cost - the app knows where you are and have been.

Same applies doubly for "noise it has heard".

Consequently my decision is: no assistant, thankyouverymuch.


Ok, so how exactly would assistants do that? They'd need to take environmental inputs such as GPS data to determine if you were travelling in a car. It would need to have consistent access to your microphone and speakers so that it could gather and compute info relating to your environment (noise levels, voice recognition, etc). Combine that with the fact that most phones already track your location fairly precisely. There exists almost zero incentive for Google/Fb/Apple/Amazon/MSFT to NOT collect and aggregate massive amounts of user data.

How will it learn to compare all these different environments? ML obviously; which pretty much entails leveraging the consumer data to provide a better product. The cost of usable, voice-driven assistants is giving up a vast trove of personal data and abandoning the concept of privacy completely, just so I can order some paper towels from an Internet of Shit connected device instead of walking three blocks to a store.

Your position is akin to a Louis CK joke(not offense meant): "Oh boy! I hope he doesn't do what he's going to do!"


I start from the firm belief that "technology X will never be worse than it is now."

Whether you're considering voice interfaces, connection speeds, machine learning, web development, phone battery life, or whatever.. things are trending towards the better. Sometimes it's steady improvement, other times it's punctuated. Either way, the floor on what we can considering is always increasing.


Counter-examples: Privacy (almost always getting worse), security (no clear trend, but it would be hard to argue it's consistently getting better), usability / discoverable interfaces (debatable)


Privacy, I'll agree with.

I think security is getting better but what we see now are the consequences of years of poor decisions. It's the post-Christmas credit card bill.


This is rather debatable when it comes to human-machine interfaces. UI goes through phases of improvement followed by radical degredation due misguided, cargo-culted redesigns. Take cars as an example. In my car I can adjust the airflow and temperature with a button and a knob, but in newer models a touchscreen or voice command is required that takes quite a bit longer.


I start from the firm belief that "technology X will never be worse than it is now."

While not quite what you meant, Firefox is frequently bloody awful now and it used to be quite usable :)


Mozilla as an org is off the tracks and has been for a long time.


When I visited Berlin last year, I visited the DDR Museum. It's not about the RAM or the dance game you might know, but instead the organization we just know as "East Germany." You can probably guess where I'm going with this.

One of the exhibits was of a common East Berlin living room. The exhibit explained how the state had hidden microphones here. They were listening for people to express anti-socialist ideas in their own homes. This was a real threat faced by people not more than 30 or 40 years ago. You could have your life turned upside down just for saying the wrong thing at home.

This exhibit drove home for me the inherent abusive powers enabled by some tech. One of these technologies is always on, always connected microphones. Why would I want one in my house? Given the potential downsides, what could possibly be worth it to justify having one?


The risk is rather low if you don't live in a totalitarian state and you don't engage in extremely subversive or criminal activities.

Sure, the government could force Amazon to let Echo spy on you. But they could force Apple or Google to do the same thing with your phone. Speakerphone isn't as good as far field mikes, but it's pretty good.

The government could also just bug your apartment the old fashioned way.

The big fear would be that echo is storing audio of your house that a future government could use against you. But there is no indication that Amazon does that. It only sends audio after it's triggered by the codeword. Honestly, my google and bing histories are probably a lot more damning.

But if Trump goes all hitler on us, I'd throw mine out.


You're assuming the device never acts incorrectly/has bugs. Why?

Also, the recent findings show it's not just subversive actors who gets surveiled. There are already massive dragnet operations. That's happening right now. So it's not some hypothetical, that's reality.

I think the better approach, rather than waiting and finding out too late, is just to avoid these altogether. The world has mostly been pretty safe but the security appears to now be threatened and civil liberties are rapidly eroding. So just for that reason I think our position for ourselves and for friends/family should just be to pass altogether on this type of device


I wonder if, in the future, it will be just as hard to find a "dumb" toaster/lightswitch/thermometer as it is to find incandescent bulbs.


The problem with this logic is you don't know what the future will hold, yet the data is being captured and stored, likely in perpetuity, today.

This means a future dictator will have a huuuge corpus of data on everyone with the perfect surveillance apparatus to leverage it. And by then it is too late.

I wonder what direction our new Russian-influenced President will take things given Putin's KGB background and savviness in shutting down free speech.


That's a strange thing to say. "The government" (whatever that may be) doesn't force anything on us.

Yet, when everyone has email, that becomes a rather important data source. That draws the interest of those organizations tasked to keep us safe. They certainly wouldn't be doing their jobs very well if they left that trove untrawled, and voilà, soicograms of everybody.

When everyone has a live audio feed from their homes to Google, Amazon and Samsung that becomes an even bigger data source. It would be unthinkable not to mine it and I don't think anyone knows what the effects of that will be.


> subversive or criminal activities

My standard counter-argument to this is "drugs". Drugs are a sufficient bug-bear to be used by the UK government as justification for invasive surveillance, but there are so many drug users in the UK that if they tried enforcing that law they would bankrupt the nation three times over from each of prosecution, building and staffing sufficient prisons, and the sheer number of people in prison who would then not earn any money or pay any taxes.

And that is just from a thing the government uses as its justification for the surveillance power it now has.


> The risk is rather low if you don't live in a totalitarian state and you don't engage in extremely subversive or criminal activities.

And what happens when failing to have an Alexa or equivalent devices becomes an act of subversion or at least suspicion?

> Sure, the government could force Amazon to let Echo spy on you. But they could force Apple or Google to do the same thing with your phone

I think you'll find this is an argument against Google and Apple handing over data wholesale to the government, not an argument in favour of getting another device that does that same thing too.

> The government could also just bug your apartment the old fashioned way.

You mean with judicial oversight and at some considerable amount of effort and expense that would make dragnet surveillance unpopular both within and without the government meaning its use would be reserved only for the sorts of activities proponents of surveillance say it would be used for? Yes, I'm okay with the old fashioned way.

> The big fear would be that echo is storing audio of your house that a future government could use against you. But there is no indication that Amazon does that.

The big fear is that by inviting these amoral entities into our lives they CAN (not will) perform these immoral activities at the request of other parties. It enables it.

> Honestly, my google and bing histories are probably a lot more damning.

Yet you still submit to the surveillance. Good for you. Not everyone is will to make the same compromises.


And you think you wouldn't be arrested for throwing it out?


Even if you trust the state or at least trust it to some degree, security breaches and blackmail are still a concern. Cue Black Mirror S03E03


That's a valid point and at the same time, we have to wonder about things like bundled cameras and mic's on computers, cell phones, TV's. These days it seems like the act of having an internet connection at all can open this type of door.


Quite. So it becomes a matter of trust. Amazon and Google are probably on the wrong side of that balance. For a lot of IoT apps the net isn't the appropriate place at all - local net is better. I'm not a fan of thermostats that prevent you having heating because some cloud server is being DDOS'd etc.

I had hoped and frankly expected that there'd be a few more privacy oriented choices in this space by now. There was Zoe[1], but that got canned at last minute. I really didn't want a refund...

[1] https://www.indiegogo.com/projects/protonet-zoe-start-your-s...


I spent a few minutes on the campaign page but couldn't find details about the closure of the project (it looks like it's still open). As someone with an interest in privacy-protecting home devices, can you explain a bit more about why it closed?


Click into the updates, 3 months ago. I think the only thing on the main page, now the funding has closed, is the updated FAQs at the end including info about refunds.

In short 5 months ago all seemed to be going OK, then they went a tad quiet. Three months ago there was another update including a CEO statement.

"Due to unforeseen delays in the development, we are not going to deliver the ZOE in the current form and all contributors will be refunded." They mention it's a partner having development delays in some critical component, but all very vague with no details.

Supposedly the project is alive and we'll get an update email when they're ready to try again. I infer from refunds all round it's not going to be soon. If it was just a few months I figure they'd ask for patience to work around whatever issue.


Ok, thanks. I'm certain all of the necessary technological building blocks exist to make this kind of thing a reality, but all the megascale funding goes to mediocre, insecure devices and services. These cloud consumer perceptions and make it difficult to know what would actually work in the market.


There's probably scope for an open project here. There might be one - never searched as Zoe ticked all the boxes for me. Local voice recognition, open to the cloud for better accuracy, and set exactly what gets to talk to the net. From a company with some experience in private cloud and hardware.

I don't know enough of IoT specs, protocols etc to know what would be needed other than the broadest terms. Probably yet another use for a Pi 3.


Yes, but in all those examples the camera/microphone defaults to "off", not "on".

That is the most important distinction. Sure my laptop has a camera but if I keep good security practices I can feel reasonably assured that it is not monitoring me unless I've switched it on for some reason. The Alexas and Google Homes of the world default to "on", and everything you do/say is monitored because "cloud processing".

I will always have a problem with anything that tries to use my actions to build a more predictable model of who I am or what I am/will be doing


Either you can be scared all your life or adopt a system which is going to make your/your family's life more convenient.

Wrt your fear of always on microphones, there is a possibility that this may also happen on your smartphone where some rogue app might be listening to all your conversations. I don't see this to be quite different from that. I guess as more and more people adopt Alexa and you see the benefits, you will reach a tipping point where the pros outweigh the cons..


> you can be scared all your life

Resistance to technology that has unbounded risk isn't "fear". It's vigilance against an increasingly dangerous threat. Ignore that risk at your own peril.

> this may also happen on your smartphone

What smartphone? Regular phones still work, and a full size screen and keyboard has much better ergonomics.

Also, the existence of malicious software doesn't mean it's a good idea to voluntarily increase your attack surface. Lockpicks exist, but you probably still lock your house and car.

> you see the benefits

That will never happen, because voice input isn't ever going to be compatible with my apartment's old walls. Talking loudly at night is rude to the neighbors.


I tried Alexa a month ago and gave up on it within a week. Digital assistants fall flat in walled gardens. It was clear during this week that Alexa only worked well with Amazon services, which were always subpar. Google search? Nope. Support for Chromecast? Nope. Support for Instacart? Nope. It was basically a perverse link to the Amazon-only supply chain.

Google's assistant might be striving for more openness, but I don't have high hopes here either; at least until a formal development kit is released. Given that Google won't be able to use its same tried and true ad revenue strategy, we can expect them to offload voice requests to the highest bidder or prefer Google services above all else. This is another perverse link into Google's realm of anti-privacy; but worse, because they won't be able to do ads along the side. The responses will be the ads.

All of this to say that all of these locked down digital assistants will fail until a company truly approaches it from an open and wholistic perspective, and one that doesn't rely upon troves of private data. These devices may sell well this year, but so did digital picture frames. How many of those are still in use?


I'm not really sure why you think it's Amazon only or a walled garden. I have limited experience with it but from the research I've done thus far it seems to have a fairly robust development platform.

I'm under the impression that if it feels Amazon-only it's because of a failure of companies and devs to take advantage of the platform, not a failure of the platform itself. For example the Google services you're asking about, is that Amazon's fault or has Google not taken the time to develop that app because they're focusing on their own product?

I've used my echo dot most often with 3rd party services so far (daily briefing integrations, capital one, yahoo fantasy football).


I agree with this perspective. However, you have to wonder why we don't have more Alexa skills in the platform. Here are a couple of my thoughts.

1. They require you to say "Alexa, tell <app_name> to <do_something>"; This pushes all 3rd party services one level deeper.

2. Amazon competes with too many companies and services these days. Instacart devs won't want to work on a skill within a platform that is itself is a competitor.


#1 does get annoying. I think it does make the experience a little more jarring having to use the same sentence structure for each question. For example, if I want an update on my fantasy football team, I have to say "Alexa, ask yahoo fantasy football"... this is weird and not natural. In normal conversation, I would say "ask yahoo", which isn't specific enough, or skip right to "alexa what's going on with my fantasy team?" since I only have the one team and one integration capable of dealing with fantasy sports.

#2 I agree is an issue. I wonder if with the current environment any independent would be allowed to gain traction, or would the competitors lock them out of services that users find mandatory (amazon shopping lists, google search, apple music, etc).


Plus if you have a million skills on the platform, the namespace for reasonable things will fill up pretty quick, consider you evoke each skill by the app name


I agree 100% #2.


I don't recall where I saw it, but I recall reading a comment from Google saying they weren't planning on having "Apps for Home" where the voice commands were things you installed like on Android/Echo, but were keeping it locked down and working with partners for the foreseeable future.

Alexa is actually not a walled garden at all; skills are almost entirely separated from each other, and I think that will be one of the biggest problems though. My gut says that the reason language is convenient is that it allows for things to be implicit rather than explicit and I expect the voice assistants that will be most useful are the ones that can best understand this context, which will require deeper integration.


> I don't recall where I saw it, but I recall reading a comment from Google saying they weren't planning on having "Apps for Home" where the voice commands were things you installed like on Android/Echo, but were keeping it locked down and working with partners for the foreseeable future.

Except that: https://developers.google.com/actions/

OFC partner negotiations will need to take place in this; this is a new field with massive implications for customer security and caution and oversight _should_ be the watchword. Imagine the open ecosystem model for this and the disasters it could bring.

But the door to the ecosystem is plainly marked now and the rules are posted over the queue. You can do it today.


Ok, I went and found what I'd read: http://www.theverge.com/2016/12/8/13878444/google-home-devel...

> The first and most important difference is that Google is not going to create an “Action Store” where users can select which ones they want and “install” them on Google Home. Instead, Google itself is going to approve all the keywords that developers want to use to invoke their actions and make them all available to everybody.

> That effectively means that actions will be curated by Google (like an app store), but users won’t have to install anything before using them (like the web). “It's not a direct analog to any existing ecosystem,” says Jason Douglas, director for actions on Google.

That article does specify that there will be a hand-off to 3rd party systems, rather than Google orchestrating everything, but it still sounds quite tightly constrained and I think that if the plan is to have things all work nicely together and not be silos, then tighter integration with more Google control will be the name of the game.


Even explicit instructions don't work for some reason.

With Google voice assistant I can't say "look up the New York Giants score then text it to Phil"

I don't recall reading that functionality in the Alexa docs, either.

Seems natural and logical to me.

* I had to specify New York Giants because Google always brings up the SF Giants if I don't


> Google's assistant might be striving for more openness, but I don't have high hopes here either; at least until a formal development kit is released.

But they have released one? https://developers.google.com/actions/


That's weird, I use my Alexa almost entirely to connect to Spotify, Google Calendar, and TuneIn (for radio stations). I never use Amazon services through it except music and the occasional Alexa-based sale.


I agree, I own two of them. I use them to play music from Spotify and set timers (which is actually very helpful, and makes me realize how lazy I am).

However, I get an email once a week from Amazon with "whats new" on Alexa. Its always a few useless things like holiday trivia and then some garbage you can order from Amazon. It was a ground breaking device, but its clearly meant to just be another gateway to Amazon's weird, limited ecosystem.


> Given that Google won't be able to use its same tried and true ad revenue strategy

Is there a technical reason why they couldn't play an audio ad after a response based on what you asked for and your account history? It seems to me that voice UIs, like mobile UIs, have constraints on the quantity of ads but don't make them impossible. These constraints, by reducing the ad space supply, can also drive up the price for said space, especially when you consider the audience for an audio ad is A) captive and B) not restricted to a single individual.


I have this hope that Google will shift to selling hardware and services and ween itself off search revenue gradually such that people pay them for their AI directly or indirectly.

This would make users the customers and better align incentives letting them do things like put user privacy first.


Google opened up the assistant platform where you can create 'Actions' similar to skills on Amazon. It's only a matter of time, Google Assistant supersedes in 3rd party integrations.


Please checkout Hound/Houndify https://houndify.com

I used to work there, and I can definitely say that are working hard and fast at becoming a non-walled-garden platform for voice.

The voice capabilities are still lightyears beyond Siri/Alexa, but they have spent more time focusing on big partnerships than the fickle user facing market.

I hope they make some moves very soon, because frankly I believe their tech is truly superior.


Not tried google home, but Spotify support on the Echo is pretty good - aside from the ability to create an on the go playlist (please add this!) its actually working really well for me.


I helped writing the Alexa skill for The Guardian [0], and I really didn't think highly of it, nor I thought I'd ever use it myself: the voice recognition is still poor, and it's difficul to understand what you can/cannot do with it. All in all, a big meh.

Then one day someone forwarded to our team this email, and it really touched my heart.

I am quadriplegic and the Amazon Echo has transformed my life. I haven't been able to open a newspaper for 10 years, but now I can! [...] Please pass this email on to all of those concerned at the Guardian... They need-to-know the difference that they have made".

[0] https://github.com/guardian/alexa-skills-lambda


One our teams works with this community. They have inspired and awed us.


My (unbiased) 2 cents..

“Qualitatively, Amazon’s position is more secure than the numbers would indicate.”

I beg to differ. I believe Apple has a much more secured position than Amazon in the long run, given how Siri is already integrated in iOs mobile devices and macOS based desktops. So far, has Apple capitalized on this advantage? Nope. Are they gonna? Most likely, given the fact that they always wait for everyone to show their projects in high school science fair and then they roll out their better comprehensive showcase.

If Apple does not launch a voice based home automation product in late 2017 or early 2018 then it might be too late.

Apple's true competitor in this domain can only be Google with its staggering command on Android ecosystem. Microsoft effort is limited to Windows based PCs and even they might have a better advantage than Amazon.

Amazon has mainstreamed a voice tech. They have done it well and made a point that this is the way of the future. Now the second part of the problem is to integrate already existing daily use computational devices with this "voice-tech". Amazon literally has none, no mobile and no desktops/laptops/tablets.


>If Apple does not launch a voice based home automation product in late 2017 or early 2018 then it might be too late.

Why would it be too late? This isn't a social network where the network effects make competition almost impossible. Apple already has a platform in HomeKit that leverages the much larger iOS installed base. More HomeKit compatible devices are being released all the time.

In fact many people criticized Apple for imposing requirements around hardware-based security which slowed the release of devices, but the recent DDOS attacks and the Internet of Shit Twitter account has shown that was the correct move.

I don't think there's any deadline for Apple to release an Echo competitor. If they release a product that is better/differentiated in some way, people will buy it. And they'll likely have a number of HomeKit devices already that work with it.


> This isn't a social network where the network effects make competition almost impossible

There are still economies of scale. More microphones being used more frequently means more data on which to train.


Apple said there are 2 billion Siri requests made per week.

http://www.theverge.com/2016/6/13/11906654/apple-wwdc-2016-n...


I wasn't making a specific claim regarding Amazon vs. Apple. Just pointing out that scale, and thus time-in-market, does have competitive meaning in this context.


Well in this context Apple already has scale. There's no advantage to Apple to releasing an Echo competitor sooner rather than later.


I like your recent DDOS attack point in context to IOT devices. One argument in favor of Apple's wait stance could be that they just want the IOT/home-automation market to mature enough for them to come in with their voice-tech integration.


It's a bit weird that Apple recently killed their Airport products[1]. Seems like it would be a good thing to retain, after all the router/wifi is sort of the nexus of any home. Sure, they may not directly make a lot of money off them but they that's also the case with many other things Apple make (iOS, for example).

Most of Apple products (Mac/iPhones/iPads/AppleTV/[future Apple home product]) go through them.

Google went the other direction, recently adding routers to their product mix.

[1] http://arstechnica.com/apple/2016/11/report-apple-has-broken...


Routers are mostly built into broadband modems now, or provided as part of the service. They are dumb; the value is moving toward WiFi-enabled smart devices like the set-top box or Echo or heck the smartphone itself.

The big selling points of Apple WiFi routers were the ease of use and the integrated Time Machine backups. Now they are hard to use (since there is already an ISP-supplied router to work around) and people backup online instead of locally.


Apple had the Airport line for 17 years. Google announced the launch date for their new mesh WiFi network and Apple ended the line 1 week before the Google release.

Probably a weird coincidence but is interesting.


I think this market is Google's to lose, providing they can execute.

The price point of Google Home is aggressive, and the same functionality is provided (or will be soon) to the Android ecosystem via Google Now / Google Assistant.

Their voice and intent recognition is getting very good. I have noticed a marked improvement over the last year, even in noisy environments. I think competitors will struggle to keep up, and this is probably the single most defining feature of a personal assistant.


> Microsoft effort is limited to Windows based PCs and even they might have a better advantage than Amazon.

Not really. Microsoft has dozens of apps on iOS and Android, including Cortana, and it has Cortana on Xbox One as well. Microsoft also has a huge cloud business, second only to Azure. The Windows PC is one of the smaller parts of Microsoft's business nowadays.

Microsoft has a big advantage over Google, because it supports online, on-premise and hybrid operations. It has a big advantage over Apple because it doesn't really care which hardware you use.

> Amazon literally has none, no mobile and no desktops/laptops/tablets.

Amazon has a substantial tablet business with a forked version of Android, and has its own app store. It also has plug in Fire TV products (which work much better than Apple TV), and an e-reader range, with content libraries to match.


> Microsoft also has a huge cloud business, second only to Azure

Ooops. That should say "AWS"...


>Amazon literally has none, no mobile and no desktops/laptops/tablets.

Amazon has Kindle Fire tablets[0], [1]; these have Alexa built-in.

[0] https://www.amazon.com/dp/B018SZT3BK/ref=twister_B01BRWH8G8?...

[1] https://www.amazon.com/dp/B0189XYY0Q/ref=fs_ods_fs_tab_ts


Kindle? I know it's not necessarily a general purpose tablet but they have some experience with hardware.


Given how possessive Google have become of Android, I doubt it Amazon would want to invest any further in Android based devices.


I think you're right. Alexa and Google home do well at...home. But Apple want everyone using a phone/watch/Airpods to interact with their AI assistant wherever they are.

We can argue about the quality of Siri, but I think Apple have the best chance of dominating the wearables market.


Always-on surveillance devices (at least the ones I know about and for which I do not control the endpoints) do not have a place in my home.

I'm increasingly feeling like humans around me are remarkably less protective of their private spheres than I am, and am curious as to where the difference lies. I'm pretty dull - it is not like I'm protecting the privacy of my wild and crazy lifestyle. But the idea that the noises made in my home are being shipped to and stored... somewhere, with access controls unspecified (but to a company that has proven it rolls over on command for the government, at least when not discussing taxes[1]) such that I don't know who may be listening to it[2], is the sort of thing that effects how I behave in my own home. And pardon me, but fuck that, no way.

A friend compared to to having sex in front of pets, but I'm pretty sure my cat doesn't speak any human languages, hasn't written systems to routinely disclose data to various groups of humans, and isn't likely to try to sell me $product to improve my performance. For instance.

The main place voice is useful to me today is in the car. I'm already on display in a fishtank, and talking to what amounts to a remarkably clever turnip that sometimes gets something right is useful.

[1] http://www.cnn.com/2010/US/12/01/wikileaks.amazon/ . Sure, just a run-of-the-mill TOS violation, nothing to see here.

[2] Doesn't matter that the unknown humans likely aren't. I'm not claiming rationality.


You're not alone, I think there's a slowly growing awareness amongst the general public after the Snowden events. I have totally non-techy friends asking how they can get their TV to stop listening to them, etc.

On the other hand, a very large segment of the population subscribes to 'if you have nothing to hide, don't hide it', which I attribute to ignorance of how these things are misused, and subtle propaganda telling consumers not to worry about it.


Honest question: Do you own a cellphone?


Honest answer: I do. More than one.


This article briefly covers Google Home's advantages in the conclusion, but I think they are pretty understated.

The fact that I can have roughly the same experience as Alexa on my phone at anytime is huge. I know that you can get the Alexa app for your phone, but having Google Assistant as a first class citizen is huge. I think it is a matter of time before Google catches up and surpasses Alexa here.


I have a Nexus 4 and have used Google Now extensively since it's inception as a hands-free system in my car. Since the inception of the Pixel and Google Assistant, the system has degraded significantly. Now no longer works when the Maps App is in the foreground, instead Maps seems to have it's own VR solution that doesn't work well at all. Saying "Ok Google, send a text to <wife>, I'm on my way home" no longer works, it opens Now with a search for exactly what I said. At one point you could say "remind me to wash my ass tomorrow" and it would just work. Now it prompts "ok, when would you like to be reminded to 'wash your ass tomorrow'?".

This degrading of UX is very common with Google through the life cycle of products. What's going to happen to Google Home next year or 2 years down the road? We saw OnHub get castrated in less than a year after launch. The Nexus Q was still born.


I don't disagree about Assistant, but OnHub and Nexus Q are bad examples. OnHub was not castrated, in fact it's a first class citizen next to Google Wifi and is now upgraded with Google Wifi's mesh networking capabilities. Nexus Q was never actually sold to anyone; the product category is now filled by the Chromecast for $25.


I have both and Google Assistant/Google Now is a much better experience than Alexa/Echo. The google voice interface is much more intuitive, allows context to be kept in a conversation, and is with me wherever my phone is. With Alexa I feel like I'm constantly searching for the right way to ask her to do something.


Ya, this article heavily underemphasized the research Google has done in conversational systems. These devices won't have mainstream appeal until you can just talk to one with minimal instruction.

Although at this time Google Home is only conversational for search and is still very bad about being entirely keyword based in context; I suspect that this will change.


Had echo since launch and now have the Google home. The Echo uses commands that support some fuzziness. The Google home you just talk.


Amazon does have one killer feature over Google right now. You can buy physical goods of them. The people I know who make the most use of their Echo are also milking the free shipping on Amazon Prime.


Am I the only person who finds talking to these personal assistants profoundly uncomfortable? Like I've spent all my life learning how to interact with a computer, and now these companies want me to interact with it like it's a human (which I'm not very good with anyway) but it's not a human. Instead it exists in this uncanny valley where it sounds a little bit human, but its cognition is so completely limited that I have to turn myself into a limited cognition robot to communicate with it?


Yes! I was just going to post almost the identical comment. I detest talking to computers, whether personal assistants like Alexa or Siri, or the voice-control systems on the other side of a phone call. Apparently computers don't much like talking to me, either, because they never understand what I'm asking. I usually resort to just saying 'representative' over and over again until I get through to a human... And yeah, it all feels incredibly stressful.


More than once I've opted to give up on a service that requires me to say stuff out loud over the phone. Flight information services, for example. "If you want to look up flight info, say 'flight information'." Nope, I hang up. If it were Press 3, I'd happily use it.

I'm not sure what it is, but talking to computers just makes me wig out. I get extremely self-conscious, wondering who is listening or might hear me. This doesn't happen with humans on the other end.


It took me a few weeks/months to get used to Alexa and I have to say, it's really nice to be able to come home and say "Alexa play some music" and she always knows I want some nice Jazz. When you say it in the morning, she plays morning music. It's nice.

Saying "Alexa 5 minute timer" is also convenient. "Alexa, weather" is less convenient because she doesn't know that I prefer Celsius over Fahrenheit. This is unfortunate, but at least she can do a conversion.

But then you do an "Alexa what's the velocity of an unladen swallow" and she won't even perform a basic Google or WolframAlpha search and give you the first result. Just says she doesn't understand. Lame.

Before all that, though, I had a lot of essentially stage fright whenever that Alexa blue light came on.

Same thing happened when I started using Siri after I was already used to Alexa. Invoke the voice thing, and the prompt comes on, and I am awash in stage fright, unsure what to say.

It's weird.


>> "Alexa, weather" is less convenient because she doesn't know that I prefer Celsius over Fahrenheit. This is unfortunate, but at least she can do a conversion.

There's a device-level setting in the mobile app to set Alexa's preferred system to Metric...


I know. It doesn't affect temperature for some reason.

Or I'm growing senile and didn't actually try that when i thought I did.


Works for me. Mine started out giving me F but I switched it to C no problems.


I get the feeling Amazon aren't very good at consumer-facing platforms. My experience with everything Fire related has been poor, their Android apps are 2nd rate and the Kindle is good despite their software. I wouldn't write off the competition yet. Apple, Microsoft and Google have good voice products and the ability to forge partnerships.

Also (cliche warning) I only know one person who owns an Alexa or has even expressed an interest in buying one.


A small tangent, but what's the most compelling use case for an Alexa?

Based on the commercials, it seems to solve trivial problems like answering trivia questions. Why can't it vacuum my room like a Roomba, fix my leaky ceiling or fix my power outage? That's what I call an useful home assistant. Being able to parse and understand my voice is nice but it's relatively useless if it can't perform extremely valuable tasks.


On the one hand, I generally agree.

The things I'd really like an assistant for around the house--and which I pay people to help me with in various cases--are vacuuming, cutting the lawn, dusting, doing (and putting away) laundry, doing (and putting away) dishes, etc. Activating lights with a voice command is pretty far down the list.

That said, given enough smarts and interfaces to enough online services, a purely virtual assistant would still be pretty useful for a lot of people. "Book a trip with these general parameters." It uses my preferred booking services, knows my preferences, and comes back with some choices. Get to the level of at least a competent personal admin and that's a service I'd be willing to pay for even if it can't pick up a broom.

Expedia gave an interesting demo at AWS re:Invent around a digital travel agent. It was a very simplified demo--and, of course, Expedia specific--but I think it did give you a glimpse of what's possible.


IMO: A voice controlled internet radio with really good sound, backed up by a respectable music library for individual songs. I'd have bought it for that alone even if it didn't do another thing.

Snark aside, once you get it paired with reasonable smarthome products, there's an element of magic there. "Alexa, turn on the kitchen lights" feels downright futuristic.


"OK Google... find ceiling repair near me"

I know it seems a bit overt, but I'm pretty serious... I find myself reaching for google now more often than I ever have before. They're a step past Alexa when it comes to the sheer data that Google has to pull from. My only complaint is sometimes I hit clear all, and I can't get back to the reminders auto-set via email invites.

I also don't like that they've effectively deprecated the hangouts UX... they made interacting via SMS outside the phone pretty painful. I know there's hard costs to the stuff that came out of Grand Central / Google Voice, but I feel like when they first launched Hangouts, the experience was better overall.

In general using Google's services works so much better than anything else I've tried. That said, it's kind of creepy ay times.


I would do a quick Google search using a lyric or something to get song name to then tell my Echo. Now have the Google Home and no longer need the Google search.


Its the first really usable internet radio appliance I know of. It doesn't cost that much more than a similar quality bluetooth speaker. I already have prime membership to get free shipping, so prime music is free and the jukebox functionality beyond using my phone is also free plus its not tied to my phone which is nice.

Neither it nor amazon seem to understand the concept of a family or being in public even at the very lowest level. I can't imagine tying it to my personal account unless I lived alone. Anyone can use it to do anything they want. I have been rickrolled, all someone needs to do to rickroll an echo owner is walk by an open window and yell "Alexa (pause) play never gonna give you up by rick astley" then run like hell. It was funny the first time. I would imagine if I were dumb enough to link my amazon purchasing account to it, I expect I would now own a lifetime supply of dragon dildos from jokers walking by my window. Being able to respond to any human voice with no training sounds like an awesome idea, and it is, mostly, but unlike star trek daydreams, Alexa has no idea who's talking to her and if they should be allowed to talk to her, so she can only be given access to stuff that mostly doesn't matter.

IoT means closed non-interoperable silos but there exists a java project that emulates a Philips lightbulb Hue controller accurately enough to fool Alexa, and it can execute arbitrary URLs, and misterhouse automation can most certainly control lights and appliances using arbitrary static URLs so I have a pretty decent voice activated home automation. This brings up the point that most people cannot handle setting that up, but we're in the equivalent of the 1980s home computer boom where everyone has a different idea what a home computer would do, but we all agree everyone needs one. Alexa can do all kinds of weird things, surely it does something useful for everyone. Supposedly she has lots of sportsball features, none of which would interest me, but someone probably wants them. Everyone probably wants something Alexa does.

Being located in the kitchen I use the alarms and timers a lot while cooking.


>A small tangent, but what's the most compelling use case for an Alexa?

Somewhat predictably, the best working use cases are ones that drive additional revenue for Amazon. Playing music from your Amazon Prime Music subscription, ordering laundry detergent and dog food from Prime Now, etc.


> A small tangent, but what's the most compelling use case for an Alexa?

An excellent question especially in the context of adding another always on audio device to a private space. I think for many people the value has to be more than just a novelty to justify it.


I'm sure AWS will happily host any of the hundreds of startups who want to (try and) beat the Echo in the "consumer facing" aspect. It's a win-win.


> Amazon owns the next platform.

No. We built a market analytics app on top of Alexa and we have Echos all over the office. It's great.

But I don't want to be tied to stationary speakers in the office. I want my app integrated with the devices I carry with me everywhere. Google and Apple are poised to win here once they open up Assistant and Siri to third-party development.


Wasn't that one of the big pushes of the latest iOS release? Opening up Siri? Sad I haven't seen as much as I would've expected, or as quickly. I was quite excited for it but it has yet to be realized.


I cannot fathom how people willingly put themselves under this level of surveillance. Especially if you are not USA citizen or resident and your privacy is not defended by the US constitution.


I don't think everyone will desire a live mic in their living room or bedroom, and not just any wireless mic, but one that understands what they are saying.

Private spaces are essential for creative thought, exploration, experimentation and just being yourself. The price is too high - it will answer questions for me, and execute commands, but I will lose my last bastion of privacy as an admission price.


Maybe this is offtopic, but Backchannel seems to have a lot of these glowing, puffy profiles of tech companies looking for exposure for their new products. I mean, parts of this piece read as marketing copy. "It all starts with that tiny speaker. The Alexa-enabled Echo is a true unicorn, one of those rare products that arrives every few years and fundamentally changes the way we live. In 2017, we will start to see that change."

They had a few other similarly uncritical, almost breathless pieces on Okta [1] and Vote.org [2] earlier this year. Am I paranoid/overly cynical, or is Backchannel becoming the outlet PR departments grant access to in return for coverage they know will flatter them? Being hosted on Medium, they don't serve ads (AFAIK)... so this may even be a part of their funding model.

[1]: "A Comany You've Never Heard Of May Have Solved The Password Mess" https://backchannel.com/a-company-youve-never-heard-of-may-h...

[2]: "This Y Combinator-Backed Company Has a Secret Weapon to Sway the Election": https://backchannel.com/the-simple-secret-weapon-that-could-...


All those microphones, always listening, always reporting to headquarters. What could possibly go wrong?

Why isn't your desk in front of the telescreen?


Out of curiosity, what is stopping these platforms from being able to learn from users? For example, when I say something to Siri / Alexa / Home that they cannot process, I would love to be able to manually train them to perform an action and have them "learn" this new feature. (Something along the lines of, "ok Siri, whenever I say 'blah blah blah', open this app, go to this page, hit this button, tell me the content in this box")

Obviously this is not an easy task, and maybe it is being worked on already. Thoughts?


Check out https://arxiv.org/abs/1607.00061 - "Towards A Virtual Assistant That Can Be Taught New Tasks In Any Domain By Its End-Users."


That's a pretty nuts assertion. Alexa is doing well for the sector but the sector is still incredibly immature and the general public is barely aware of it. That's not enough of a market dominance to really say they own it.


Voice is the next big platform? But… talking to your devices out loud just feels weird as hell. And offers less privacy, even in terms of "parents/roommates/strangers overheard what you're googling". A lot, A LOT of people wouldn't be comfortable saying out loud what they type into their phones.


I think you are right, but the internet has created some cultural shifts about privacy perception.


I think it's very early to say who owns the voice platform.


Maybe this same person wrote a "BlackBerry owns the next platform" column.


To be more specific, "BlackBerry owns the next autonomous driving platform".

https://www.bloomberg.com/news/articles/2016-01-06/blackberr...


> only five percent of American households have an Alexa-powered device right now

And how many people have a Google-powered phone with them every day? Siri on their iDevice? Cortana on their auto-upgraded Windows 10 PC and in their living room Xbox? All of them have Alexa's penetration beat.


I want to admit that I was wrong about this. I thought Alexa was going to dominate as the first truly great voice platform. I now am forced to admit that I was incorrect. Those who follow Hacker News a lot might recall that I wrote several posts, a year ago, in which I was very optimistic about Alexa. But I am much more pessimistic now than I was then.

When I was still excited about Alexa, I wrote stuff like "How to build conversations via the Amazon Echo"

http://www.smashcompany.com/technology/how-to-build-conversa...

But I was also aware of how difficult it could be to get Alexa to understand certain words. See: "Using a glottal stop to force the Amazon Echo to correctly pronounce “tw”"

http://www.smashcompany.com/technology/using-a-glottal-stop-...

But, you may recall, I became frustrated with the process for getting into the Amazon store, and we had a very long thread on HackerNews about that:

https://news.ycombinator.com/item?id=10876409

I'm sad to say there has been almost no improvement over the last year. The problems that I listed in "Amazon has absolutely no idea how to run an app store" are still mostly true.

And I know many, many developers who were excited about Alexa a year ago, but gave up in frustration because we felt like Amazon was not listening to our concerns.

In retrospect, I was wrong for 2 reasons:

1.) many people pointed out that voice was a terrible interface for a variety of purposes. At the time, I thought this was merely a matter of offering the right voice prompts, and getting better at understanding what people said. But I now think the criticisms had much more merit than I admitted a year ago.

2.) Amazon's behavior has been disappointing. I'd like to see them aggressively listening to developers and aggressively improving the service based on developer feedback. That is not happening.


Maybe I'm alone in this but I refuse to have always-on microphones in my home.

I will never be an Alexa customer, the value offering is fundamentally not worthwhile to me.

I really hope development on this isn't at the expense of more traditional interfaces.


Do you have an iPhone (not being snarky, but with voice-activated "Hey Siri...", it's effectively the same thing.)

Google's Pixel phone does the same thing.


From a security standpoint, what's the difference between an always on microphone and a sometimes on microphone?


I've never wanted this. Still don't. Privacy concerns aside, I feel like an idiot talking to a computer and it's way more of a hassle for me than using a touchscreen or keyboard. I just don't like talking in general.

Godspeed for those who enjoy this, but I hope regular touchscreen or keyboard input isn't going anywhere.


I've been using Alexa and Google Home for about a month, and I have to say that the user experience with Google Home is hands down better. Having the power of Google search when Google Home doesn't know an answer is huge. Alexa falls flat without a strong search engine backing it.


Still not sure why anyone would have an open mic, connected to the internet by closed software always on in their house.


I sell my Bluetooth/BLE garage door remote on Amazon. For some reason, Amazon is suggesting customers buy an Echo along with my device. To make matters worse, if you search Amazon for "garage door opener that works with Amazon Echo" my product incorrectly appears.

Although I explicitly state that it does not work with Echo, I anticipate many returns and bad reviews after the holidays. I am pretty sure they'll keep the Echo, so it is all good for Amazon.

Ironically the Alexa Skills Kit doesn't even support Bluetooth. Does anyone know when or if it ever will ?


Voice won't be really big until it supports well most of the languages of the world. It's probably only matter of time but there will be entire countries doing text instead of voice because NLU for their language isn't as good as for English.

Then there is the problem of the multiple platforms to build for. We have one Web, two desktops (even if I'm on Ubuntu), two mobile OSes. Are we going to be able to afford developing for many different platforms? Will companies pay for that? Maybe we have to wait that only a couple of platforms are left.

Text to speech


I cannot get past the idea of having an always on, corporate controlled microphone in my house. I am not interested in this. I admit it is fun to play with in the office or at my friend's house. I know that my iphone gives away my position, search queries, and other personal data (same for laptop), but I turn them off.

Maybe I have just seen too many movies about Stasi (East German Secret Police), Big Brother, Brave New world, etc., and that ruins it for me.


I don't know that Alexa is in any better of a position than anyone else even given their market share.

All you need is a microphone and access to any of many voice-recognition APIs to build similar functionality. You can build something similar on any phone, laptop, or desktop computer without asking your potential customers to spend a dime.

I think the big question is really how big of a market exists for voice control. I don't personally see much of one. Everybody thought it was neat to open netflix up on their xbox/ps4 with voice commands for a day or two, but then most people primarily shifted to using the more reliable controllers. Any question I want answered, I can simply "OK Google..." and have the option of correcting with a keystroke.

I can see the potential for a market to be made, but I also see the history. Remember dragon naturally speaking? When you finally got it working great it wasn't actually an improvement in workflow for most people. How many people prefer navigating phone menus via IVR consistently?

Voice control is a solvable problem and that makes it attractive for development, but solutions haven't historically been particularly marketable beyond the novelty stage.


It should never purely be this or that. I think a combination of both is the best place to be. If I want to watch some specific episode or Youtube video, I generally pull out my phone. If I just want any music, I go "Ok Google, play some music on my TV" as that's faster in that case.

If I'm in the kitchen cooking, I'll just go "Ok google, add butter to the shopping list, but if I'm on my computer, I'll open Keep and type it there.

Both methods have their advantages, and the awful experiences come out when a user tries to do all of their tasks with either or.

As for who is ahead, I strongly disagree with this article. Yes, Amazon had a 2 year start, but Google has had far more years of voice recognition and AI experience. They also have infrastructure for all sorts of things including maps/navigation, translation, search/knowledge graph and just machine learning in general.

With Apple, Samsung and Microsoft entering in the digital assistant world too, it'll definitely be a huge competition in the years to come, and the 2 years head start aside, I think Amazon is actually the least well situated for the challenges ahead.


This article appears to be in the realm of paid for the news. Only google and apple are in position to succeed in this area because they tend to own most important device that is always with you and has a microphone. Another set of player could be those who have immensely popular apps and personal data about you and has SDKs widely used by coders around the world. Mostly Facebook.

I see Microsoft succeeding in enterprise area but Amazon is nowhere.


Until voice interaction becomes on par with, and about as useful as, the computer interactions we see in Star Trek: I'm not interested.

The last think I need is a device that gives me access to that subset of life that a given vendor supports (stores, searches, people, etc) or that requires some sort of "accommodation" to get to work properly. A very, very high bar that may be reached with the help of today's early adopters.... but not with my help. I spend too fucking many hours a day as it is dealing with "good enough" technology. Getting into arguments with a box in the middle of the room?! All I can say is get off my lawn, Alexa!

As for who will win? Seems that in many of the consumer device battles Amazon has a way of being so exclusive to their products that I wouldn't bet on them being the long term winner even if they do have the lead. Consumers want a lot of Amazon stuff, but that's far from making them the cultists they'd have to become to not be bothered by the Amazon ecosystem (.... and I mean consumers, AWS is a different matter in the marketplace).


I use Alexa nearly every day. Once I understood the limitations with the interface I was hooked. Once you have one Alexa and maybe an Echo dot you really aren't going to want to switch platforms. It also is consistently supported for Smart Home devices. I think that you could still push Amazon off the platform because its not as entrenched as others but Amazon does have a wide lead compared to google.


Both the predictions made in this title are wrong.


The thing about trigger words and context understanding is that there is no need to integrate with Alexa. You can simply use a different trigger word, and all will be well.

For many products that might even be desirable behavior. Lights ! Versus "Alexa turn on the lights". Plus it means no integration necessary, which won't exist for many products anyway.

Also these are very cheap algorithms to run. A trigger word implementation is a one-word vocabulary GRU RNN. Does something like 2000 multiplications every second. Easily in microcontroller reach, for power budgets measured in milliamps. Actual word recognition perhaps isn't but the microcontroller can just constantly record, and when it sees a trigger send it on. So you could have 20 projects in the house, all listening to your voice, and it wouldn't really affect the electricity bill.


So far I think "OK Google" is noticeably more powerful than Alexa or Siri. Let's hope for much competition!


I agree with your assessment, based on my limited experience. I'm a fan of Alexa, I even bought a couple as gifts within the last year, but last weekend I saw a bilingual 30 year-old (with a PhD from UChicago) struggling to get it to play a song. He had a song playing on his mobile phone and wanted Alexa to either continue playing that same song or play any song by that artist. It was a bit comical to watch; OK G had no problem understanding him.

It looks like voice will be one of the next big things, but calling Alexa the winner is a bit premature in my opinion


True. I have two Echoes in my home, and love them, but in using Android I've been very impressed by the syntactic awareness and the power of OK Google. Siri is overall underwhelming and definitely at the back of the herd at present.


I am Scottish and Voice is bloody useless. Funny listening to people here trying to use voice. They go from their usual, into some weird "Queens English" mode trying to speak like they are from Windsor to get the bloody thing to half ass respond. I personally just look outside but each to their own.


I don't think Alexa is that great. I have one, and it's pretty bad at voice recognition compared to my phone. Google voice recognition on my phone gets what I'm saying basically 100% of the time. I have to constantly repeat myself for Alexa (and I'm a native English speaker).


Having built an Alexa Skill (a very enjoyable experience) - I find Voice UI/UX to be a very interesting vector in computing with lots of potential to grow.

My colleagues and I built our skill for the elderly to use. We found it was easier for them to interact using their voice rather than a touchscreen.

I think as computers get better at understanding language and verbal communication, we will be able to use screens less - which is great.

I think it's kind of primitive to have to constantly look at a screen to do things - at the very least it makes sense to cut out screen time if and whenever it's possible.

Alexa is like a terminal that you can talk to. Give a simple command in a particular way, and it will reliably do it.

It's an exciting development that Alexa can now be applied outside of the Echo.


LOL - yeah the other day when i was using Alexa, while I have to admit i enjoyed it, I decided that I just do not believe Amazon as a company has the data needed to create a truly immersive and useful solution.

I have hopes for Google Home (which i'm yet to try), but i actually think this will end up a two horse race between Apple and Google. I would completely leave Apple out of this, but Homekit is a pretty powerful concept (if done properly) given the number of iDevices out there.


No. Google will trample them underfoot. Amazon can't compete with google's ai talent. Maluuba is another one to watch in this space though, very interesting research.


" The Alexa-enabled Echo is a true unicorn, one of those rare products that arrives every few years and fundamentally changes the way we live. In 2017, we will start to see that change."

Given this is all about one company, I conclude it's a Sub. The unicorn of technology? It is very difficult to guess this. One other thing, no mention of accent. Can you imagine how difficult it is to create a voice platform that recognises anything but a generic accent?


Yes, let's combine open office plans with voice interfaces! Especially when entering account numbers, passwords, or email addresses to mistresses. Best interface ever!


Are any of these platforms 100% offline/self-contained? Can they speak any of the regular industrial/home automation protocols? Cause that would be awesome.


Use Alexa professionally and although we can do some fun stuff with it, the interaction model is totally unnatural - invocation words only work if really short, dealing with multiple devices of the same sort (eg fireplace) without any location or contextual knowledge.

They've tried to get around this awkwardness by special casing home automation type devices (thermostat, etc.), but it's still only a stop gap.


We need a free software (GNU project?) solution that has everything open. The hardware, the software, anyone can contribute modules, etc. it's so obvious



Microsoft Cortana achieved parity with human voice recognition for continuous speech input.

Google has the best machine learning along with MS, and they have billions of data points.

Amazon has neither comparable tech nor the innovation or customer data. Putting an Echo in every home doesn't matter when its dumb. Does Alexa understand context? Can it get even close to Cortana/Assistant? Lets not mention Siri, that's stone age.


The quality of the AI is going to dicate who wins, and quality AI is going to be built on data (quality, quantity, and breadth). And so I have to think that Google has a huge unfair advantage over anyone else. I have Google Home and I'' surprised how good the voice recognition is AND how good it interprets this. I'm still betting that Google will be the first $1T market cap company.


I find the thesis of this article interesting especially in contrast to some of the criticism Zuckerburg brings up in https://news.ycombinator.com/item?id=13212976

Even if Amazon continues to put its eggs in the voice-as-primary-interface bucket, it seems like it will be unlikely for Facebook to go that route.


Not sure I would call this "voice." Us telecom people hear the word "voice" and think it's a telecom term. Perhaps this could be better titled as "AI" since I consider Alexa and other voice controlled interactions more AI than anything else. Maybe I'm wrong because I'm not in that industry, but I definitely don't think it's "voice."


Google still has the best voice recognition IMO. Apple frequently gets my dictations incorrect while Google seems to understand the nuances of my voice correctly. I haven't used Amazon, but I imagine their AI is not as far along as Google. Amazon will only be the next big platform through marketing and distribution, not technical superiority.


For whatever it's worth, Siri is much, _much_ better at understanding my accent (Scottish, but mild) than Alexa is. We've got an echo at home, and I find it painful to get her to do pretty much anything other than ask for tube status updates. The TV manages to set her off every few hours, though.

Doesn't feel like Amazon have much of an advantage to me.


Even with privacy concerns, voice input is the future: so much easier than getting you cellphone.

We have an echo, bought when Amazon gave a huge discount to developers, and I bought one for family members for their home. I sort-of trust that no data leaves the house except for a ten second audio recording made after the device hears "Alexa". If Amazon collected more than this then they would ruin this business.

I usually use voice input for sending texts and short emails, assuming I am not in such a public place that I would inconvenience people near me.

I have a hierarchy of device use in the following order of preference: First voice, then input on a cellphone, then iPad, then laptop.

I have looked into writing my own mobile app for voice recognition and maintaining privacy that way, but even with my experience, it would be a ton of effort and would miss the context that google, amazon, and Microsoft have.

Edit: I also meant to say that my hope is that Apple will be the vendor I eventually use for voice interfaces because their business model is to sell expensive devices, not collect personal data. The new Siri enables headphones look effective, but I haven't bought a pair yet.


People find bugs in software all the time. It's not just about what the Echo devs intended, it's also about what they didn't intend but that happened anyway.


I agree Brian. Hackers finding exploits in Echo, etc. is a danger.


https://www.amazon.com/Language-Complexity-Game-Artificial-I... argues that humans deciphering speech is NP-hard.. so.. good luck.


I hope these devices don't become common in homes. It would be very hard to explain or even decline a friend's or a colleague's invitation to their home, "just because" I don't like our conversations being recorded by these devices.


It's interesting that the Loebner prize (passing the Turing test in written form) has a cap of 25k and 100k but Amazon has a challenge called the Alexa prize where you essentially pass a voice Turing test for 20 Minutes that awards 2.5 million.


Voice is not even close to the next big platform. Thats not possible. Is it the interface to the next big platform. No, at least, probably not.

"Ok, google glass, zoom in on that hot guy!".

Voice, a great interface for SOME things. Likely not what this article is pitching.


An always on device that hears every conversation on your house. Owned by a corporation who treats their employees so bad that some commit suicide.

No thanks.


Voice is the Next Big Fad. Check back in 10 years.


Returned my echo while back. Did a few tests between Alexa and Google on my phone. Google one everytime.


Voice will NEVER be the big platform until we solve Artificial General Intelligence.


Doesn't need to be solved to be useful. I use my echo all the time, even if it's limited. Just as long as its useful. I much rather say 'alexa turn off the christmas lights' than crawl underneath my TV area to unplug the tree from the power strip.


Until these guys get in my car, their voice platform is dead to me.



I already use android auto. I was referring to Amazon's system.


I love my Alexa, its the only female in my life that will predictable do what I ask of it.

My wife, daughter and two dogs seem to think they have lives of their own for some strange reason. I work from home and probably use it 20 times a day.

I recently discovered it can play the "white noise" channel from Spotify when my teenager's are loud late in the evening. Along with a sleep timer to turn it off in an hour and I am set.

Now I agree discovery of what it can do is challenging at times. And sometimes she is downright dense. Thank the gods for cloud services!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: