I don't think ML can solve the "human" part. Like I said, drivers, bikers and pedestrians look at each other, know what to expect, and silently agree on what to do next, all in an instant. This is a crucial component of driving, especially in cities with dense traffic and often unclear markings and signs, where the human factor matters most.
[The pedestrian was] giving the awkward body language that he was planning on jaywalking. This was a very human interaction: the car was waiting for a further visual cue from the pedestrian to either stop or go, and the pedestrian waiting for a cue from the car. http://theoatmeal.com/blog/google_self_driving_car