He mentions some of the even harder problems to solve that are specific to video pass through mainly, the problem of having to focus exclusively on a screen closed to your eyes all the time which gets tiring fast.
Virtual retinal displays will be able to fix that; they display an image directly on the retina, using lasers, and will eventually be able to have a range of focal depths in a single scene.