I would have thought they'd need to 'ground' (https://en.wikipedia.org/wiki/Symbol_grounding_problem) at least one shared reference word or concept, so that everything else could be defined relative to it. That would let them communicate effectively and know how to map different languages into the same latent space. But I guess one can infer or approximate the mapping statistically: https://cis.upenn.edu/~ccb/publications/discriminative-bilin...