Hacker News new | past | comments | ask | show | jobs | submit login

Seems interesting! How can it be used with other languages than English?



spaCy only works with English, German, Spanish, Portuguese, French, Italian and Dutch.

FastText for example has pretrained embeddings for 294 languages: https://github.com/facebookresearch/fastText/blob/master/pre...

Google's Parsey McParseface handles POS tagging for 53 languages: https://github.com/tensorflow/models/blob/f87a58cd96d45de73c...


So spacy has support for these languages [1] and wordnet has support for these [2], but neuralcoref (pronoun resolution endpoint) is available only for english.

This current docker image is not exposing those other languages but I can expose them in an update if it helps a lot of people.

[1] https://spacy.io/usage/models [2] http://compling.hss.ntu.edu.sg/omw/


Thanks for the insights. Could you please share the Dockerfile so that one can make the other languages work?



SpaCy models for different languages and how to use them: https://spacy.io/usage/models




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: