Hacker News new | past | comments | ask | show | jobs | submit login
Medical wordlists in English, French and Ukrainian (github.com/theophpo)
63 points by ouzb64ty on April 22, 2023 | hide | past | favorite | 19 comments



Curiously, despite having over 3000 entries, the medications list doesn't contain "propranolol", one of the world's most commonly prescribed medications. On the plus side, there is a lot here and it's a good effort, so I should probably submit a PR :) If OP is here, consider submitting it to https://github.com/gmelodie/awesome-wordlists also.


SPARQL queries are not yet compliant, the most important is in the wordlist.en.txt file, the goal is not to classify by word types but by SPARQL queries.

propranolol at line 134169 in en/wordlist.en.txt :-)


>one of the world's most commonly prescribed medications

TIL. First time seeing this word


Propranolol is becoming a first line treatment for anxiety in many situations now where they want to prevent benzodiazepine misuse, but I also know folks who've been given them for migraines, stage fright, fear of flying, and all sorts. They tend to work reasonably well if your symptoms of anxiety are mainly physical rather than mental (where beta blockers have zero effect at all).


I'm not a doctor, but I work closely with doctors in medical devices. This word list seems of dubious quality; I wouldn't train anything on it personally.


That's really nice but kind of annoying I couldn't import the downloaded raw english word list into office. It gave me an error that it isn't unicode so I had to convert it.

If anyone else is trying the easiest option is open your custom.dic in \users\username\AppData\Roaming\Microsoft\UProof\ and copy paste in in something like notepad++ that handles encoding properly.

I also marked lines with '(', ',', and ' ' to really trim this down to something reasonable due to the amount of duplication (104004 lines with an missing drug I added but don't want to shill for). I don't need 100+ variations on cataracts of types of macular degeneration for spell checking purposes.


Took a brief look, the Ukrainian ones seem fine. These aren't limited to terms related to humans, I had found an animal disease among other things.


If you are wanting lists of medical words in English I would recommend taking a look at the National Library of Medicine's Medical Subject Headings (Mesh) thesaurus. Other useful tools include RxNorm (related to US pharmaceuticals) or Snowmed-Ct which is a medical ontology for multiple languages.

I've really benifited from others mentioning resources like these here on HN so I thought I would share.


The fact that first thing in surgical procedures is "penis enlargement" says a lot


Not sure "matière fécale" is an anatomical structure...


And in the "diseases wordlist", you have "chute des cheveux" and... "voyeuriste" (https://github.com/theophpo/medical-wordlist/blob/main/fr/di...). So, yeah, I didn't look any further but these word lists make no sense at all.

edit: and "chiasse" is also a disease it would seem (L480).


Couldn't "chute des cheveux" be a symptom?


Today I learned about the Wikidata Query Service; pretty awesome


They forgot to add the word `quone`!


Edit: Disregard what I just said! I shouldn’t comment right after waking up :)


In this case it's not a country code but a language code (ISO 639-1). And Ukrainian language code is in fact UK. So dear author, you can revert your last commit ;-)

https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes


> UK is the ISO code for the country.

Erm, since when ?

ISO3166 quite clearly says: UNITED KINGDOM GB and UKRAINE UA

UK is "exceptionally reserved" on request of the United Kingdom, but it is not the official abbreviation, which is GB.

See the https://www.iso.org/iso-3166-country-codes.html and follow the link to "Online Browsing Platform".


Per ISO 639-1 (language codes) it's UK for Ukrainian language

https://www.loc.gov/standards/iso639-2/php/langcodes_name.ph...


Oupss, sorry ! Thank's




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: