This was an almost all-English collection of emails. I'd be very interested to see whether the trend is similar in other languages. If you want to try it on your own mail, the accompanying code for the blog post should be able to parse mbox, Maildir and notmuch databases of emails: https://github.com/dannyob/lengthysubject
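For anyone who just wants a rough idea of the measurement before cloning the repository, here is a minimal sketch using only Python's standard `mailbox` module. It is not the code from the repository above; the helper names `open_mailbox` and `subject_lengths` are made up for illustration, and it covers only mbox and Maildir (notmuch needs its own bindings, which are left out here).

```python
import mailbox
from email.header import decode_header, make_header


def open_mailbox(path, kind="mbox"):
    """Open an existing mailbox. `kind` is a hypothetical switch: 'mbox' or 'maildir'."""
    if kind == "mbox":
        return mailbox.mbox(path, create=False)
    if kind == "maildir":
        return mailbox.Maildir(path, create=False)
    raise ValueError(f"unsupported mailbox kind: {kind!r}")


def subject_lengths(box):
    """Yield the length, in characters, of each message's decoded Subject header."""
    for msg in box:
        raw = msg.get("Subject", "")
        try:
            # Decode RFC 2047 encoded words (e.g. =?utf-8?q?...?=) so that
            # non-English subjects are measured in characters, not in
            # bytes of their encoded form.
            subject = str(make_header(decode_header(raw)))
        except Exception:
            # Fall back to the raw header if it is malformed.
            subject = str(raw)
        yield len(subject)
```

Something like `list(subject_lengths(open_mailbox("/path/to/archive.mbox")))` would then give one length per message. Decoding the header first matters for the other-languages question, since an encoded subject is much longer than the text it represents.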