There are also some interesting Unicode features in the regexes... \d, for example, matches anything that is a digit in Unicode. What if you only want ASCII though, like for computer languages and maybe security?
EDIT: security as in, no similar-looking but different characters to confuse users etc...
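To make the question concrete, here is a minimal sketch, assuming Python's `re` module (the comment above doesn't say which regex engine is meant, so that choice is mine). In Python 3, `\d` on a `str` pattern is Unicode-aware by default; the `re.ASCII` flag or an explicit `[0-9]` class restricts it to ASCII digits. The name `is_ascii_number` is just a hypothetical helper for illustration.

```python
import re

# Default for str patterns in Python 3: \d matches any Unicode decimal digit.
UNICODE_DIGITS = re.compile(r"\d+")

# re.ASCII restricts \d (and \w, \s, \b) to their ASCII-only meaning.
ASCII_DIGITS = re.compile(r"\d+", re.ASCII)

# An explicit character class is ASCII-only regardless of flags.
EXPLICIT_DIGITS = re.compile(r"[0-9]+")

# ARABIC-INDIC DIGITS ONE, TWO, THREE: digits in Unicode, but not ASCII.
arabic_indic = "\u0661\u0662\u0663"


def is_ascii_number(text: str) -> bool:
    """Hypothetical helper: True only if text is one or more ASCII digits."""
    # fullmatch requires the whole string to match, so a trailing newline
    # or any other extra character is rejected.
    return ASCII_DIGITS.fullmatch(text) is not None
```

With this, `UNICODE_DIGITS.fullmatch(arabic_indic)` matches while `is_ascii_number(arabic_indic)` is false. Other engines may have different defaults and flags, so it's worth checking the docs for whichever one you actually use.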