Hacker News new | past | comments | ask | show | jobs | submit login

I think it would be more accurate to say that UTF-8 is a Unicode Transformation Format which by its name is logically distinct from Unicode itself. There are good reasons to store and process Unicode in UTF-8 format internally in many cases, but UTF-32 / UCS-4 would probably take over for internal processing if it weren't for memory usage and efficiency issues.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: