Very good talk (really nice and useful slides) about #Unicode and the bad implementation of it in, unfortunately, many programming languages (specially the enterprise ones) In most languages, strings of characters are unfortunately not string of characters. http://www.jnthn.net/papers/2015-spw-nfg.pdf