Character encoding

Thanks to the so-prolific « Simon Willison »:http://simon.incutio.com/ who « pointed »:http://simon.incutio.com/archive/2003/10/13/practicalUnicode to some of the following:

* Tim Bray (« ongoing »:http://www.tbray.org/ongoing/) wrote a lot on the subject. « Here »:http://www.tbray.org/ongoing/What/Technology/Coding/Text/ is the main point of entry. Check « On the Goodness of Unicode »:http://www.tbray.org/ongoing/When/200x/2003/04/06/Unicode (6 Apr. 03). Update: « UTF-8 names »:http://www.tbray.org/ongoing/When/200x/2003/10/17/UTF8-plus (8 oct. 03) and « the RFC »:http://www.tbray.org/tag/utf-8+names.html {← »Mark’s b-links »:http://diveintomark.org/}
* Dan Sugalski (« Squawks of the Parrot »:http://www.sidhe.org/~dan/blog/) in an essay called « What the heck is: A string »:http://www.sidhe.org/~dan/blog/archives/000255.html (11 Oct. 03)
* Joel Spolsky (« Joel on software »:http://www.joelonsoftware.com/index.html) wrote « The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) »:http://www.joelonsoftware.com/articles/Unicode.html Nonetheless! (8 Oct. 03).

Simon’s conclusion: « they all lack one critical aspect: practical advice (…) the web needs practical advice on developing Unicode enabled web pages and web applications. Is it just a case of ensuring my text editor is « saving as Unicode »? What about storage – can I throw Unicode at MySQL and expect it to come out again? If I serve a page up with Japanese characters in it, what will my users have to do to be able to read them? It’s a big, confusing world out there. »