Complete treatment of character encodings
Is there a good text or article that talks about encodings, their history & drawbacks, and maybe how many common systems use them? It always scares me to deal with characters, since I feel they're "messy" internally, like all my tools work hard to hide everything from me. And it seems there can be a lot of information loss in conversions.
Developing International Software for Windows 95 and Windows NT : A Handbook for Software Design
Tim Bray has some recent articles that area worth checking out:
fool for python
I find http://unicode.org/ to be an excellent resource. The page on UTF encodings -- http://www.unicode.org/faq/utf_bom.html -- will answer a lot of questions about exactly how characters are encoded, how many bytes they take, etc. And the section called "Code Charts" is just plain fascinating.
"Someday everything will be in Unicode"
Just me (Sir to you)
From my favourites:
Fog Creek Home