character-sets   50

After Seven Years, Microsoft Is Finally Fixing the "J" Email Bug
True story: when I started at Amazon, I thought people were using "J" instead of smileys as shorthand for "joking". Great job Microsoft!

(via Tony Finch)
microsoft  fail  operating-systems  monoculture  character-sets  j  wingdings  exchange  email 
may 2017 by jm
A Programmer’s Introduction to Unicode – Nathan Reed’s coding blog
Fascinating Unicode details -- a lot of which were new to me. Love the heat map of usage in Wikipedia:
One more interesting way to visualize the codespace is to look at the distribution of usage—in other words, how often each code point is actually used in real-world texts. Below is a heat map of planes 0–2 based on a large sample of text from Wikipedia and Twitter (all languages). Frequency increases from black (never seen) through red and yellow to white.

You can see that the vast majority of this text sample lies in the BMP, with only scattered usage of code points from planes 1–2. The biggest exception is emoji, which show up here as the several bright squares in the bottom row of plane 1.
unicode  coding  character-sets  wikipedia  bmp  emoji  twitter  languages  characters  heat-maps  dataviz 
march 2017 by jm
Booking.com, MySQL and UTF-8
good preso from Percona Live 2015 on the messiness of MySQL vs UTF-8 and utf8mb4
utf-8  utf8mb4  mysql  storage  databases  slides  booking.com  character-sets 
december 2016 by jm
Unicode and character sets for developers
Minimum every software developer must know about unicode and character sets
unicode  development  character-sets  programming 
august 2012 by buymeasoda
Javascript, character sets and external files - James; yet another blogging developer
Someone else having something like a problem we've seen. Only we only get it in IE6, and he gets it everwhere.
javascript  utf-8  character-sets  encoding  web-development 
november 2008 by devilgate

related tags

8859-soup  accents  ajax  ascii  automatic-coding-detection  bmp  booking.com  cdp  character-charts  character-encoding  character-encodings  character  characters  chinese  coding  configuration  conscript  czyborra  databases  dataviz  development  diacritics  email  emoji  encoding  exchange  fail  fictional-character-sets  fonts  graphemics  guides  han-characters  heat-maps  htaccess  html  http  i18n  iconv  j  javascript  joel-spolsky  joelspolsky  l10n  langauges  languages  microsoft  monoculture  multilingual  mysql  opensource  operating-systems  perl  php  praphemes  programming  quality  quicktime  ruby  slides  software-development  storage  tools  twitter  type-design  unicode  utf-8  utf8  utf8mb4  utrac  web-development  web  webdev-reference  webdev-tools  wikipedia  wingdings  wordpress  writing-systems  xemacs  xhtml  xml 

Copy this bookmark:



description:


tags: