Character Encodings and Unicode

Why text turns into garbled symbols, how UTF-8 actually works, and the difference between bytes, code points, and the characters a user sees.

  1. Bytes Are Not Characters
  2. How UTF-8 Actually Works
  3. When Text Lies: Emoji, Graphemes, and the BOM