Unicode Tutorials - Herong's Tutorial Notes
Dr. Herong Yang, Version 5.00

Character Sets and Encodings

This chapter provides notes and tutorial examples on character sets and encodings. Topics including introduction of character set and encoding; commonly used character sets and encodings.

What Is Character Set?

Commonly Used Character Sets and Encodings

Conclusions:

  • A character set is a collection of characters.
  • A character encoding a schema that maps each character in a character set to a unique sequence of bytes.
  • Unicode is a character set.
  • UTF-8 is a character encoding for the Unicode character set.

Dr. Herong Yang, updated in 2009
Character Sets and Encodings