Herong's Tutorial Notes on Unicode
Dr. Herong Yang, Version 4.02

A newer version of this book is located at http://www.herongyang.com/Unicode/. You will be automatically redirected to this new location. Thank you for visiting! -- Herong

Table of Contents

About This Book

Character Sets and Encodings

  • Definitions
  • Commonly Used Character Sets and Encodings

GB2312 Character Set and Encodings

  • GB2312 Character Set
  • EUC-CN Encoding

GB18030 Character Set and Encoding

  • History of GB Character Set
  • GB18030-2000 Encoding

JIS X0208 Character Set and Encodings

  • JIS X0208 Character Set
  • EUC-JP Encoding
  • ISO-2022-JP Encoding
  • Shift-JIS Encoding

Unicode Character Set

  • What is Unicode

Unicode Transformation Formats (UTF)

  • UTF-8 Encoding
  • UTF-16LE Encoding
  • UTF-16BE Encoding

JDK - Character Set and Encoding

  • What is a Character Encoding
  • Supported Character Encodings
  • Methods to Encode Characters
  • Methods to Decode Byte Sequences

JDK - Encoding Maps

  • Encoding Map Analyzer
  • US-ASCII
  • ISO-8859-1 - Latin 1
  • CP1252 - Windows-1252
  • UTF-8
  • UTF-16
  • UTF-16LE
  • UTF-16BE

JDK - Encoding Map Counts

  • Encoding Map Counter
  • Comparison of Encoding Maps

JDK - Encoding Conversion

  • Unicode Data Entry
  • Unicode Encoding Conversion
  • Viewing Unicode Text
  • Unicode Signs in Different Encodings

References

Key Words: ASCII, book, character set, encoding, example, GB, GB18030, GB2312, GBK, ISO-8859, JDK, JIS, notes, online, tutorial, UTF8, Unicode

Dr. Herong Yang, updated in 2007