Unicode Tutorials - Herong's Tutorial Notes
Dr. Herong Yang, Version 5.00

GB2312 Character Set and Encoding

This chapter provides notes and tutorial examples on GB2312 character set and encoding. Topics including introduction of the GB2312 character set for simplified Chinese characters; GB2312 encoding for the GB2312 character set; Relation of GB2312 and Unicode.

GB2312 Character Set for Chinese Characters

GB2312 Encoding for GB2312 Character Set

Relation of GB2312 and Unicode

Conclusions:

  • GB2312 character set contains 7445 simplified Chinese characters, numbers and symbols.
  • GB2312 character set arranges characters into a matrix of 94 rows and 94 columns.
  • GB2312 encoding uses 2 bytes: the high byte is the row number plus 158 and the low byte is the column number plus 158.
  • GB2312 character set is a sub set of Unicode character set.

Dr. Herong Yang, updated in 2009
GB2312 Character Set and Encoding