Unicode Tutorials - Herong's Tutorial Notes
Dr. Herong Yang, Version 5.00

GB18030 Character Set and Encoding

This chapter provides notes and tutorial examples on GB18030 character set and encoding. Topics including history of GB character sets: GB2312, GB1300.1 (GBK) and GB18030; GB18030 encoding schema.

History of GB Character Sets

GB18030 Encoding for GB18030 Character Set

Conclusions:

  • GBK (GB1300.1) is a super set of GB2312 with 21886 characters.
  • GB18030 is a super set of GBK with 70244 characters.
  • GB18030 character set is compatible with Unicode 3.0 character set.
  • GB18030 encoding uses one, two or four bytes to encode a character.

Dr. Herong Yang, updated in 2009
GB18030 Character Set and Encoding