Unicode Tutorials - Herong's Tutorial Notes
Dr. Herong Yang, Version 5.00

JIS X0208 Character Set and Encodings

This chapter provides notes and tutorial examples on JIS X0208 character set and encodings. Topics including introduction of the JIS X0208 character set; EUC-JP, ISO-2022-JP and Shift-JIS encodings.

JIS X0208 Character Set for Japanese Characters

JIS X0208 Character Code Values

EUC-JP Encoding

ISO-2022-JP Encoding

Shift-JIS Encoding

Conclusions:

  • JIS X0208 is a character set for Japanese characters.
  • JIS X0208 arranges characters into a matrix of 94 rows and 94 columns.
  • The code value of a JIS X0208 character is a 2-byte value: the high byte is the row number plus 32 and low byte is the column number plus 32.
  • EUC-JP is an encoding that maps a JIS X0208 character to a 2-byte sequence by adding 128 to each byte of the byte pair of the character's code value.
  • ISO-2022-JP is an encoding that maps a JIS X0208 character to a 2-byte sequence by taking both bytes directly from the character's code value. Escape sequences are used to mix ASCII characters with JIS X0208 characters.
  • Shift-JIS is an encoding developed by Microsoft for JIS X0208 characters.

Dr. Herong Yang, updated in 2009
JIS X0208 Character Set and Encodings