This chapter provides notes and tutorial examples on JIS X0208 character set and encodings. Topics including introduction of the JIS X0208 character set; EUC-JP, ISO-2022-JP and Shift-JIS encodings.
JIS X0208 is a character set for Japanese characters.
JIS X0208 arranges characters into a matrix of 94 rows and 94 columns.
The code value of a JIS X0208 character is a 2-byte value: the high byte is the row number plus 32 and
low byte is the column number plus 32.
EUC-JP is an encoding that maps a JIS X0208 character to a 2-byte sequence by adding 128 to each byte
of the byte pair of the character's code value.
ISO-2022-JP is an encoding that maps a JIS X0208 character to a 2-byte sequence by taking both bytes
directly from the character's code value. Escape sequences are used to mix ASCII characters with JIS X0208 characters.
Shift-JIS is an encoding developed by Microsoft for JIS X0208 characters.