Unicode Tutorials - Herong's Tutorial Notes
Dr. Herong Yang, Version 5.00

Examples of GB18030 Encoding

This section provides examples of byte sequences produced by the GB18030 encoding, which is designed to encode Chinese characters.

Let's continue to play with the testing program, EncodingSampler.java, provided in the previous section, this time using the GB18030 encoding:

GB18030 encoding:
Char, String, Writer, Charset, Encoder
0000, 00, 00, 00, 00
003F, 3F, 3F, 3F, 3F
0040, 40, 40, 40, 40
007F, 7F, 7F, 7F, 7F
0080, 81 30 81 30, 81 30 81 30, 81 30 81 30, 81 30 81 30
00BF, 81 30 86 37, 81 30 86 37, 81 30 86 37, 81 30 86 37
00C0, 81 30 86 38, 81 30 86 38, 81 30 86 38, 81 30 86 38
00FF, 81 30 8B 37, 81 30 8B 37, 81 30 8B 37, 81 30 8B 37
0100, 81 30 8B 38, 81 30 8B 38, 81 30 8B 38, 81 30 8B 38
3FFF, 82 32 A6 36, 82 32 A6 36, 82 32 A6 36, 82 32 A6 36
4000, 82 32 A6 37, 82 32 A6 37, 82 32 A6 37, 82 32 A6 37
7FFF, C2 52, C2 52, C2 52, C2 52
8000, D2 AB, D2 AB, D2 AB, D2 AB
BFFF, 83 31 D7 34, 83 31 D7 34, 83 31 D7 34, 83 31 D7 34
C000, 83 31 D7 35, 83 31 D7 35, 83 31 D7 35, 83 31 D7 35
EFFF, 83 38 96 36, 83 38 96 36, 83 38 96 36, 83 38 96 36
F000, 83 38 96 37, 83 38 96 37, 83 38 96 37, 83 38 96 37
FFFF, 84 31 A4 39, 84 31 A4 39, 84 31 A4 39, 84 31 A4 39
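
To check a few rows of this output independently, here is a minimal sketch. It is not the EncodingSampler.java program; the class name Gb18030HexDemo and the helper toHex() are made up for illustration. It assumes Java 6 or later and a JDK that ships the GB18030 charset, which is not one of the charsets every Java platform is required to support, so Charset.forName() may throw UnsupportedCharsetException on a stripped-down runtime.

```java
import java.nio.charset.Charset;

public class Gb18030HexDemo {
    // Hypothetical helper: encode text with GB18030 and
    // return the bytes as space-separated uppercase hex.
    static String toHex(String text) {
        byte[] bytes = text.getBytes(Charset.forName("GB18030"));
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Three code points taken from the table above.
        int[] samples = {0x007F, 0x0080, 0x8000};
        for (int cp : samples) {
            String s = String.valueOf((char) cp);
            System.out.println(String.format("%04X, %s", cp, toHex(s)));
        }
    }
}
```

If the charset is available, the three printed lines should match the "String" column of the corresponding rows in the table.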

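The output looks complicated. In these samples, code points up to 007F are encoded as 1 byte, 7FFF and 8000 are encoded as 2 bytes, and all the other code points are encoded as 4 bytes.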
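
The 4-byte sequences in the table are more regular than they look. In a GB18030 4-byte code, the first and third bytes run from 0x81 to 0xFE (126 values) and the second and fourth bytes run from 0x30 to 0x39 (10 values), so each sequence can be read as a mixed-radix counter. The sketch below computes the position of a sequence counted from 81 30 81 30. The class and method names are made up for illustration, and it only does arithmetic on the bytes; it does not map positions back to Unicode code points, which requires a range table.

```java
public class Gb18030FourByteIndex {
    // Position of a 4-byte sequence, counted from 81 30 81 30 (position 0).
    // Bytes 1 and 3 have 126 possible values, bytes 2 and 4 have 10.
    static int linearIndex(int b1, int b2, int b3, int b4) {
        return (((b1 - 0x81) * 10 + (b2 - 0x30)) * 126 + (b3 - 0x81)) * 10
                + (b4 - 0x30);
    }

    public static void main(String[] args) {
        // Byte values copied from the rows for 0080, 00BF and 00C0.
        System.out.println(linearIndex(0x81, 0x30, 0x81, 0x30)); // 0
        System.out.println(linearIndex(0x81, 0x30, 0x86, 0x37)); // 57
        System.out.println(linearIndex(0x81, 0x30, 0x86, 0x38)); // 58
    }
}
```

Note that 00BF and 00C0 land on consecutive positions (57 and 58), while 0080 and 00BF are 63 code points apart but only 57 positions apart. That is consistent with a few code points in that range having 2-byte codes and therefore not occupying a 4-byte position.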

I think that's enough. You can run the test program yourself with any other supported encoding as its argument.
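
Before trying another encoding name, you may want to check that your JDK supports it. The following is a small sketch using Charset.isSupported(); the class name CharsetCheck and the helper isUsable() are made up for illustration and are not part of the test program.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;

public class CharsetCheck {
    // Returns true if this JVM can provide a charset with the given name.
    // A syntactically illegal name is treated as "not usable".
    static boolean isUsable(String name) {
        try {
            return Charset.isSupported(name);
        } catch (IllegalCharsetNameException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String[] names = {"US-ASCII", "UTF-8", "GB18030", "NO-SUCH-ENCODING"};
        for (String name : names) {
            System.out.println(name + ": " + isUsable(name));
        }
    }
}
```

US-ASCII and UTF-8 are supported on every Java platform. Whether GB18030 is reported as supported depends on the JDK installation.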

Sections in This Chapter

What Is Character Encoding?

Supported Character Encodings in JDK 1.4.1

EncodingSampler.java - Testing encode() Methods

Examples of CP1252 and ISO-8859-1 Encodings

Examples of US-ASCII, UTF-8, UTF-16 and UTF-16BE Encodings

Examples of GB18030 Encoding

Testing decode() Methods

Dr. Herong Yang, updated in 2009