Examples of GB18030 Encoding
<< Character Encoding in Java
<< Unicode Tutorials - Herong's Tutorial Notes
This section provides examples of encoded byte sequences of GB18030 encoding, which is designed to encode Chinese characters.
Let's continue to play with the testing program, EncodingSampler.java, provided in the previous section with GB18030 encoding:
GB18030 encoding: Char, String, Writer, Charset, Encoder 0000, 00, 00, 00, 00 003F, 3F, 3F, 3F, 3F 0040, 40, 40, 40, 40 007F, 7F, 7F, 7F, 7F 0080, 81 30 81 30, 81 30 81 30, 81 30 81 30, 81 30 81 30 00BF, 81 30 86 37, 81 30 86 37, 81 30 86 37, 81 30 86 37 00C0, 81 30 86 38, 81 30 86 38, 81 30 86 38, 81 30 86 38 00FF, 81 30 8B 37, 81 30 8B 37, 81 30 8B 37, 81 30 8B 37 0100, 81 30 8B 38, 81 30 8B 38, 81 30 8B 38, 81 30 8B 38 3FFF, 82 32 A6 36, 82 32 A6 36, 82 32 A6 36, 82 32 A6 36 4000, 82 32 A6 37, 82 32 A6 37, 82 32 A6 37, 82 32 A6 37 7FFF, C2 52, C2 52, C2 52, C2 52 8000, D2 AB, D2 AB, D2 AB, D2 AB BFFF, 83 31 D7 34, 83 31 D7 34, 83 31 D7 34, 83 31 D7 34 C000, 83 31 D7 35, 83 31 D7 35, 83 31 D7 35, 83 31 D7 35 EFFF, 83 38 96 36, 83 38 96 36, 83 38 96 36, 83 38 96 36 F000, 83 38 96 37, 83 38 96 37, 83 38 96 37, 83 38 96 37 FFFF, 84 31 A4 39, 84 31 A4 39, 84 31 A4 39, 84 31 A4 39
It looks complicate.
I think that's enough. You can run the test program with any other supported encodings as an argument yourself.
Sections in This Chapter
What Is Character Encoding?
Supported Character Encodings in JDK 1.4.1
EncodingSampler.java - Testing encode() Methods
Examples of CP1252 and ISO-8859-1 Encodings
Examples of US-ASCII, UTF-8, UTF-16 and UTF-16BE Encodings
Testing decode() Methods