Big5 Tutorials - Herong's Tutorial Examples - v4.05, by Herong Yang
Big5 vs. Unicode
The Big5 character set is a subset of the Unicode character set. However, the Big5 code of a Chinese character is mathematically unrelated to the Unicode code point of the same character.
Because the Big5 character set is a subset of the Unicode character set, every character defined in Big5 is also defined in Unicode.
However, Big5 codes and Unicode codes are numerically unrelated. For example, the Big5 character 仍, with a code value of 0xA4B4, has a Unicode code point of 0x4ECD. There is no mathematical formula to convert a Big5 code to the Unicode code point of the same character, so conversion has to go through a lookup table.
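The lookup described above can be sketched in Java by letting the runtime's own Big5 conversion table do the work. This is a minimal sketch, not the Big5Unicode.java program listed later in the book: the class name Big5ToUnicodeSketch and the method big5ToCodePoint are hypothetical, and it assumes the Java runtime ships a charset registered under the name "Big5", which the Java specification does not require.

```java
import java.nio.charset.Charset;

class Big5ToUnicodeSketch {
    // Hypothetical helper: decodes one two-byte Big5 code and returns
    // the Unicode code point of the resulting character.
    // Assumes the runtime provides a "Big5" charset; Charset.forName
    // throws UnsupportedCharsetException if it does not.
    static int big5ToCodePoint(int big5Code) {
        byte[] bytes = { (byte) (big5Code >> 8), (byte) big5Code };
        // The decoder consults the JDK's built-in mapping table;
        // no arithmetic on the code value is involved.
        String decoded = new String(bytes, Charset.forName("Big5"));
        return decoded.codePointAt(0);
    }

    public static void main(String[] args) {
        int big5Code = 0xA4B4; // the example code from the text above
        System.out.printf("Big5 0x%04X -> U+%04X%n",
                big5Code, big5ToCodePoint(big5Code));
    }
}
```

Note that a byte pair with no Big5 mapping is decoded to the replacement character U+FFFD rather than raising an error, so a caller that needs strict validation would have to check for that value.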
To help you convert Big5 codes to Unicode codes, a complete map of all Big5 codes and their corresponding Unicode code points is provided later in the book. The map also lists the corresponding UTF-8 (Unicode Transformation Format - 8-bit) byte sequence for each character.
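Unlike the Big5 mapping, the UTF-8 code of a character can be computed directly from its Unicode code point. The following sketch shows one way to obtain it in Java using only standard library calls. The class name Utf8BytesSketch and the method utf8Hex are hypothetical names chosen for illustration.

```java
import java.nio.charset.StandardCharsets;

class Utf8BytesSketch {
    // Hypothetical helper: returns the UTF-8 encoding of a Unicode
    // code point as a hexadecimal string such as "0xE4BB8D".
    static String utf8Hex(int codePoint) {
        // Character.toChars handles code points above U+FFFF as well.
        String text = new String(Character.toChars(codePoint));
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        StringBuilder hex = new StringBuilder("0x");
        for (byte b : utf8) {
            // Mask to 0..255 so negative bytes format as two hex digits.
            hex.append(String.format("%02X", b & 0xFF));
        }
        return hex.toString();
    }

    public static void main(String[] args) {
        int codePoint = 0x4ECD; // the example character from the text above
        System.out.printf("U+%04X -> UTF-8 %s%n", codePoint, utf8Hex(codePoint));
    }
}
```

For the example code point 0x4ECD this yields the three-byte sequence 0xE4BB8D.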
Table of Contents
Big5Unicode.java - Big5 to Unicode Mapping
Big5 to Unicode Mapping - Special Symbols
Big5 to Unicode Mapping - Level 1 Characters
Big5 to Unicode Mapping - Level 2 Characters
UnicodeBig5.java - Unicode to Big5 Mapping
Unicode to Big5 Mapping - All 13,461 Characters