|
Tutorial Books by Herong - References by Others
Part:
1
2
3
4
5
6
7
8
9
I am so happy to see that my tutorial books have been referenced by many articles and books on the Internet.
I am listing few examples here. -- Herong
Click here to see the newer references.
Re: Yunnan 6FTM Geography map
Subject: Re: Yunnan 6FTM Geography map
Date: Nov 28, 2005
Source: USENET - rec.food.drink.tea
Author: Space Cowboy
Oh great, I run the GB2312 url through Google or Babelfish and get
the meaningless translation of the 1st mountain as "The leather
ascends the Cha shan". In the past a GB2312 font set would have been
a Stop Sign since what I really want are the Unicode chinese character
values for the name of the mountain which in this case is four GB2312
Chinese characters representing leather/ascends/cha/shan. With a
little guessing I cut and paste the corresponding mountain name GB2312
string ¸ï/µÇ/²è/ɽ where I use / to demarcate the two GB2312
bytes representing each chinese character for illustrative purposes.
You take a hex viewer and determine the string in hex codes is
B8EF/B5C7/B2E8/C9BD. Then you take each hex string and do a GB2312 to
UNICODE lookup using the mapping table at:
http://www.herongyang.com/gb2312/ug_map.html
I'll leave it as an exercise to determine the 3 remaining Unicode
characters beside the tea character (cha) GB2312 B2E8 which is UNICODE
8336 (Hint: Table 22) where you use the Unicode value to see the
character graphically or Web browser font glyph (copy/paste) if you
have the appropriate Unicode character set loaded:
http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=8336
Jim
PS: If you have Java or DotNet loaded then it is simple to convert
from any native language set to Unicode using the international
language support routines. In this case the Table mapping was
generated by a Java program. Web browsers Javascript only support
Unicode. I do this for the one person (Lew) who has shown any
interest in the past on the subject.
Japanese Chinese tea web sites
Subject: Japanese Chinese tea web sites
Date: Oct 26, 2005
Source: USENET - rec.food.drink.tea
Author: Space Cowboy
Here is an interesting site for GB2312 to UNICODE conversion
http://www.herongyang.com/gb2312/ I found yesterday. As I previously
suspected it is a mapping and not a mathematical routine even though
the table was generated by a Java program with a bunch of but-ifs.
I didn't see anything right off the bat that would prevent Javascript
from doing the same thing mathematically as Java. The table says
B2E8 is the GB value for the Unicode value 8336 and not 1872 as
mentioned in Unihan. I can tell I'm going to have some fun. Also
if I had DOTNET loaded then there is a simple routine to indicate the
language pack such as GB2312 and give the corresponding Unicode
value. The charCodeAt routine in Javascript is just a Unicode
character to Unicode hex representation. The two byte hexview of a
Unicode character is not the same as the result of the charCodeAt
conversion. Some things are flipped around in the way the Unicode
char is stored on disk. In a file the Unicode tea character is stored
as 36383. Notepad will store the Unicode tea character as four bytes
with the first two characters high order FFFE.
Jim
(Continued on next part...)
Part:
1
2
3
4
5
6
7
8
9
|