Building Chinese Web Sites using PHP
Dr. Herong Yang, Version 2.11

Chinese Web Pages with GB18030 Encoding

This section describes how to create a Chinese HTML document in GB18030 encoding and publish it on the Apache server.

Chinese Web pages should use UTF-8 encoding. But for some reason, if you have to use GB18030 encoding for your Chinese Web pages. you should enter Chinese characters with GB18030 encoding and set the "charset" attribute to "gb18030". Here is a simple test I did on my local system:

1. Run my Chinese text editor that supports GB18030 encoding.

2. Enter the following HTML document:

<html>
<!-- Hello-GB18030.html
   Copyright (c) 2007 by Dr. Herong Yang, http://www.herongyang.com/
-->
<meta http-equiv="Content-Type" content="text/html; charset=gb18030"/>
<body>
<b>Chinese characters in GB18030</b><br/>
Simplified characters: ??????<br/>
</body>
</html>

You see some question marks (?) in the source code listed above, because this book uses UTF-8 encoding. GB18030 encoded characters can not be included here.

3. Save the as Hello-GB18030.html in GB18030 encoding. On my Chinese text editor, I had to select "GB text file" as the "Save as type" to ensure my document was saved in GB18030 encoding. Like many other Chinese text editors, it supports multiple encodings. If you are not careful, the document could be saved with a wrong encoding.

4. Copy Hello-GB18030.html to \local\apache\htdocs.

5. Now run Internet Explorer (IE) with http://localhost/Hello-GB18030.html. You should see Chinese characters displayed correctly:
Chinese Web Page using GB18030

This proves that my Chinese editor, the Web server: Apache, and the Web browser: IE, all worked correctly with Chinese characters in GB18030 encoding.

Note that GB18030 encoding is an extended version of GB2312 encoding, which is still supported by many applications.

Sections in This Chapter

Chinese Character Set Encoding Options

HTML Document Travel Path

Chinese Web Pages with UTF-8 Encoding

Chinese Web Pages with GB18030 Encoding

Chinese Web Pages with Big5 Encoding

UTF-8 Encoding Pages with GB18030 Characters

Dr. Herong Yang, updated in 2007
Chinese Web Pages with GB18030 Encoding