Chinese Text Encoding Conversion and Corruptions
This chapter provides tutorial notes on Chinese text encoding conversion and corruption. Topics include detecting the system default encoding, root causes of corrupted Chinese text, corrupted Chinese file names after unzipping, generating 8-bit encoding tables, and restoring corrupted Chinese text.
Detect System Default Encoding
Root Cause of Corrupted Chinese Text
Corrupted Chinese File Name with Un-ZIP
Generate 8-Bit Encoding Tables
Restore Corrupted Chinese Text
- Run a script that generates Chinese text in different encodings to test the system default encoding.
- Corrupted Chinese text is usually caused by decoding encoded Chinese text with the wrong encoding.
- Chinese file names get corrupted when ZIP files are extracted on a system whose default encoding differs from that of the system that created them.
- Restoring corrupted Chinese text requires guessing which encoding was last applied to the text.
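The corruption and restoration cycle described in the points above can be illustrated with a minimal Python sketch. This is not the chapter's own script. The function names (`corrupt`, `restore`, `guess_restorations`) and the candidate encoding list are assumptions made for illustration only. The sketch encodes text correctly, decodes the bytes with the wrong encoding to produce garbled text, and then reverses that last step to recover the original.

```python
# Minimal sketch: how wrong decoding corrupts Chinese text and how
# reversing the last (wrong) decoding step can restore it.
# All function names here are hypothetical, chosen for illustration.

def corrupt(text, real_encoding, wrong_encoding):
    """Encode text with its real encoding, then decode the bytes wrongly."""
    return text.encode(real_encoding).decode(wrong_encoding)


def restore(garbled, wrong_encoding, real_encoding):
    """Undo corrupt(): get the original bytes back, then decode them correctly."""
    return garbled.encode(wrong_encoding).decode(real_encoding)


def guess_restorations(garbled, candidates):
    """Try every (wrong, real) encoding pair; keep those that convert cleanly.

    A clean conversion is not proof of a correct guess, so the caller
    still has to inspect the candidate strings.
    """
    results = []
    for wrong in candidates:
        for real in candidates:
            if wrong == real:
                continue
            try:
                results.append((wrong, real, restore(garbled, wrong, real)))
            except (UnicodeEncodeError, UnicodeDecodeError):
                continue  # this pair cannot explain the garbled text
    return results


if __name__ == "__main__":
    # GBK bytes for "中文" read as Latin-1 give the classic garbled form.
    garbled = corrupt("中文", "gbk", "latin-1")
    print(garbled)  # ÖÐÎÄ
    print(restore(garbled, "latin-1", "gbk"))  # 中文

    # When the wrong encoding is unknown, list every plausible restoration.
    for wrong, real, text in guess_restorations(
        garbled, ["latin-1", "gbk", "utf-8", "big5"]
    ):
        print(wrong, "->", real, ":", text)
```

Restoration only works when the wrong decoding step lost no bytes. Latin-1 maps every byte value to a character, so it is always reversible. Encodings that reject or replace invalid bytes may destroy information that cannot be recovered afterward.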