Opening UTF-8 Text Files

This section provides a tutorial example on how to open a UTF-8 text file with Word correctly by selecting the Unicode (UTF-8) encoding option on the File Conversion dialog box.

Let's try to use Word to open the UTF-8 text file, hello.utf-8, created from the previous chapter first.

1. Run Word and click menu File > Open. The Open file dialog box comes up.

2. Select the hello.utf-8 text file and click the Open button. The File Conversion dialog box comes up automatically. Word detected the encoding to be "Unicode (UTF-8)" and suggests you to use it to read the text file. See the correct text in the preview section in the picture below:

Word Open UTF-8 File
Word Open UTF-8 File

3. Click the OK button. My UTF-8 text file opens in Word correctly.

Very nice. This proves that Word can open UTF-8 text file correctly if the "Unicode (UTF-8)" encoding option is selected.

If you select a different encoding, like Unicode, the UTF-8 text file will be opened incorrectly. Try it out yourself.

Table of Contents

 About This Book

 Character Sets and Encodings

 ASCII Character Set and Encoding

 GB2312 Character Set and Encoding

 GB18030 Character Set and Encoding

 JIS X0208 Character Set and Encodings

 Unicode Character Set

 UTF-8 (Unicode Transformation Format - 8-Bit)

 UTF-16, UTF-16BE and UTF-16LE Encodings

 UTF-32, UTF-32BE and UTF-32LE Encodings

 Java Language and Unicode Characters

 Character Encoding in Java

 Character Set Encoding Maps

 Encoding Conversion Programs for Encoded Text Files

 Using Notepad as a Unicode Text Editor

Using Microsoft Word as a Unicode Text Editor

 What Is Microsoft Word

Opening UTF-8 Text Files

 Opening UTF-16BE Text Files

 Opening UTF-16LE Text Files

 Saving Files in "Unicode (UTF-8)" Option

 Saving Files in "Unicode (Big-Endian)" Option

 Saving Files in Unicode Option

 Supported Save and Open File Formats

 Using Microsoft Excel as a Unicode Text Editor

 Unicode Fonts

 Unicode Code Point Blocks: 0000 - 0FFF

 Unicode Code Point Blocks: 1000 - FFFF

 Unicode Code Point Blocks: 10000 - 11FFF

 Unicode Code Point Blocks: 12000 - 10FFFF

 Outdated Tutorials


 Full Version in PDF/EPUB