Unicode Tutorials - Herong's Tutorial Notes
Dr. Herong Yang, Version 5.00

Using Microsoft Word as a Unicode Text Editor

This chapter provides notes and tutorial examples on using Microsoft Word as a Unicode text editor. Topics include opening Unicode text files in 3 encodings: UTF-8, UTF-16BE, and UTF-16LE; and saving and opening Unicode text files with the BOM character prepended.

What Is Microsoft Word?

Opening UTF-8 Text Files

Opening UTF-16BE Text Files

Opening UTF-16LE Text Files

Saving Files in "Unicode (UTF-8)" Option

Saving Files in "Unicode (Big-Endian)" Option

Saving Files in Unicode Option

Supported Save and Open File Formats

Conclusions:

  • Word can be used to edit Unicode text files.
  • Word allows you to save Unicode text files in 3 encoding formats: UTF-8, UTF-16 (Big-Endian with BOM), and UTF-16 (Little-Endian with BOM).
  • Word can open Unicode text files in 4 encoding formats with the encoding detected automatically: UTF-8, UTF-8 with BOM, UTF-16 (Big-Endian with BOM), and UTF-16 (Little-Endian with BOM).
  • Word can open Unicode text files in 2 more encoding formats, UTF-16BE and UTF-16LE, if you select the encoding manually.
  • The BOM character is the "ZERO WIDTH NO-BREAK SPACE" character, U+FEFF, in the Unicode character set.
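
The BOM byte sequences behind these conclusions can be checked outside of Word. The following is a minimal Python sketch, not part of the original tutorial: it encodes U+FEFF in the 3 encodings discussed in this chapter and looks for those bytes at the start of some data. The names BOMS and detect_bom are hypothetical helpers introduced only for illustration, and the sketch assumes the data is in one of these 3 encodings (a UTF-32 file, for example, would be misreported).

```python
# Illustrative sketch only: BOMS and detect_bom are hypothetical names.
BOM_CHAR = "\ufeff"  # ZERO WIDTH NO-BREAK SPACE, used as the BOM

# Encoding U+FEFF in each encoding yields that encoding's BOM bytes.
BOMS = {
    "utf-8": BOM_CHAR.encode("utf-8"),          # EF BB BF
    "utf-16-be": BOM_CHAR.encode("utf-16-be"),  # FE FF
    "utf-16-le": BOM_CHAR.encode("utf-16-le"),  # FF FE
}

def detect_bom(data: bytes):
    """Return the encoding implied by a leading BOM, or None if absent.

    Assumes only UTF-8 and UTF-16 are possible; UTF-32 is not handled.
    """
    for name, bom in BOMS.items():
        if data.startswith(bom):
            return name
    return None

if __name__ == "__main__":
    for name, bom in BOMS.items():
        print(name, bom.hex(" ").upper())
    # A BOM-prefixed sample is recognized; a BOM-less one is not.
    with_bom = (BOM_CHAR + "Hello").encode("utf-16-be")
    without_bom = "Hello".encode("utf-16-le")
    print(detect_bom(with_bom))
    print(detect_bom(without_bom))
```

When detect_bom returns None, the data has no BOM and the encoding must be supplied some other way, which corresponds to the case above where Word needs you to select UTF-16BE or UTF-16LE manually.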

Dr. Herong Yang, updated in 2009