Unicode Tutorials - Herong's Tutorial Examples
∟Using Microsoft Word as a Unicode Text Editor
This chapter provides notes and tutorial examples on using Microsoft Word as a Unicode text editor. Topics including opening Unicode text files in 3 encodings: UTF-8, UTF-16BE, and UTF-16LE; saving and opening Unicode text files with the BOM character prepended.
What Is Microsoft Word
Opening UTF-8 Text Files
Opening UTF-16BE Text Files
Opening UTF-16LE Text Files
Saving Files in "Unicode (UTF-8)" Option
Saving Files in "Unicode (Big-Endian)" Option
Saving Files in Unicode Option
Supported Save and Open File Formats
Conclusions:
- Word can be used to edit Unicode text files.
- Word allows you to save Unicode text file in 3 encoding formats,
UTF-8, UTF-16 (Big-Endian with BOM), and UTF-16 (Little-Endian with BOM).
- Word can open Unicode text files in 4 encoding formats with encoding automatically detected,
UTF-8, UTF-16 (Big-Endian with BOM), UTF-16 (Little-Endian with BOM), and
UTF-8 with BOM.
- Word can open Unicode text files in 2 encoding formats with your help to select the encoding manually,
UTF-16BE and UTF-16LE.
- The BOM character is the "ZERO WIDTH NO-BREAK SPACE" character, U+FEFF, in the Unicode character set.
Table of Contents
About This Book
Character Sets and Encodings
ASCII Character Set and Encoding
GB2312 Character Set and Encoding
GB18030 Character Set and Encoding
JIS X0208 Character Set and Encodings
Unicode Character Set
UTF-8 (Unicode Transformation Format - 8-Bit)
UTF-16, UTF-16BE and UTF-16LE Encodings
UTF-32, UTF-32BE and UTF-32LE Encodings
Python Language and Unicode Characters
Java Language and Unicode Characters
Character Encoding in Java
Character Set Encoding Maps
Encoding Conversion Programs for Encoded Text Files
Using Notepad as a Unicode Text Editor
►Using Microsoft Word as a Unicode Text Editor
Using Microsoft Excel as a Unicode Text Editor
Unicode Fonts
Archived Tutorials
References
Full Version in PDF/EPUB