UTF-8 Encoding Algorithm
This section provides a tutorial example on how to write an algorithm that encodes characters in UTF-8.
2019-09-27, 3373👍, 3💬

💬 2019-09-27 Andrew Dillon: This was very useful. It really helped clarify the explanation in the Unicode Specification. Thank you!

💬 2016-03-30 Herong: Deniz, you are welcome!

💬 2016-03-24 Deniz Aktas: Simple and clear, thanks a lot !
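
As a rough illustration of the kind of algorithm this entry describes, here is a minimal Python sketch of UTF-8 encoding for a single code point. The function name `utf8_encode` is hypothetical and is not taken from the tutorial; the bit layout follows the standard UTF-8 definition (1 to 4 bytes per code point).

```python
def utf8_encode(code_point: int) -> bytes:
    """Encode one Unicode code point as UTF-8 (illustrative sketch)."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("code point out of Unicode range")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogate code points cannot be encoded in UTF-8")
    if code_point <= 0x7F:
        # 1 byte: 0xxxxxxx
        return bytes([code_point])
    if code_point <= 0x7FF:
        # 2 bytes: 110xxxxx 10xxxxxx
        return bytes([0xC0 | (code_point >> 6),
                      0x80 | (code_point & 0x3F)])
    if code_point <= 0xFFFF:
        # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([0xE0 | (code_point >> 12),
                      0x80 | ((code_point >> 6) & 0x3F),
                      0x80 | (code_point & 0x3F)])
    # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([0xF0 | (code_point >> 18),
                  0x80 | ((code_point >> 12) & 0x3F),
                  0x80 | ((code_point >> 6) & 0x3F),
                  0x80 | (code_point & 0x3F)])


print(utf8_encode(0x20AC).hex())  # e282ac (the euro sign)
```

The result can be cross-checked against Python's built-in codec, for example `chr(0x20AC).encode("utf-8")`.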

UTF-16, UTF-16BE and UTF-16LE Encodings
This chapter provides notes and tutorial examples on the UTF-16, UTF-16BE and UTF-16LE encodings. Topics include the encoding and decoding logic of UTF-16, UTF-16BE and UTF-16LE; an introduction to surrogate pairs; and an explanation of the use of the BOM (Byte Order Mark).
2019-09-24, 4769👍, 4💬

💬 2019-08-09 asas: Contact your system administrator if you would like more information

💬 2017-04-06 kl: 21 53 93 68 87 65 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 31 00 30 00 31 00 32 00 30 00 30 00 30...

Examples of Unicode Characters
This chapter provides notes and tutorial examples on the Unicode character set. Topics include an introduction to the Unicode standard, example characters, the history of releases, and blocks of code points.
2019-09-18, 3230👍, 7💬

💬 2019-06-24 abcd: It's a rainy night

💬 2017-01-20 anynomous: he ha ha

💬 2017-01-05 danish: that is great

UTF-16LE Encoding
This section provides a quick introduction to the UTF-16LE (Unicode Transformation Format - 16-bit Little Endian) encoding for the Unicode character set. UTF-16LE is a variation of UTF-16.
2019-08-16, 7864👍, 7💬

💬 2019-07-23 ching: chong

💬 2019-04-28 Herong: The last part is needed. Thanks.

💬 2019-04-21 小胖: In the last paragraph, is "the ZERO WIDTH NO-BREAK SPACE, U+FEFF, character" really necessary?

💬 2018-12-29 test: thank

💬 2016-12-21 task go: thank you
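
To make the byte-order difference and the U+FEFF question raised in the comments above concrete, here is a small Python sketch. It relies only on Python's built-in `utf-16-le`, `utf-16-be` and `utf-16` codecs; the helper `hex_bytes` and the sample string are assumptions made for illustration, not part of the tutorial.

```python
def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated uppercase hex pairs."""
    return " ".join(f"{b:02X}" for b in data)


text = "A\u20ac"  # LATIN CAPITAL LETTER A followed by EURO SIGN

utf16_le = text.encode("utf-16-le")  # low-order byte first
utf16_be = text.encode("utf-16-be")  # high-order byte first
print(hex_bytes(utf16_le))  # 41 00 AC 20
print(hex_bytes(utf16_be))  # 00 41 20 AC

# U+FEFF encoded in little-endian order gives the familiar FF FE prefix.
bom_le = "\ufeff".encode("utf-16-le")
print(hex_bytes(bom_le))  # FF FE

with_bom = bom_le + utf16_le
# The generic "utf-16" codec reads the prefix as a BOM and drops it.
decoded_generic = with_bom.decode("utf-16")
# The explicit "utf-16-le" codec keeps U+FEFF as an ordinary character.
decoded_le = with_bom.decode("utf-16-le")
print(len(decoded_generic), len(decoded_le))  # 2 3
```

The last two lines show why the wording matters: under an explicit UTF-16LE label, a leading U+FEFF is data (ZERO WIDTH NO-BREAK SPACE), not a byte order mark.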

UTF-16BE Encoding
This section provides a quick introduction to the UTF-16BE (Unicode Transformation Format - 16-bit Big Endian) encoding for the Unicode character set. UTF-16BE is a variation of UTF-16.
2019-08-09, 3329👍, 4💬

💬 2019-08-09 asas: UTF-16BE

💬 2017-07-14 john wirter: thnanks foro hthe inofos

💬 2017-06-06 potato: Cool

Downloading and Installing GNU Unifont
A tutorial example is provided on how to download and install the GNU Unifont font family on Windows 7 systems.
2019-07-28, 1285👍, 3💬

💬 2019-07-28 dipen: thank you.

💬 2015-12-06 Herong: D, maybe you can start with What Is Unicode?.

💬 2015-12-05 d gayen: i want to understand what is unicode

Shift-JIS Encoding
This section provides a quick introduction to the Shift-JIS encoding, also called MS Kanji, which maps a JIS X0208 character to a 2-byte sequence using a complicated scheme designed by Microsoft.
2019-07-16, 343👍, 3💬

💬 2019-07-16 Duc: かきくけこ

UTF-32LE Encoding
This section provides a quick introduction to the UTF-32LE (Unicode Transformation Format - 32-bit Little Endian) encoding for the Unicode character set.
2019-06-21, 185👍, 3💬

💬 2019-06-21 ggg: test
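
As a brief illustration of what "little endian" means for UTF-32LE, the sketch below writes a code point as a 4-byte integer with the least significant byte first. The function name `utf32le_encode` is hypothetical; the cross-check uses Python's built-in `utf-32-le` codec.

```python
def utf32le_encode(code_point: int) -> bytes:
    """Encode one code point as 4 bytes, least significant byte first."""
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not a valid Unicode scalar value")
    return code_point.to_bytes(4, "little")


print(utf32le_encode(0x1F600).hex())  # 00f60100
print(utf32le_encode(0x41).hex())     # 41000000
```

Passing `"big"` instead of `"little"` to `to_bytes` would give the UTF-32BE byte order.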

U1DC0: Combining Diacritical Marks Supplement
This section provides a quick summary of the Unicode code point block: 'Combining Diacritical Marks Supplement', which contains 64 code points to represent additional diacritical marks used in conjunction with preceding letters.
2019-06-17, 166👍, 2💬

💬 2019-06-16 Herong: Glyn, Try GNU Unifont. Regular fonts do not support all glyphs in the "Combining Diacritical Marks Supplement" Unicode block. F...

💬 2019-06-11 Glyn Davies: Which font(s) contain all 63/4 glyphs in this range?

Using Microsoft Word as a Unicode Text Editor
This chapter provides notes and tutorial examples on using Microsoft Word as a Unicode text editor. Topics include opening Unicode text files in 3 encodings (UTF-8, UTF-16BE, and UTF-16LE), and saving and opening Unicode text files with the BOM character prepended.
2019-06-01, 344👍, 2💬

💬 2019-06-01 Herong: Adriaan, I can only find the peace symbol: ☮ (U+262E).

💬 2019-05-31 Adriaan: I am a pensioner, physically disabled, wheelchair, writing a book, topic: Revolution vs Evolution. I need about 10 unicode symbo...

"Character" Class with Unicode Utility Methods
This section provides an introduction to the static methods added to the 'Character' class since J2SE 5.0 as Unicode utility methods.
2019-04-20, 137👍, 2💬

💬 2019-04-20 Herong: Isie, it will be corrected in the next update. Thanks.

💬 2019-04-17 lsie: There is a typo in "static boolean isDefined(int codePoint)" paragraph. "tt has an entry in the UnicodeData file", "tt" should b...

U2600: Miscellaneous Symbols
This section provides a quick summary of the Unicode code point block: 'Miscellaneous Symbols', which spans U+2600 to U+26FF (256 code points) and represents miscellaneous symbols.
2019-03-02, 1088👍, 2💬

💬 2016-05-11 Neven: Thanks!

Opening UTF-16LE Text Files
This section provides a tutorial example on how to open a UTF-16LE text file correctly with Notepad by selecting the Unicode encoding option in the open file dialog box.
2018-09-09, 1778👍, 3💬

💬 2018-02-12 Herong: Krishna, dates should have nothing to do with UTF-16 LE format. Can you provide some examples?

💬 2018-02-08 Krishna: When I open a UTF-16 LE file notepad and and notepad ++ ,it is showing special characters for the dates. Is it because notepad +...

Saving Files in "Unicode (UTF-8)" Option
This section provides a tutorial example on how to save text files with Notepad by selecting the 'Unicode (UTF-8)' encoding option in the file conversion dialog box.
2018-07-17, 1400👍, 2💬

💬 2018-07-17 Me: Very good, but Unicode (nothing else) worked for me!

💬 2016-10-08 shital p: it's works fine for me thank u

What Is ASCII?
This section provides a quick introduction to the ASCII (American Standard Code for Information Interchange) character set and encoding.
2018-06-15, 278👍, 1💬

💬 2018-06-15 z: Thanks

Printable Copy - PDF Version
Information on how to obtain the PDF version of this book for printing.
2018-02-07, 1781👍, 5💬

💬 2017-01-08 Herong: Rex, see my email to your qq.com account.

💬 2017-01-08 rex: how to download this book

💬 2015-09-09 mani: require more details

Character Encoding in Java
This chapter provides notes and tutorial examples on character encoding in Java. Topics include supported encodings in Java SE 7; using encoding and decoding methods; and examples of encoded byte sequences in various encodings.
2018-01-30, 857👍, 2💬

💬 2018-01-30 Sunny: www.youtube.com

💬 2017-07-14 kev: hi

U0180: Latin Extended-B
This section provides a quick summary of the Unicode code point block: 'Latin Extended-B', which contains 208 code points to represent historical Latin language characters.
2017-08-29, 312👍, 1💬

💬 2017-08-29 Bravo: haha

U1B00: Balinese
This section provides a quick summary of the Unicode code point block: 'Balinese', which contains 128 code points to represent the Balinese script used to write the Balinese language.
2017-05-23, 472👍, 2💬

💬 2017-05-23 Herong: Riccio, is that your birthday? :-)

💬 2017-05-17 riccio: August 18, 1983

U1EE00: Arabic Mathematical Alphabetic Symbols
This section provides a quick summary of the Unicode code point block: 'Arabic Mathematical Alphabetic Symbols', which contains 256 code points to represent Arabic mathematical alphabetic symbols.
2017-04-01, 603👍, 2💬

💬 2017-03-21 Herong: Anon, can you show us some examples of Arabic mathematical symbols?

💬 2017-03-19 Anon: These aren't Arabic mathematical symbols

UAC00: Hangul Syllables
This section provides a quick summary of the Unicode code point block: 'Hangul Syllables', which contains 11184 code points to represent Hangul syllables used in the Korean language.
2017-02-21, 943👍, 2💬

💬 2017-02-21 Herong: danny, that's true. I think English can also create thousands of syllables.

💬 2017-02-18 danny: Hangul is so great that it can make 11 thousand 172 syllables with only 40 consonants and vowels
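
The 11,172 figure in the comment above can be checked with a little arithmetic: the standard Unicode composition scheme combines 19 leading consonants, 21 vowels and 28 trailing choices (27 trailing consonants plus "none"), and 19 × 21 × 28 = 11,172. The Hangul Syllables block reserves 11,184 code points, of which these 11,172 are assigned. A minimal Python sketch, with hypothetical names and index values that follow the Unicode jamo ordering:

```python
HANGUL_BASE = 0xAC00   # first precomposed syllable
LEADING_COUNT = 19     # initial consonants
VOWEL_COUNT = 21       # medial vowels
TRAILING_COUNT = 28    # 27 final consonants plus "no final"


def compose_syllable(leading: int, vowel: int, trailing: int = 0) -> str:
    """Build a precomposed Hangul syllable from zero-based jamo indexes."""
    if not (0 <= leading < LEADING_COUNT
            and 0 <= vowel < VOWEL_COUNT
            and 0 <= trailing < TRAILING_COUNT):
        raise ValueError("jamo index out of range")
    offset = (leading * VOWEL_COUNT + vowel) * TRAILING_COUNT + trailing
    return chr(HANGUL_BASE + offset)


total = LEADING_COUNT * VOWEL_COUNT * TRAILING_COUNT
print(total)                                # 11172
print(hex(ord(compose_syllable(18, 0, 4))))  # 0xd55c
```

The second print line composes the syllable at U+D55C from the indexes 18, 0 and 4.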

UTF-8 Encoding
This section provides a quick introduction to the UTF-8 (Unicode Transformation Format - 8-bit) encoding for the Unicode character set. It uses 1, 2, 3, or 4 bytes for each character.
2016-12-30, 505👍, 1💬

💬 2016-12-30 Rehman: Welcome

UTF-16 Encoding
This section provides a quick introduction to the UTF-16 (Unicode Transformation Format - 16-bit) encoding for the Unicode character set. Paired surrogates are used for characters in the U+10000...U+10FFFF range.
2016-12-28, 1042👍, 1💬

💬 2016-12-28 anson: what a detail of explain. i think i complete understand it. thanks very much.
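
The surrogate-pair rule mentioned in this entry can be sketched in a few lines of Python. The function name `to_surrogate_pair` is hypothetical; the arithmetic is the standard UTF-16 mapping for code points in the U+10000...U+10FFFF range.

```python
def to_surrogate_pair(code_point: int) -> tuple[int, int]:
    """Split a supplementary code point into (high, low) surrogates."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("surrogate pairs only cover U+10000...U+10FFFF")
    offset = code_point - 0x10000      # a 20-bit value
    high = 0xD800 + (offset >> 10)     # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)    # bottom 10 bits
    return high, low


high, low = to_surrogate_pair(0x1F600)
print(f"{high:04X} {low:04X}")  # D83D DE00
```

Code points below U+10000 need no pair: they are stored as a single 16-bit unit.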

U2A00: Supplemental Mathematical Operators
This section provides a quick summary of the Unicode code point block: 'Supplemental Mathematical Operators', which contains 256 code points to represent additional mathematical operators.
2016-10-24, 447👍, 1💬

💬 2016-10-24 ramulu: it is useful
