List of Supported Character Encodings in Java
This section provides a list of character encodings supported in Java. The list is generated using the static availableCharsets() method of the java.nio.charset.Charset class.
2015-11-27, 14372👍, 1💬

💬 2015-11-27 nawed ahmed khan: Thanks☺
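
As a rough illustration of the approach named in the entry above, the sketch below calls Charset.availableCharsets() and prints each canonical name with its aliases. The class name CharsetListSketch and the helper supported() are made up for this example, and the output depends on the JVM in use.

```java
import java.nio.charset.Charset;
import java.util.SortedMap;

class CharsetListSketch {
    // Returns the charsets this JVM reports as available, keyed by canonical name.
    static SortedMap<String, Charset> supported() {
        return Charset.availableCharsets();
    }

    public static void main(String[] args) {
        for (Charset cs : supported().values()) {
            // aliases() lists the alternative names accepted by Charset.forName().
            System.out.println(cs.name() + " " + cs.aliases());
        }
    }
}
```

The exact list varies between Java versions and installations, so the output should be read as "what this runtime supports" rather than a fixed catalog.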

UTF-16LE Encoding
This section provides a quick introduction to the UTF-16LE (Unicode Transformation Format - 16-bit Little Endian) encoding for the Unicode character set. UTF-16LE is a variation of UTF-16.
2019-08-16, 8506👍, 7💬

💬 2019-07-23 ching: chong

💬 2019-04-28 Herong: The last part is needed. Thanks.

💬 2019-04-21 小胖: In the last paragraph, is the phrase "the ZERO WIDTH NO-BREAK SPACE, U+FEFF, character" really necessary?

💬 2018-12-29 test: thank

💬 2016-12-21 task go: thank you
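
To make the "little endian" part of the UTF-16LE entry above concrete, here is a minimal sketch that writes each 16-bit code unit low byte first. The class and method names are hypothetical. It relies on the fact that a Java String already holds UTF-16 code units, so surrogate pairs pass through unchanged.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class Utf16LeSketch {
    // Writes each 16-bit code unit of the string as two bytes, low byte first.
    static byte[] encode(String s) {
        byte[] out = new byte[s.length() * 2];
        for (int i = 0; i < s.length(); i++) {
            char unit = s.charAt(i);
            out[2 * i] = (byte) (unit & 0xFF);   // low-order byte
            out[2 * i + 1] = (byte) (unit >> 8); // high-order byte
        }
        return out;
    }

    public static void main(String[] args) {
        String text = "Hi\u4E2D";
        byte[] manual = encode(text);
        byte[] builtIn = text.getBytes(StandardCharsets.UTF_16LE);
        // For well-formed text, both arrays should be identical.
        System.out.println(Arrays.equals(manual, builtIn));
    }
}
```

One difference to keep in mind: the built-in encoder substitutes a replacement for unpaired surrogates, while this sketch copies them through as-is.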

UTF-16, UTF-16BE and UTF-16LE Encodings
This chapter provides notes and tutorial examples on the UTF-16, UTF-16BE and UTF-16LE encodings. Topics include the encoding and decoding logic of UTF-16, UTF-16BE and UTF-16LE; an introduction to surrogate pairs; and an explanation of the use of the BOM (Byte Order Mark).
2019-09-24, 5133👍, 4💬

💬 2019-08-09 asas: Contact your system administrator if you would like more information

💬 2017-04-06 kl: 21 53 93 68 87 65 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 31 00 30 00 31 00 32 00 30 00 30 00 30...
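
The BOM behavior mentioned in the chapter entry above can be observed from Java's standard charsets. The sketch below is only an illustration: the class BomSketch and the helper hex() are invented for it, and it shows how the JDK's UTF-16, UTF-16BE and UTF-16LE charsets behave, not how any particular file or editor does.

```java
import java.nio.charset.StandardCharsets;

class BomSketch {
    // Formats bytes as space-separated uppercase hex pairs.
    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) sb.append(' ');
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Java's "UTF-16" encoder writes a big-endian BOM (FE FF) first.
        System.out.println(hex("A".getBytes(StandardCharsets.UTF_16)));   // FE FF 00 41
        // The BE and LE variants write no BOM.
        System.out.println(hex("A".getBytes(StandardCharsets.UTF_16BE))); // 00 41
        System.out.println(hex("A".getBytes(StandardCharsets.UTF_16LE))); // 41 00

        // When decoding, "UTF-16" reads the BOM to pick the byte order and drops it.
        byte[] leWithBom = {(byte) 0xFF, (byte) 0xFE, 0x41, 0x00};
        System.out.println(new String(leWithBom, StandardCharsets.UTF_16)); // A
        // "UTF-16LE" keeps the BOM as the U+FEFF character in the decoded text.
        System.out.println(new String(leWithBom, StandardCharsets.UTF_16LE).length()); // 2
    }
}
```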

UTF-8 Encoding Algorithm
This section provides a tutorial example on how to write a programming algorithm to encode characters with UTF-8 encoding.
2019-09-27, 3770👍, 3💬

💬 2019-09-27 Andrew Dillon: This was very useful. It really helped clarify the explanation in the Unicode Specification. Thank you!

💬 2016-03-30 Herong: Deniz, you are welcome!

💬 2016-03-24 Deniz Aktas: Simple and clear, thanks a lot !
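
For readers who want to see the shape of such an algorithm, here is a minimal sketch of a UTF-8 encoder for a single code point. It is not the tutorial's own code. The class name Utf8Sketch is made up, and the bit layout follows the standard UTF-8 definition (1 to 4 bytes, surrogates rejected).

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class Utf8Sketch {
    // Encodes one Unicode code point as 1 to 4 UTF-8 bytes.
    static byte[] encode(int cp) {
        if (cp < 0 || cp > 0x10FFFF || (cp >= 0xD800 && cp <= 0xDFFF)) {
            throw new IllegalArgumentException("not a Unicode scalar value: " + cp);
        }
        if (cp < 0x80) {            // 0xxxxxxx
            return new byte[] {(byte) cp};
        }
        if (cp < 0x800) {           // 110xxxxx 10xxxxxx
            return new byte[] {
                (byte) (0xC0 | (cp >> 6)),
                (byte) (0x80 | (cp & 0x3F))};
        }
        if (cp < 0x10000) {         // 1110xxxx 10xxxxxx 10xxxxxx
            return new byte[] {
                (byte) (0xE0 | (cp >> 12)),
                (byte) (0x80 | ((cp >> 6) & 0x3F)),
                (byte) (0x80 | (cp & 0x3F))};
        }
        return new byte[] {         // 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
            (byte) (0xF0 | (cp >> 18)),
            (byte) (0x80 | ((cp >> 12) & 0x3F)),
            (byte) (0x80 | ((cp >> 6) & 0x3F)),
            (byte) (0x80 | (cp & 0x3F))};
    }

    public static void main(String[] args) {
        int euro = 0x20AC;
        byte[] builtIn = new String(Character.toChars(euro)).getBytes(StandardCharsets.UTF_8);
        // The hand-written encoder should agree with the JDK for valid code points.
        System.out.println(Arrays.equals(encode(euro), builtIn));
    }
}
```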

UTF-16BE Encoding
This section provides a quick introduction to the UTF-16BE (Unicode Transformation Format - 16-bit Big Endian) encoding for the Unicode character set. UTF-16BE is a variation of UTF-16.
2019-11-25, 3504👍, 5💬

💬 2019-11-25 Remo Harsono: Very helpful information, thanks

💬 2019-08-09 asas: UTF-16BE

💬 2017-07-14 john wirter: thanks for the info

💬 2017-06-06 potato: Cool

Examples of Unicode Characters
This chapter provides notes and tutorial examples on the Unicode character set. Topics include an introduction to the Unicode standard, example characters, the history of releases, and blocks of code points.
2019-09-18, 3403👍, 7💬

💬 2019-06-24 abcd: It's a rainy night

💬 2017-01-20 anynomous: he ha ha

💬 2017-01-05 danish: that is great

Opening UTF-16LE Text Files
This section provides a tutorial example on how to open a UTF-16LE text file correctly with Notepad by selecting the Unicode encoding option in the Open File dialog box.
2018-09-09, 2006👍, 3💬

💬 2018-02-12 Herong: Krishna, dates should have nothing to do with UTF-16 LE format. Can you provide some examples?

💬 2018-02-08 Krishna: When I open a UTF-16 LE file in Notepad and Notepad++, it shows special characters for the dates. Is it because notepad +...

Printable Copy - PDF Version
Information on how to obtain the PDF version of this book for printing.
2018-02-07, 1802👍, 5💬

💬 2017-01-08 Herong: Rex, see my email to your qq.com account.

💬 2017-01-08 rex: how to download this book

💬 2015-09-09 mani: require more details

Using Microsoft Excel as a Unicode Text Editor
This chapter provides notes and tutorial examples on using Microsoft Excel as a Unicode text editor. Topics include opening text files in 3 encodings: UTF-8, UTF-16BE, and UTF-16LE; and saving and opening Unicode text files in UTF-16 (Little-Endian with BOM) format.
2016-10-03, 1722👍, 1💬

💬 2016-10-03 zoltan: useful

U0600: Arabic
This section provides a quick summary of the Unicode code point block: 'Arabic', which contains 256 code points to represent alphabetic letters used in the Arabic language.
2015-08-07, 1589👍, 1💬

💬 2015-08-07 محمد خالد سعد حسي: From the table we change every word to its Unicode symbol the table from
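
Block sizes like the 256 quoted in the entry above can be cross-checked against the JDK's own block table. The sketch below is an illustration only: blockSize() is a hypothetical helper, it assumes the argument is the first code point of a defined block, and the result reflects the Unicode version bundled with the running JVM.

```java
class BlockSizeSketch {
    // Counts consecutive code points, starting at 'start', that the JDK
    // assigns to the same Unicode block as 'start' (unassigned ones included).
    static int blockSize(int start) {
        Character.UnicodeBlock block = Character.UnicodeBlock.of(start);
        int count = 0;
        for (int cp = start;
             cp <= Character.MAX_CODE_POINT && Character.UnicodeBlock.of(cp) == block;
             cp++) {
            count++;
        }
        return count;
    }

    public static void main(String[] args) {
        System.out.println(Character.UnicodeBlock.of(0x0600)); // ARABIC
        System.out.println(blockSize(0x0600));                 // 256
    }
}
```

Note that this counts positions in the block, not assigned characters, which matches how the block summaries on this page are phrased.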

Saving Files in "Unicode (UTF-8)" Option
This section provides a tutorial example on how to save text files with Notepad by selecting the 'Unicode (UTF-8)' encoding option in the file conversion dialog box.
2018-07-17, 1459👍, 2💬

💬 2018-07-17 Me: Very good, but Unicode (nothing else) worked for me!

💬 2016-10-08 shital p: it's works fine for me thank u

Downloading and Installing GNU Unifont
This section provides a tutorial example on how to download and install the GNU Unifont font family on Windows 7 systems.
2019-10-23, 1418👍, 4💬

💬 2019-10-23 Sajjad Amin: thank you

💬 2019-07-28 dipen: thank you.

💬 2015-12-06 Herong: D, maybe you can start with What Is Unicode?.

💬 2015-12-05 d gayen: i want to understand what is unicode

U2600: Miscellaneous Symbols
This section provides a quick summary of the Unicode code point block: 'Miscellaneous Symbols', which contains 256 code points (U+2600...U+26FF) to represent miscellaneous symbols.
2019-03-02, 1138👍, 2💬

💬 2016-05-11 Neven: Thanks!

UTF-16 Encoding
This section provides a quick introduction to the UTF-16 (Unicode Transformation Format - 16-bit) encoding for the Unicode character set. Paired surrogates are used for characters in the U+10000...U+10FFFF range.
2016-12-28, 1105👍, 1💬

💬 2016-12-28 anson: what a detail of explain. i think i complete understand it. thanks very much.
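
The paired-surrogate rule mentioned in the UTF-16 entry above comes down to a little arithmetic: subtract 0x10000, then split the remaining 20 bits into two 10-bit halves. A minimal sketch, with hypothetical class and method names:

```java
class SurrogateSketch {
    // Splits a supplementary code point (U+10000...U+10FFFF) into a
    // high surrogate (D800...DBFF) and a low surrogate (DC00...DFFF).
    static char[] toSurrogates(int cp) {
        if (cp < 0x10000 || cp > 0x10FFFF) {
            throw new IllegalArgumentException("not a supplementary code point");
        }
        int v = cp - 0x10000;                      // 20-bit value
        char high = (char) (0xD800 + (v >> 10));   // top 10 bits
        char low = (char) (0xDC00 + (v & 0x3FF));  // bottom 10 bits
        return new char[] {high, low};
    }

    public static void main(String[] args) {
        char[] pair = toSurrogates(0x1F600);
        System.out.printf("%04X %04X%n", (int) pair[0], (int) pair[1]); // D83D DE00
    }
}
```

The JDK offers the same split through Character.toChars(int), which is the safer choice in real code.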

UAC00: Hangul Syllables
This section provides a quick summary of the Unicode code point block: 'Hangul Syllables', which contains 11184 code points to represent Hangul syllables used in the Korean language.
2017-02-21, 975👍, 2💬

💬 2017-02-21 Herong: danny, that's true. I think English can also create thousands of syllables.

💬 2017-02-18 danny: Hangul is so great that it can make 11 thousand 172 syllables with only 40 consonants and vowels
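
The 11,172 figure in the comment above follows from the way Unicode composes Hangul syllables: 19 leading consonants, 21 vowels and 28 trailing positions (27 consonants plus "none"). The sketch below uses that standard arithmetic; the class and method names are made up for illustration. The remaining 12 of the block's 11184 code points are unassigned.

```java
class HangulSketch {
    static final int S_BASE = 0xAC00; // first precomposed syllable
    static final int L_COUNT = 19;    // leading consonants
    static final int V_COUNT = 21;    // vowels
    static final int T_COUNT = 28;    // trailing consonants, index 0 = none

    // Maps (leading, vowel, trailing) indexes to a precomposed syllable code point.
    static int compose(int l, int v, int t) {
        return S_BASE + (l * V_COUNT + v) * T_COUNT + t;
    }

    public static void main(String[] args) {
        System.out.println(L_COUNT * V_COUNT * T_COUNT);          // 11172
        System.out.printf("U+%04X%n", compose(18, 0, 4));         // U+D55C
        System.out.printf("U+%04X%n", compose(18, 20, 27));       // U+D7A3, the last syllable
    }
}
```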

Character Encoding in Java
This chapter provides notes and tutorial examples on character encoding in Java. Topics include supported encodings in Java SE 7; using encoding and decoding methods; and examples of encoded byte sequences in various encodings.
2018-01-30, 893👍, 2💬

💬 2018-01-30 Sunny: www.youtube.com

💬 2017-07-14 kev: hi
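
As a small taste of the encoding and decoding methods the chapter entry above refers to, the sketch below round-trips a string through String.getBytes(Charset) and new String(byte[], Charset). It is an illustration, not the chapter's own example. The class name and the roundTrip() helper are hypothetical, and the sample character is written as a \u escape so the result does not depend on the source file's encoding.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class EncodeDecodeSketch {
    // Encodes text to bytes in the given charset, then decodes it back.
    static String roundTrip(String text, Charset cs) {
        byte[] encoded = text.getBytes(cs);
        return new String(encoded, cs);
    }

    public static void main(String[] args) {
        String text = "\u00E9"; // LATIN SMALL LETTER E WITH ACUTE
        System.out.println(text.getBytes(StandardCharsets.ISO_8859_1).length); // 1
        System.out.println(text.getBytes(StandardCharsets.UTF_8).length);      // 2
        System.out.println(text.getBytes(StandardCharsets.UTF_16BE).length);   // 2

        // A charset that cannot represent the character loses it on the way:
        // getBytes() substitutes '?' for unmappable input.
        System.out.println(roundTrip(text, StandardCharsets.US_ASCII));        // ?
        System.out.println(StandardCharsets.US_ASCII.newEncoder().canEncode('\u00E9')); // false
    }
}
```

Checking canEncode() before encoding is one way to detect such silent substitutions.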

GB18030 Character Set and Encoding
This chapter provides notes and tutorial examples on the GB18030 character set and encoding. Topics include the history of GB character sets: GB2312, GB13000.1 (GBK) and GB18030; and the GB18030 encoding scheme.
2016-10-17, 861👍, 1💬

💬 2016-10-17 asd: test

U1EE00: Arabic Mathematical Alphabetic Symbols
This section provides a quick summary of the Unicode code point block: 'Arabic Mathematical Alphabetic Symbols', which contains 256 code points to represent Arabic mathematical alphabetic symbols.
2017-04-01, 628👍, 2💬

💬 2017-03-21 Herong: Anon, can you show us some examples of Arabic mathematical symbols?

💬 2017-03-19 Anon: These aren't Arabic mathematical symbols

UTF-8 Encoding
This section provides a quick introduction to the UTF-8 (Unicode Transformation Format - 8-bit) encoding for the Unicode character set. It uses 1, 2, 3, or 4 bytes for each character.
2019-11-06, 548👍, 2💬

💬 2016-12-30 Rehman: Welcome

U1B00: Balinese
This section provides a quick summary of the Unicode code point block: 'Balinese', which contains 128 code points to represent Balinese alphabets used in the Balinese language.
2017-05-23, 492👍, 2💬

💬 2017-05-23 Herong: Riccio, is that your birthday? :-)

💬 2017-05-17 riccio: August 18, 1983

U2A00: Supplemental Mathematical Operators
This section provides a quick summary of the Unicode code point block: 'Supplemental Mathematical Operators', which contains 256 code points to represent additional mathematical operators.
2016-10-24, 461👍, 1💬

💬 2016-10-24 ramulu: it is useful

Using Microsoft Word as a Unicode Text Editor
This chapter provides notes and tutorial examples on using Microsoft Word as a Unicode text editor. Topics include opening Unicode text files in 3 encodings: UTF-8, UTF-16BE, and UTF-16LE; and saving and opening Unicode text files with the BOM character prepended.
2019-06-01, 443👍, 2💬

💬 2019-06-01 Herong: Adriaan, I can only find the peace symbol: ☮ (U+262E).

💬 2019-05-31 Adriaan: I am a pensioner, physically disabled, wheelchair, writing a book, topic: Revolution vs Evolution. I need about 10 unicode symbo...

Shift-JIS Encoding
This section provides a quick introduction to the Shift-JIS encoding, also called MS Kanji, which maps a JIS X0208 character to a 2-byte sequence using a complicated scheme designed by Microsoft.
2019-07-16, 391👍, 3💬

💬 2019-07-16 Duc: かきくけこ

UAAE0: Meetei Mayek Extensions
This section provides a quick summary of the Unicode code point block: 'Meetei Mayek Extensions', which contains 32 code points to represent additional Meetei Mayek alphabets used in the Meitei language.
2016-07-19, 351👍, 2💬

💬 2016-07-19 Herong: Hi Karen, Meetei Mayek is a set of Unicode code points, not a font. But we do need a font that supports Meetei Mayek code poin...

💬 2016-07-15 Karen: Is the Meetei Mayek script a UTF-8 font? If so, where can I download it? Thank you.
