
List of Supported Character Encodings in Java
This section lists the character encodings supported in Java. The list is generated using the availableCharsets() static method of the java.nio.charset.Charset class.
2015-11-27, 8274👍, 1💬

💬 2015-11-27 nawed ahmed khan: Thanks☺
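
A minimal sketch of how such a list might be generated is shown below. The class name CharsetList and the helper names() are illustrative assumptions, not taken from the book, and the exact output depends on the JVM in use.

```java
import java.nio.charset.Charset;
import java.util.ArrayList;
import java.util.List;
import java.util.SortedMap;

class CharsetList {
    // Returns the canonical names of all charsets available in this JVM.
    static List<String> names() {
        SortedMap<String, Charset> charsets = Charset.availableCharsets();
        return new ArrayList<>(charsets.keySet());
    }

    public static void main(String[] args) {
        // Print each canonical name followed by its aliases.
        for (Charset cs : Charset.availableCharsets().values()) {
            System.out.println(cs.name() + " " + cs.aliases());
        }
    }
}
```

Every Java platform is required to support at least US-ASCII, ISO-8859-1, UTF-8, UTF-16BE, UTF-16LE and UTF-16, so those names should always appear in the output.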

UTF-16LE Encoding
This section provides a quick introduction to the UTF-16LE (Unicode Transformation Format - 16-bit Little Endian) encoding for the Unicode character set. UTF-16LE is a variation of UTF-16.
2017-04-08, 3618👍, 5💬

💬 2017-04-08 Tj: Wow

💬 2016-12-21 task go: thank you

UTF-16, UTF-16BE and UTF-16LE Encodings
This chapter provides notes and tutorial examples on UTF-16, UTF-16BE and UTF-16LE encodings. Topics include the encoding and decoding logic of UTF-16, UTF-16BE and UTF-16LE; an introduction to surrogate pairs; and an explanation of the use of the BOM (Byte Order Mark).
2017-04-06, 2010👍, 2💬

💬 2017-04-06 kl: 21 53 93 68 87 65 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 31 00 30 00 31 00 32 00 30 00 30 00 30...
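
To illustrate how the three variants differ in byte order and BOM usage, here is a small sketch that encodes the same two-character string with Java's built-in charsets. The class name Utf16Variants and the hex() helper are illustrative assumptions.

```java
import java.nio.charset.StandardCharsets;

class Utf16Variants {
    // Formats bytes as space-separated uppercase hex, e.g. "FE FF 00 41".
    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) sb.append(' ');
            sb.append(String.format("%02X", b));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String s = "A\u20AC"; // 'A' (U+0041) followed by the euro sign (U+20AC)
        // Big endian, no BOM: 00 41 20 AC
        System.out.println("UTF-16BE: " + hex(s.getBytes(StandardCharsets.UTF_16BE)));
        // Little endian, no BOM: 41 00 AC 20
        System.out.println("UTF-16LE: " + hex(s.getBytes(StandardCharsets.UTF_16LE)));
        // Java's UTF-16 encoder writes a big-endian BOM first: FE FF 00 41 20 AC
        System.out.println("UTF-16:   " + hex(s.getBytes(StandardCharsets.UTF_16)));
    }
}
```

Note that the BOM behavior shown for "UTF-16" is specific to Java's encoder; other tools may write a little-endian BOM instead.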

UTF-16BE Encoding
This section provides a quick introduction to the UTF-16BE (Unicode Transformation Format - 16-bit Big Endian) encoding for the Unicode character set. UTF-16BE is a variation of UTF-16.
2017-07-14, 1878👍, 3💬

💬 2017-07-14 john wirter: thanks for the info

💬 2017-06-06 potato: Cool

UTF-8 Encoding Algorithm
This section provides a tutorial example on how to write an algorithm that encodes characters in UTF-8.
2016-03-30, 1716👍, 2💬

💬 2016-03-30 Herong: Deniz, you are welcome!

💬 2016-03-24 Deniz Aktas: Simple and clear, thanks a lot !
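
As a rough sketch of what such an algorithm can look like (not necessarily the version given in the book), the following method encodes a single code point by choosing the byte count from its range and then distributing its bits over the lead and continuation bytes. The class name Utf8Sketch is an illustrative assumption.

```java
import java.io.ByteArrayOutputStream;

class Utf8Sketch {
    // Encodes one Unicode code point (U+0000..U+10FFFF, surrogates excluded) as UTF-8.
    static byte[] encode(int cp) {
        if (cp < 0 || cp > 0x10FFFF || (cp >= 0xD800 && cp <= 0xDFFF)) {
            throw new IllegalArgumentException("Not a Unicode scalar value: " + cp);
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream(4);
        if (cp < 0x80) {                 // 1 byte:  0xxxxxxx
            out.write(cp);
        } else if (cp < 0x800) {         // 2 bytes: 110xxxxx 10xxxxxx
            out.write(0xC0 | (cp >> 6));
            out.write(0x80 | (cp & 0x3F));
        } else if (cp < 0x10000) {       // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
            out.write(0xE0 | (cp >> 12));
            out.write(0x80 | ((cp >> 6) & 0x3F));
            out.write(0x80 | (cp & 0x3F));
        } else {                         // 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
            out.write(0xF0 | (cp >> 18));
            out.write(0x80 | ((cp >> 12) & 0x3F));
            out.write(0x80 | ((cp >> 6) & 0x3F));
            out.write(0x80 | (cp & 0x3F));
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // The euro sign U+20AC encodes to the three bytes E2 82 AC.
        for (byte b : encode(0x20AC)) {
            System.out.printf("%02X ", b);
        }
        System.out.println();
    }
}
```

The result can be cross-checked against Java's own encoder, for example with new String(Character.toChars(cp)).getBytes(StandardCharsets.UTF_8).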

Examples of Unicode Characters
This chapter provides notes and tutorial examples on the Unicode character set. Topics include an introduction to the Unicode standard, example characters, the history of releases, and blocks of code points.
2017-01-20, 1336👍, 5💬

💬 2017-01-20 anynomous: he ha ha

💬 2017-01-05 danish: that is great

Printable Copy - PDF Version
Information on how to obtain the PDF version of this book for printing.
2017-01-08, 1254👍, 4💬

💬 2017-01-08 Herong: Rex, see my email to your qq.com account.

💬 2017-01-08 rex: how to download this book

💬 2015-09-09 mani: require more details

U0600: Arabic
This section provides a quick summary of the Unicode code point block: 'Arabic', which contains 256 code points to represent alphabetic letters used in the Arabic language.
2015-08-07, 1109👍, 1💬

💬 2015-08-07 محمد خالد سعد حسي: From the table we change every word to its Unicode symbol the table from

Saving Files in "Unicode (UTF-8)" Option
This section provides a tutorial example on how to save text files with Notepad by selecting the 'Unicode (UTF-8)' encoding option in the file conversion dialog box.
2016-10-08, 719👍, 1💬

💬 2016-10-08 shital p: it's works fine for me thank u

U2600: Miscellaneous Symbols
This section provides a quick summary of the Unicode code point block: 'Miscellaneous Symbols', which contains 256 code points (U+2600 to U+26FF) to represent miscellaneous symbols.
2016-05-11, 571👍, 1💬

💬 2016-05-11 Neven: Thanks!

UAC00: Hangul Syllables
This section provides a quick summary of the Unicode code point block: 'Hangul Syllables', which contains 11184 code points to represent Hangul syllables used in the Korean language.
2017-02-21, 548👍, 2💬

💬 2017-02-21 Herong: danny, that's true. I think English can also create thousands of syllables.

💬 2017-02-18 danny: Hangul is so great that it can make 11 thousand 172 syllables with only 40 consonants and vowels

UTF-16 Encoding
This section provides a quick introduction to the UTF-16 (Unicode Transformation Format - 16-bit) encoding for the Unicode character set. Paired surrogates are used for characters in the U+10000 to U+10FFFF range.
2016-12-28, 506👍, 1💬

💬 2016-12-28 anson: what a detail of explain. i think i complete understand it. thanks very much.
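
The surrogate pair arithmetic can be sketched as follows: subtract 0x10000 to obtain a 20-bit value, then put the upper 10 bits into a high surrogate (starting at 0xD800) and the lower 10 bits into a low surrogate (starting at 0xDC00). The class name SurrogatePairs is an illustrative assumption; in practice Java's Character.toChars() performs the same conversion.

```java
class SurrogatePairs {
    // Splits a supplementary code point (U+10000..U+10FFFF) into a high and a low surrogate.
    static char[] toSurrogates(int cp) {
        if (cp < 0x10000 || cp > 0x10FFFF) {
            throw new IllegalArgumentException("Not a supplementary code point: " + cp);
        }
        int v = cp - 0x10000;                      // 20-bit value
        char high = (char) (0xD800 + (v >> 10));   // upper 10 bits
        char low = (char) (0xDC00 + (v & 0x3FF));  // lower 10 bits
        return new char[] { high, low };
    }

    public static void main(String[] args) {
        // U+1F600 maps to the pair D83D DE00.
        char[] pair = toSurrogates(0x1F600);
        System.out.printf("%04X %04X%n", (int) pair[0], (int) pair[1]);
    }
}
```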

Using Microsoft Excel as a Unicode Text Editor
This chapter provides notes and tutorial examples on using Microsoft Excel as a Unicode text editor. Topics include opening text files in 3 encodings: UTF-8, UTF-16BE, and UTF-16LE; and saving and opening Unicode text files in UTF-16 (Little-Endian with BOM) format.
2016-10-03, 446👍, 1💬

💬 2016-10-03 zoltan: useful

U1EE00: Arabic Mathematical Alphabetic Symbols
This section provides a quick summary of the Unicode code point block: 'Arabic Mathematical Alphabetic Symbols', which contains 256 code points to represent Arabic mathematical alphabetic symbols.
2017-04-01, 333👍, 2💬

💬 2017-03-21 Herong: Anon, can you show us some examples of Arabic mathematical symbols?

💬 2017-03-19 Anon: These aren't Arabic mathematical symbols

GB18030 Character Set and Encoding
This chapter provides notes and tutorial examples on the GB18030 character set and encoding. Topics include the history of GB character sets: GB2312, GB13000.1 (GBK) and GB18030; and the GB18030 encoding scheme.
2016-10-17, 332👍, 1💬

💬 2016-10-17 asd: test

Unicode Tutorials - Herong's Tutorial Examples
This free Unicode tutorial book is a collection of notes and sample code written by the author while he was learning Unicode himself. It is an ideal tutorial guide for beginners. Topics include ASCII, BMP, character set, encoding, decoding, GB, GB18030, GB2312, GBK, ISO-8859, Java, JDK, JIS, Surrogate, U...
2015-09-14, 303👍, 1💬

Downloading and Installing GNU Unifont
This section provides a tutorial example on how to download and install the GNU Unifont font family on Windows 7 systems.
2015-12-06, 284👍, 2💬

💬 2015-12-06 Herong: D, maybe you can start with What Is Unicode?.

💬 2015-12-05 d gayen: i want to understand what is unicode

U1B00: Balinese
This section provides a quick summary of the Unicode code point block: 'Balinese', which contains 128 code points to represent Balinese alphabets used in the Balinese language.
2017-05-23, 282👍, 2💬

💬 2017-05-23 Herong: Riccio, is that your birthday? :-)

💬 2017-05-17 riccio: August 18, 1983

UTF-8 (Unicode Transformation Format - 8-Bit)
This chapter provides notes and tutorial examples on UTF-8 encoding. Topics include an introduction to UTF-8 encoding; examples of encoded byte streams; and the UTF-8 encoding algorithm.
2015-08-10, 268👍, 1💬

UTF-32, UTF-32BE and UTF-32LE Encodings
This chapter provides notes and tutorial examples on UTF-32, UTF-32BE and UTF-32LE encodings. Topics include the encoding and decoding logic of UTF-32, UTF-32BE and UTF-32LE; and an explanation of the use of the BOM (Byte Order Mark).
2015-11-01, 258👍, 1💬
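
Because UTF-32 stores each code point directly as a 4-byte integer, the only difference between UTF-32BE and UTF-32LE is the byte order. A minimal sketch, assuming a hypothetical class Utf32Sketch and omitting BOM handling and range validation:

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

class Utf32Sketch {
    // Encodes one code point as 4 bytes in the requested byte order (no BOM written).
    static byte[] encode(int cp, boolean bigEndian) {
        ByteBuffer buf = ByteBuffer.allocate(4)
                .order(bigEndian ? ByteOrder.BIG_ENDIAN : ByteOrder.LITTLE_ENDIAN);
        buf.putInt(cp);
        return buf.array();
    }

    public static void main(String[] args) {
        // U+1F600 is 00 01 F6 00 in UTF-32BE and 00 F6 01 00 in UTF-32LE.
        for (boolean bigEndian : new boolean[] { true, false }) {
            StringBuilder sb = new StringBuilder(bigEndian ? "UTF-32BE:" : "UTF-32LE:");
            for (byte b : encode(0x1F600, bigEndian)) {
                sb.append(String.format(" %02X", b));
            }
            System.out.println(sb);
        }
    }
}
```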

Shift-JIS Encoding
This section provides a quick introduction to the Shift-JIS encoding, also called MS Kanji, which maps a JIS X 0208 character to a 2-byte sequence using a complicated scheme designed by ASCII Corporation in conjunction with Microsoft.
2017-03-02, 232👍, 1💬

UTF-8 Encoding
This section provides a quick introduction to the UTF-8 (Unicode Transformation Format - 8-bit) encoding for the Unicode character set. It uses 1, 2, 3, or 4 bytes for each character.
2016-12-30, 225👍, 1💬

💬 2016-12-30 Rehman: Welcome

UAAE0: Meetei Mayek Extensions
This section provides a quick summary of the Unicode code point block: 'Meetei Mayek Extensions', which contains 32 code points to represent additional Meetei Mayek alphabets used in the Meitei language.
2016-07-19, 219👍, 2💬

💬 2016-07-19 Herong: Hi Karen, Meetei Mayek is a set of Unicode code points, not a font. But we do need a font that supports Meetei Mayek code poin...

💬 2016-07-15 Karen: Is the Meeteri Mayak script a UTF8 font? If so, where can I download it? Thank you.

Character Encoding in Java
This chapter provides notes and tutorial examples on character encoding in Java. Topics include supported encodings in Java SE 7; using encoding and decoding methods; and examples of encoded byte sequences in various encodings.
2017-07-14, 211👍, 1💬

💬 2017-07-14 kev: hi
