1 2 3 > >>   Sort: Rank

GB18030 Character Set and Encoding
This chapter provides notes and tutorial examples on GB18030 character set and encoding. Topics including history of GB character sets: GB2312, GB1300.1 (GBK) and GB18030; GB18030 encoding schema.
2021-07-26, 1414👍, 4💬

💬 2020-04-10 JeeHan: hí hố hồ hè

💬 2020-02-20 a: bière

💬 2016-10-17 asd: test

UTF-16, UTF-16BE and UTF-16LE Encodings
This chapter provides notes and tutorial examples on UTF-16, UTF-16BE and UTF-16LE encodings. Topics including encoding and decoding logics of UTF-16, UTF-16BE and UTF-16LE encodings; introduction of surrogate pairs; explanation of the use of BOM (Byte Order Mark).
2021-07-20, 9753👍, 7💬

💬 2021-07-20 abraham: thankyou

💬 2019-08-09 asas: Haga contacto con su administrador del sistema si desea obtener más información

💬 2017-04-06 kl: 21 53 93 68 87 65 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 20 00 31 00 30 00 31 00 32 00 30 00 30 00 30...

UTF-16BE Encoding
This section provides a quick introduction of the UTF-16BE (Unicode Transformation Format - 16-bit Big Endian) encoding for Unicode character set. UTF-16BE is a variation of UTF-16.
2021-07-13, 5361👍, 12💬

💬 2021-07-13 arun: 灩捯䍔䙻ㄶ形楴獟楮獴㌴摟潦弸弲㘶㠴挲ぽ

💬 2021-06-19 Tibesti: 灩捯䍔䙻ㄶ形楴獟楮獴㌴摟潦弸彤㔲挶戹㍽

💬 2021-06-10 jack: 灩捯䍔䙻ㄶ形楴獟楮獴㌴摟潦弸弲㘶㠴挲ぽ

💬 2019-11-25 Remo Harsono: Very helpful information, thanks

💬 2019-08-09 asas: UTF-16BE

(More comments ...)

1FB00: Symbols for Legacy Computing
This section provides a quick summary of the Unicode code point block: 'Symbols for Legacy Computing', which contains 256 code points to represent Symbols for Legacy Computing alphabets used in the Symbols for Legacy Computing language.
2021-07-05, 132👍, 1💬

💬 2021-07-05 Jayy Meyerholt: It could say "The "Symbols for Legacy Computing" block contains code points to represent various symbols from various retro enco...

UTF-16LE Encoding
This section provides a quick introduction of the UTF-16LE (Unicode Transformation Format - 16-bit Little Endian) encoding for Unicode character set. UTF-16LE is a variation of UTF-16.
2021-07-02, 13672👍, 11💬

💬 2021-07-02 Den: янъО ­_Зт`ёЫwa2?•ЎРыћс# тR;ЉJh62˜ PИS«‰0L“+Oд¤HPy·жu¬Aј¬'е]MџYлџjC, вгtnВ#С„ёШ6ъ]ліЫ‡UвЌ@м*...

💬 2021-04-06 phuc: Thanks

💬 2021-03-21 Herong: wqeq, what system are you using when posting your comment?

💬 2021-03-15 wqeq: 瑣筦㈰摢㠴㈶㌷㈰㌶㈶㡥㙡㘹挱㍤〳㠳㈱㜰挳〵慦㔷戹㈴戰攱愷ㄱ㉡㍣扡㄰〳੽

💬 2021-03-07 Somphoth Boby Siratsamy RBY740: If b were -1 and a is 7, 7-(-1)= 8 now if we were to square 7-(-1), it would equal 64 now if we were to [] 7-(-1[])to the square...

(More comments ...)

ISO-2022-JP Encoding
This section provides a quick introduction of ISO-2022-JP encoding, which maps a JIS X0208 character to a 2-byte sequence by using both bytes of the character's code value directly.
2021-07-02, 174👍, 1💬

💬 2021-07-02 тест: 0xACED0005737200116A6176612E7574696C 2E486173684D61700507DAC1C31660D103000246000A6C6F6164466163746F72 4900097468726573686F6C64787...

UTF-8 (Unicode Transformation Format - 8-Bit)
This chapter provides notes and tutorial examples on UTF-8 encoding. Topics including introduction of UTF-8 encoding; examples of encoded byte stream; UTF-8 encoding algorithm.
2021-06-24, 346👍, 1💬

💬 2021-06-24 ات: 110110001010111011011000101011101101 1000101011101101100010101110110110001010111000100000110110001010 011111011001100001101101100...

EUC-JP Encoding
This section provides a quick introduction of EUC-JP encoding, which maps a JIS X0208 character to a 2-byte sequence by adding 128 (0x80) to both bytes of the character's code value.
2021-05-31, 202👍, 1💬

💬 2021-05-31 greatman: Great!

\uxxxx - Entering Unicode Data in Java Programs
This section provides a tutorial example on how to enter Unicode characters using \uxxxx escape sequences in a Java program, and same them to any giving character set encoding.
2021-05-26, 567👍, 2💬

💬 2021-05-26 Herong: dipper, thanks for sharing your code!

💬 2021-05-13 dipper: <html> <body> <form enctype='multipart/form-data' action='' method='post'> <input type='hidden' name...

Using Notepad as a Unicode Text Editor
This chapter provides notes and tutorial examples on using Nodepad as a Unicode text editor. Topics including opening Unicode text files in 3 encodings: UTF-8, UTF-16BE, and UTF-16LE; saving and opening Unicode text files with the BOM character.
2021-05-03, 161👍, 1💬

UTF-16 Encoding
This section provides a quick introduction of the UTF-16 (Unicode Transformation Format - 16-bit) encoding for Unicode character set. Paired surrogates are used for characters in the U+10000...0x10FFFF range.
2021-04-28, 1838👍, 2💬

💬 2016-12-28 anson: what a detail of explain. i think i complete understand it. thanks very much.

3200: Enclosed CJK Letters and Months
This section provides a quick summary of the Unicode code point block: 'Enclosed CJK Letters and Months', which contains 256 code points to represent characters enclosed in circles used in CJK (Chinese, Japanese, and Korean) languages.
2020-12-07, 179👍, 1💬

💬 2020-12-07 Eddie: I like this

U1D100: Musical Symbols
This section provides a quick summary of the Unicode code point block: 'Musical Symbols', which contains 256 code points to represent Musical Symbols alphabets used in the Musical Symbols language.
2020-11-06, 158👍, 1💬

Unicode Standard Releases
This section provides a list of major releases of the Unicode standard and their publishing dates.
2020-10-26, 117👍, 1💬

Shift-JIS Encoding
This section provides a quick introduction of Shift-JIS, also called MS Kanji, encoding, which maps a JIS X0208 character to a 2-byte sequence using a complicated schema designed by Microsoft.
2020-09-18, 1355👍, 6💬

💬 2020-09-18 Utkarsj: hello hi bye

💬 2020-04-03 Qqqq: スーパー

💬 2019-07-16 Duc: かきくけこ

U0300: Combining Diacritical Marks
This section provides a quick summary of the Unicode code point block: 'Combining Diacritical Marks', which contains 112 code points to represent diacritical marks used in conjunction with preceding letters.
2020-06-03, 212👍, 2💬

💬 2020-06-03 Herong: Indeed, very funny way to use combing marks!

💬 2020-05-22 ä̈̈̈̈̈̈: Ver̃ŷ f̈unny.

UFFF0: Specials
This section provides a quick summary of the Unicode code point block: 'Specials', which contains 16 code points to represent special codes
2020-05-21, 1280👍, 2💬

💬 2020-01-28 Ronikleylopessalgado: 2020-1979=41

Examples of Unicode Characters
This chapter provides notes and tutorial examples on the Unicode character set. Topics including introduction of Unicode standard, example characters, history of releases, blocks of code points.
2020-04-27, 4579👍, 8💬

💬 2019-06-24 abcd: It's a rainy night

💬 2017-01-20 anynomous: he ha ha

💬 2017-01-05 danish: that is great

U0370: Greek and Coptic
This section provides a quick summary of the Unicode code point block: 'Greek and Coptic', which contains 144 code points to represent alphabetic letters used in Greek language and Coptic language.
2020-04-14, 297👍, 2💬

💬 2020-04-14 Herong: John, if they are not included Unicode, you can send suggestions to unicode.org.

💬 2020-04-10 John: Am working with book on historic pipe organs that has lots of Greek items that cause trouble.

U1D200: Ancient Greek Musical Notation
This section provides a quick summary of the Unicode code point block: 'Ancient Greek Musical Notation', which contains 80 code points to represent ancient Greek nusical notations.
2020-04-06, 273👍, 2💬

💬 2020-04-06 Herong: Dennis, in which application do you need those notations?

💬 2020-03-28 Dennis Farrell: Hello, cannot seem to activate specific notational items from Ancient Greek Musical Notation; am using macBook Pro, Catalina 10....

U0600: Arabic
This section provides a quick summary of the Unicode code point block: 'Arabic', which contains 256 code points to represent alphabetic letters used in the Arabic language.
2020-03-06, 2013👍, 2💬

💬 2020-03-06 Wikrama Satyadarma, DVM: Jazakallah khairan katsiran.. This is what I looking for

💬 2015-08-07 محمد خالد سعد حسي: From the table we change every word to its Unicode symbol the table from

List of Supported Character Encodings in Java
This section provides a list of supported character encodings supported in Java. The list is generated using the availableCharsets() static method in the java.nio.charset.Charset class.
2020-02-25, 19155👍, 3💬

💬 2020-02-25 Herong: Give me some clues, like where did you get this from?

💬 2020-02-19 kgs: Tell me encoding of this one: �������� � ������������ ���������������� �������� ���������...

💬 2015-11-27 nawed ahmed khan: Thanks☺

Examples of CP1252 and ISO-8859-1 Encodings
This section provides examples of encoded byte sequences of the JVM default encoding, CP1252 encoding, on a Windows system. The ISO-8859-1 encoding is slightly different that the CP1252 encoding.
2020-02-24, 1191👍, 2💬

💬 2020-02-24 Muhammad Ijaz: i am Muhammad Ijaz

GB2312 Encoding for GB2312 Character Set
This section provides a quick introduction of the GB2312 encoding for the GB2312 character set. GB2312 is a 2-byte (8 bits per bytes) encoding.
2019-12-11, 115👍, 1💬

1 2 3 > >>   Sort: Rank