Jump to content

Unicode block

From Wikipedia, the free encyclopedia

A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole.

Each block is generally, but not always, meant to supply glyphs used by one or more specific languages, or in some general application area such as mathematics, surveying, decorative typesetting, social forums, etc.

Design and implementation

[edit]

Unicode blocks are identified by unique names, which use only ASCII characters and are usually descriptive of the nature of the symbols, in English; such as "Tibetan" or "Supplemental Arrows-A". (When comparing block names, one is supposed to equate uppercase with lowercase letters, and ignore any whitespace, hyphens, and underbars; so the last name is equivalent to "supplemental_arrows__a" and "SUPPLEMENTALARROWSA".[1]

Blocks are pairwise disjoint; that is, they do not overlap. The starting code point and the size (number of code points) of each block are always multiples of 16; therefore, in the hexadecimal notation, the starting (smallest) point is U+xxx0 and the ending (largest) point is U+yyyF, where xxx and yyy are three or more hexadecimal digits. (These constraints are intended to simplify the display of glyphs in Unicode Consortium documents, as tables with 16 rows labeled with the last hexadecimal digit of the code point.[1]) The size of a block may range from the minimum of 16 to a maximum of 65,536 code points.

Every assigned code point has a glyph property called "Block", whose value is a character string naming the unique block that owns that point.[2] However, a block may also contain unassigned code points, usually reserved for future additions of characters that "logically" should belong to that block. Code points not belonging to any of the named blocks, e.g. in the unassigned planes 4–13, have the value block="No_Block".[1]

Simply belonging to a particular Unicode block does not guarantee the certain particular properties of the characters it is or will be expected to contain. The identity of any character is determined by its properties stated in the Unicode Character Database. For example, the contiguous range of 32 noncharacter code points U+FDD0..U+FDEF share none of the properties common to the other characters in the Arabic Presentation Forms-A block, that they are certainly not Arabic script characters or "right-to-left noncharacters", and are assigned there as a filler to this block given that it has been agreed that no further Arabic compatibility characters will be encoded. [3]

Other classifications

[edit]

Each Unicode point also has a property called "General Category", that attempts to describe the role of the corresponding symbol in the languages or applications for whose sake it was included in the system. Examples of General Categories are "Lu" (meaning upper-case letter), "Nd" (decimal digit), "Pi" (open-quote punctuation), and "Mn" (non-spacing mark, i.e. a diacritic for the preceding glyph). This division is completely independent of code blocks: the code points with a given General Category generally span many blocks, and do not have to be consecutive, not even within each block.[4]

Each code point also has a script property, specifying which writing system it is intended for, or whether it is intended for multiple writing systems. This, also, is independent of block.

In descriptions of the Unicode system, a block may be subdivided into more specific subgroups, such as the "Chess symbols" in the Miscellaneous Symbols block (not to be confused with the separate Chess Symbols block). Those subgroups are not "blocks" in the technical sense used by the Unicode consortium, and are named only for the convenience of users.

List of blocks

[edit]

Unicode 16.0 defines 338 blocks:[1]

  • 164 in plane 0, the Basic Multilingual Plane (in table below: § BMP)
  • 161 in plane 1, the Supplementary Multilingual Plane (§ SMP)
  • 7 in plane 2, the Supplementary Ideographic Plane (§ SIP)
  • 2 in plane 3, the Tertiary Ideographic Plane (§ TIP)
  • 2 in plane 14 (E in hexadecimal), the Supplementary Special-purpose Plane (§ SSP)
  • One each in the planes 15 (Fhex) and 16 (10hex), called Supplementary Private Use Area-A and -B (§ PUA-A)
PlaneBlock range Block name Code points[a]Assigned characters Scripts[b][c][d][e][f]
 0 BMPU+0000..U+007FBasic Latin[g]128 128 Latin (52 characters), Common (76 characters)
 0 BMP U+0080..U+00FFLatin-1 Supplement[h]128 128 Latin (64 characters), Common (64 characters)
 0 BMP U+0100..U+017FLatin Extended-A128 128 Latin
 0 BMP U+0180..U+024FLatin Extended-B208 208 Latin
 0 BMP U+0250..U+02AFIPA Extensions96 96 Latin
 0 BMP U+02B0..U+02FFSpacing Modifier Letters80 80 Bopomofo (2 characters), Latin (14 characters), Common (64 characters)
 0 BMP U+0300..U+036FCombining Diacritical Marks112 112 Inherited
 0 BMP U+0370..U+03FFGreek and Coptic144 135 Coptic (14 characters), Greek (117 characters), Common (4 characters)
 0 BMP U+0400..U+04FFCyrillic256 256 Cyrillic (254 characters), Inherited (2 characters)
 0 BMP U+0500..U+052FCyrillic Supplement48 48 Cyrillic
 0 BMP U+0530..U+058FArmenian96 91 Armenian
 0 BMP U+0590..U+05FFHebrew112 88 Hebrew
 0 BMP U+0600..U+06FFArabic256 256 Arabic (238 characters), Common (6 characters), Inherited (12 characters)
 0 BMP U+0700..U+074FSyriac80 77 Syriac
 0 BMP U+0750..U+077FArabic Supplement48 48 Arabic
 0 BMP U+0780..U+07BFThaana64 50 Thaana
 0 BMP U+07C0..U+07FFNKo64 62 N’Ko
 0 BMP U+0800..U+083FSamaritan64 61 Samaritan
 0 BMP U+0840..U+085FMandaic32 29 Mandaic
 0 BMP U+0860..U+086FSyriac Supplement16 11 Syriac
 0 BMP U+0870..U+089FArabic Extended-B48 42 Arabic
 0 BMP U+08A0..U+08FFArabic Extended-A96 96 Arabic (95 characters), Common (1 character)
 0 BMP U+0900..U+097FDevanagari128 128 Devanagari (122 characters), Common (2 characters), Inherited (4 characters)
 0 BMP U+0980..U+09FFBengali128 96 Bengali
 0 BMP U+0A00..U+0A7FGurmukhi128 80 Gurmukhi
 0 BMP U+0A80..U+0AFFGujarati128 91 Gujarati
 0 BMP U+0B00..U+0B7FOriya128 91 Oriya
 0 BMP U+0B80..U+0BFFTamil128 72 Tamil
 0 BMP U+0C00..U+0C7FTelugu128 100 Telugu
 0 BMP U+0C80..U+0CFFKannada128 91 Kannada
 0 BMP U+0D00..U+0D7FMalayalam128 118 Malayalam
 0 BMP U+0D80..U+0DFFSinhala128 91 Sinhala
 0 BMP U+0E00..U+0E7FThai128 87 Thai (86 characters), Common (1 character)
 0 BMP U+0E80..U+0EFFLao128 83 Lao
 0 BMP U+0F00..U+0FFFTibetan256 211 Tibetan (207 characters), Common (4 characters)
 0 BMP U+1000..U+109FMyanmar160 160 Myanmar
 0 BMP U+10A0..U+10FFGeorgian96 88 Georgian (87 characters), Common (1 character)
 0 BMP U+1100..U+11FFHangul Jamo256 256 Hangul
 0 BMP U+1200..U+137FEthiopic384 358 Ethiopic
 0 BMP U+1380..U+139FEthiopic Supplement32 26 Ethiopic
 0 BMP U+13A0..U+13FFCherokee96 92 Cherokee
 0 BMP U+1400..U+167FUnified Canadian Aboriginal Syllabics640 640 Canadian Aboriginal
 0 BMP U+1680..U+169FOgham32 29 Ogham
 0 BMP U+16A0..U+16FFRunic96 89 Runic (86 characters), Common (3 characters)
 0 BMP U+1700..U+171FTagalog32 23 Tagalog
 0 BMP U+1720..U+173FHanunoo32 23 Hanunoo (21 characters), Common (2 characters)
 0 BMP U+1740..U+175FBuhid32 20 Buhid
 0 BMP U+1760..U+177FTagbanwa32 18 Tagbanwa
 0 BMP U+1780..U+17FFKhmer128 114 Khmer
 0 BMP U+1800..U+18AFMongolian176 158 Mongolian (155 characters), Common (3 characters)
 0 BMP U+18B0..U+18FFUnified Canadian Aboriginal Syllabics Extended80 70 Canadian Aboriginal
 0 BMP U+1900..U+194FLimbu80 68 Limbu
 0 BMP U+1950..U+197FTai Le48 35 Tai Le
 0 BMP U+1980..U+19DFNew Tai Lue96 83 New Tai Lue
 0 BMP U+19E0..U+19FFKhmer Symbols32 32 Khmer
 0 BMP U+1A00..U+1A1FBuginese32 30 Buginese
 0 BMP U+1A20..U+1AAFTai Tham144 127 Tai Tham
 0 BMP U+1AB0..U+1AFFCombining Diacritical Marks Extended80 31 Inherited
 0 BMP U+1B00..U+1B7FBalinese128 127 Balinese
 0 BMP U+1B80..U+1BBFSundanese64 64 Sundanese
 0 BMP U+1BC0..U+1BFFBatak64 56 Batak
 0 BMP U+1C00..U+1C4FLepcha80 74 Lepcha
 0 BMP U+1C50..U+1C7FOl Chiki48 48 Ol Chiki
 0 BMP U+1C80..U+1C8FCyrillic Extended-C16 11 Cyrillic
 0 BMP U+1C90..U+1CBFGeorgian Extended48 46 Georgian
 0 BMP U+1CC0..U+1CCFSundanese Supplement16 8 Sundanese
 0 BMP U+1CD0..U+1CFFVedic Extensions48 43 Common (16 characters), Inherited (27 characters)
 0 BMP U+1D00..U+1D7FPhonetic Extensions128 128 Cyrillic (2 characters), Greek (15 characters), Latin (111 characters)
 0 BMP U+1D80..U+1DBFPhonetic Extensions Supplement64 64 Greek (1 character), Latin (63 characters)
 0 BMP U+1DC0..U+1DFFCombining Diacritical Marks Supplement64 64 Inherited
 0 BMP U+1E00..U+1EFFLatin Extended Additional256 256 Latin
 0 BMP U+1F00..U+1FFFGreek Extended256 233 Greek
 0 BMP U+2000..U+206FGeneral Punctuation112 111 Common (109 characters), Inherited (2 characters)
 0 BMP U+2070..U+209FSuperscripts and Subscripts48 42 Latin (15 characters), Common (27 characters)
 0 BMP U+20A0..U+20CFCurrency Symbols48 33 Common
 0 BMP U+20D0..U+20FFCombining Diacritical Marks for Symbols48 33 Inherited
 0 BMP U+2100..U+214FLetterlike Symbols80 80 Greek (1 character), Latin (4 characters), Common (75 characters)
 0 BMP U+2150..U+218FNumber Forms64 60 Latin (41 characters), Common (19 characters)
 0 BMP U+2190..U+21FFArrows112 112 Common
 0 BMP U+2200..U+22FFMathematical Operators256 256 Common
 0 BMP U+2300..U+23FFMiscellaneous Technical256 256 Common
 0 BMP U+2400..U+243FControl Pictures64 42 Common
 0 BMP U+2440..U+245FOptical Character Recognition32 11 Common
 0 BMP U+2460..U+24FFEnclosed Alphanumerics160 160 Common
 0 BMP U+2500..U+257FBox Drawing128 128 Common
 0 BMP U+2580..U+259FBlock Elements32 32 Common
 0 BMP U+25A0..U+25FFGeometric Shapes96 96 Common
 0 BMP U+2600..U+26FFMiscellaneous Symbols256 256 Common
 0 BMP U+2700..U+27BFDingbats192 192 Common
 0 BMP U+27C0..U+27EFMiscellaneous Mathematical Symbols-A48 48 Common
 0 BMP U+27F0..U+27FFSupplemental Arrows-A16 16 Common
 0 BMP U+2800..U+28FFBraille Patterns256 256 Braille
 0 BMP U+2900..U+297FSupplemental Arrows-B128 128 Common
 0 BMP U+2980..U+29FFMiscellaneous Mathematical Symbols-B128 128 Common
 0 BMP U+2A00..U+2AFFSupplemental Mathematical Operators256 256 Common
 0 BMP U+2B00..U+2BFFMiscellaneous Symbols and Arrows256 253 Common
 0 BMP U+2C00..U+2C5FGlagolitic96 96 Glagolitic
 0 BMP U+2C60..U+2C7FLatin Extended-C32 32 Latin
 0 BMP U+2C80..U+2CFFCoptic128 123 Coptic
 0 BMP U+2D00..U+2D2FGeorgian Supplement48 40 Georgian
 0 BMP U+2D30..U+2D7FTifinagh80 59 Tifinagh
 0 BMP U+2D80..U+2DDFEthiopic Extended96 79 Ethiopic
 0 BMP U+2DE0..U+2DFFCyrillic Extended-A32 32 Cyrillic
 0 BMP U+2E00..U+2E7FSupplemental Punctuation128 94 Common
 0 BMP U+2E80..U+2EFFCJK Radicals Supplement128 115 Han
 0 BMP U+2F00..U+2FDFKangxi Radicals224 214 Han
 0 BMP U+2FF0..U+2FFFIdeographic Description Characters16 16 Common
 0 BMP U+3000..U+303FCJK Symbols and Punctuation64 64 Han (15 characters), Hangul (2 characters), Common (43 characters), Inherited (4 characters)
 0 BMP U+3040..U+309FHiragana96 93 Hiragana (89 characters), Common (2 characters), Inherited (2 characters)
 0 BMP U+30A0..U+30FFKatakana96 96 Katakana (93 characters), Common (3 characters)
 0 BMP U+3100..U+312FBopomofo48 43 Bopomofo
 0 BMP U+3130..U+318FHangul Compatibility Jamo96 94 Hangul
 0 BMP U+3190..U+319FKanbun16 16 Common
 0 BMP U+31A0..U+31BFBopomofo Extended32 32 Bopomofo
 0 BMP U+31C0..U+31EFCJK Strokes48 39 Common
 0 BMP U+31F0..U+31FFKatakana Phonetic Extensions16 16 Katakana
 0 BMP U+3200..U+32FFEnclosed CJK Letters and Months256 255 Hangul (62 characters), Katakana (47 characters), Common (146 characters)
 0 BMP U+3300..U+33FFCJK Compatibility256 256 Katakana (88 characters), Common (168 characters)
 0 BMP U+3400..U+4DBFCJK Unified Ideographs Extension A6,592 6,592 Han
 0 BMP U+4DC0..U+4DFFYijing Hexagram Symbols64 64 Common
 0 BMP U+4E00..U+9FFFCJK Unified Ideographs20,992 20,992 Han
 0 BMP U+A000..U+A48FYi Syllables1,168 1,165 Yi
 0 BMP U+A490..U+A4CFYi Radicals64 55 Yi
 0 BMP U+A4D0..U+A4FFLisu48 48 Lisu
 0 BMP U+A500..U+A63FVai320 300 Vai
 0 BMP U+A640..U+A69FCyrillic Extended-B96 96 Cyrillic
 0 BMP U+A6A0..U+A6FFBamum96 88 Bamum
 0 BMP U+A700..U+A71FModifier Tone Letters32 32 Common
 0 BMP U+A720..U+A7FFLatin Extended-D224 199 Latin (194 characters), Common (5 characters)
 0 BMP U+A800..U+A82FSyloti Nagri48 45 Syloti Nagri
 0 BMP U+A830..U+A83FCommon Indic Number Forms16 10 Common
 0 BMP U+A840..U+A87FPhags-pa64 56 Phags Pa
 0 BMP U+A880..U+A8DFSaurashtra96 82 Saurashtra
 0 BMP U+A8E0..U+A8FFDevanagari Extended32 32 Devanagari
 0 BMP U+A900..U+A92FKayah Li48 48 Kayah Li (47 characters), Common (1 character)
 0 BMP U+A930..U+A95FRejang48 37 Rejang
 0 BMP U+A960..U+A97FHangul Jamo Extended-A32 29 Hangul
 0 BMP U+A980..U+A9DFJavanese96 91 Javanese (90 characters), Common (1 character)
 0 BMP U+A9E0..U+A9FFMyanmar Extended-B32 31 Myanmar
 0 BMP U+AA00..U+AA5FCham96 83 Cham
 0 BMP U+AA60..U+AA7FMyanmar Extended-A32 32 Myanmar
 0 BMP U+AA80..U+AADFTai Viet96 72 Tai Viet
 0 BMP U+AAE0..U+AAFFMeetei Mayek Extensions32 23 Meetei Mayek
 0 BMP U+AB00..U+AB2FEthiopic Extended-A48 32 Ethiopic
 0 BMP U+AB30..U+AB6FLatin Extended-E64 60 Latin (56 characters), Greek (1 character), Common (3 characters)
 0 BMP U+AB70..U+ABBFCherokee Supplement80 80 Cherokee
 0 BMP U+ABC0..U+ABFFMeetei Mayek64 56 Meetei Mayek
 0 BMP U+AC00..U+D7AFHangul Syllables11,184 11,172 Hangul
 0 BMP U+D7B0..U+D7FFHangul Jamo Extended-B80 72 Hangul
 0 BMP U+D800..U+DB7FHigh Surrogates896 0 Unknown
 0 BMP U+DB80..U+DBFFHigh Private Use Surrogates128 0 Unknown
 0 BMP U+DC00..U+DFFFLow Surrogates1,024 0 Unknown
 0 BMP U+E000..U+F8FFPrivate Use Area6,400 6,400 Unknown
 0 BMP U+F900..U+FAFFCJK Compatibility Ideographs512 472 Han
 0 BMP U+FB00..U+FB4FAlphabetic Presentation Forms80 58 Armenian (5 characters), Hebrew (46 characters), Latin (7 characters)
 0 BMP U+FB50..U+FDFFArabic Presentation Forms-A688 631 Arabic (629 characters), Common (2 characters)
 0 BMP U+FE00..U+FE0FVariation Selectors16 16 Inherited
 0 BMP U+FE10..U+FE1FVertical Forms16 10 Common
 0 BMP U+FE20..U+FE2FCombining Half Marks16 16 Cyrillic (2 characters), Inherited (14 characters)
 0 BMP U+FE30..U+FE4FCJK Compatibility Forms32 32 Common
 0 BMP U+FE50..U+FE6FSmall Form Variants32 26 Common
 0 BMP U+FE70..U+FEFFArabic Presentation Forms-B144 141 Arabic (140 characters), Common (1 character)
 0 BMP U+FF00..U+FFEFHalfwidth and Fullwidth Forms240 225 Hangul (52 characters), Katakana (55 characters), Latin (52 characters), Common (66 characters)
 0 BMP U+FFF0..U+FFFFSpecials16 5 Common
 1 SMPU+10000..U+1007FLinear B Syllabary128 88 Linear B
 1 SMP U+10080..U+100FFLinear B Ideograms128 123 Linear B
 1 SMP U+10100..U+1013FAegean Numbers64 57 Common
 1 SMP U+10140..U+1018FAncient Greek Numbers80 79 Greek
 1 SMP U+10190..U+101CFAncient Symbols64 14 Greek (1 character), Common (13 characters)
 1 SMP U+101D0..U+101FFPhaistos Disc48 46 Common (45 characters), Inherited (1 character)
 1 SMP U+10280..U+1029FLycian32 29 Lycian
 1 SMP U+102A0..U+102DFCarian64 49 Carian
 1 SMP U+102E0..U+102FFCoptic Epact Numbers32 28 Common (27 characters), Inherited (1 character)
 1 SMP U+10300..U+1032FOld Italic48 39 Old Italic
 1 SMP U+10330..U+1034FGothic32 27 Gothic
 1 SMP U+10350..U+1037FOld Permic48 43 Old Permic
 1 SMP U+10380..U+1039FUgaritic32 31 Ugaritic
 1 SMP U+103A0..U+103DFOld Persian64 50 Old Persian
 1 SMP U+10400..U+1044FDeseret80 80 Deseret
 1 SMP U+10450..U+1047FShavian48 48 Shavian
 1 SMP U+10480..U+104AFOsmanya48 40 Osmanya
 1 SMP U+104B0..U+104FFOsage80 72 Osage
 1 SMP U+10500..U+1052FElbasan48 40 Elbasan
 1 SMP U+10530..U+1056FCaucasian Albanian64 53 Caucasian Albanian
 1 SMP U+10570..U+105BFVithkuqi80 70 Vithkuqi
 1 SMP U+105C0..U+105FFTodhri64 52 Todhri
 1 SMP U+10600..U+1077FLinear A384 341 Linear A
 1 SMP U+10780..U+107BFLatin Extended-F64 57 Latin
 1 SMP U+10800..U+1083FCypriot Syllabary64 55 Cypriot
 1 SMP U+10840..U+1085FImperial Aramaic32 31 Imperial Aramaic
 1 SMP U+10860..U+1087FPalmyrene32 32 Palmyrene
 1 SMP U+10880..U+108AFNabataean48 40 Nabataean
 1 SMP U+108E0..U+108FFHatran32 26 Hatran
 1 SMP U+10900..U+1091FPhoenician32 29 Phoenician
 1 SMP U+10920..U+1093FLydian32 27 Lydian
 1 SMP U+10980..U+1099FMeroitic Hieroglyphs32 32 Meroitic Hieroglyphs
 1 SMP U+109A0..U+109FFMeroitic Cursive96 90 Meroitic Cursive
 1 SMP U+10A00..U+10A5FKharoshthi96 68 Kharoshthi
 1 SMP U+10A60..U+10A7FOld South Arabian32 32 Old South Arabian
 1 SMP U+10A80..U+10A9FOld North Arabian32 32 Old North Arabian
 1 SMP U+10AC0..U+10AFFManichaean64 51 Manichaean
 1 SMP U+10B00..U+10B3FAvestan64 61 Avestan
 1 SMP U+10B40..U+10B5FInscriptional Parthian32 30 Inscriptional Parthian
 1 SMP U+10B60..U+10B7FInscriptional Pahlavi32 27 Inscriptional Pahlavi
 1 SMP U+10B80..U+10BAFPsalter Pahlavi48 29 Psalter Pahlavi
 1 SMP U+10C00..U+10C4FOld Turkic80 73 Old Turkic
 1 SMP U+10C80..U+10CFFOld Hungarian128 108 Old Hungarian
 1 SMP U+10D00..U+10D3FHanifi Rohingya64 50 Hanifi Rohingya
 1 SMP U+10D40..U+10D8FGaray80 69 Garay
 1 SMP U+10E60..U+10E7FRumi Numeral Symbols32 31 Arabic
 1 SMP U+10E80..U+10EBFYezidi64 47 Yezidi
 1 SMP U+10EC0..U+10EFFArabic Extended-C64 7 Arabic
 1 SMP U+10F00..U+10F2FOld Sogdian48 40 Old Sogdian
 1 SMP U+10F30..U+10F6FSogdian64 42 Sogdian
 1 SMP U+10F70..U+10FAFOld Uyghur64 26 Old Uyghur
 1 SMP U+10FB0..U+10FDFChorasmian48 28 Chorasmian
 1 SMP U+10FE0..U+10FFFElymaic32 23 Elymaic
 1 SMP U+11000..U+1107FBrahmi128 115 Brahmi
 1 SMP U+11080..U+110CFKaithi80 68 Kaithi
 1 SMP U+110D0..U+110FFSora Sompeng48 35 Sora Sompeng
 1 SMP U+11100..U+1114FChakma80 71 Chakma
 1 SMP U+11150..U+1117FMahajani48 39 Mahajani
 1 SMP U+11180..U+111DFSharada96 96 Sharada
 1 SMP U+111E0..U+111FFSinhala Archaic Numbers32 20 Sinhala
 1 SMP U+11200..U+1124FKhojki80 65 Khojki
 1 SMP U+11280..U+112AFMultani48 38 Multani
 1 SMP U+112B0..U+112FFKhudawadi80 69 Khudawadi
 1 SMP U+11300..U+1137FGrantha128 86 Grantha (85 characters), Inherited (1 character)
 1 SMP U+11380..U+113FFTulu-Tigalari128 80 Tulu Tigalari
 1 SMP U+11400..U+1147FNewa128 97 Newa
 1 SMP U+11480..U+114DFTirhuta96 82 Tirhuta
 1 SMP U+11580..U+115FFSiddham128 92 Siddham
 1 SMP U+11600..U+1165FModi96 79 Modi
 1 SMP U+11660..U+1167FMongolian Supplement32 13 Mongolian
 1 SMP U+11680..U+116CFTakri80 68 Takri
 1 SMP U+116D0..U+116FFMyanmar Extended-C48 20 Myanmar
 1 SMP U+11700..U+1174FAhom80 65 Ahom
 1 SMP U+11800..U+1184FDogra80 60 Dogra
 1 SMP U+118A0..U+118FFWarang Citi96 84 Warang Citi
 1 SMP U+11900..U+1195FDives Akuru96 72 Dives Akuru
 1 SMP U+119A0..U+119FFNandinagari96 65 Nandinagari
 1 SMP U+11A00..U+11A4FZanabazar Square80 72 Zanabazar Square
 1 SMP U+11A50..U+11AAFSoyombo96 83 Soyombo
 1 SMP U+11AB0..U+11ABFUnified Canadian Aboriginal Syllabics Extended-A16 16 Canadian Aboriginal
 1 SMP U+11AC0..U+11AFFPau Cin Hau64 57 Pau Cin Hau
 1 SMP U+11B00..U+11B5FDevanagari Extended-A96 10 Devanagari
 1 SMP U+11BC0..U+11BFFSunuwar64 44 Sunuwar
 1 SMP U+11C00..U+11C6FBhaiksuki112 97 Bhaiksuki
 1 SMP U+11C70..U+11CBFMarchen80 68 Marchen
 1 SMP U+11D00..U+11D5FMasaram Gondi96 75 Masaram Gondi
 1 SMP U+11D60..U+11DAFGunjala Gondi80 63 Gunjala Gondi
 1 SMP U+11EE0..U+11EFFMakasar32 25 Makasar
 1 SMP U+11F00..U+11F5FKawi96 87 Kawi
 1 SMP U+11FB0..U+11FBFLisu Supplement16 1 Lisu
 1 SMP U+11FC0..U+11FFFTamil Supplement64 51 Tamil
 1 SMP U+12000..U+123FFCuneiform1,024 922 Cuneiform
 1 SMP U+12400..U+1247FCuneiform Numbers and Punctuation128 116 Cuneiform
 1 SMP U+12480..U+1254FEarly Dynastic Cuneiform208 196 Cuneiform
 1 SMP U+12F90..U+12FFFCypro-Minoan112 99 Cypro Minoan
 1 SMP U+13000..U+1342FEgyptian Hieroglyphs1,072 1,072 Egyptian Hieroglyphs
 1 SMP U+13430..U+1345FEgyptian Hieroglyph Format Controls48 38 Egyptian Hieroglyphs
 1 SMP U+13460..U+143FFEgyptian Hieroglyphs Extended-A4,000 3,995 Egyptian Hieroglyphs
 1 SMP U+14400..U+1467FAnatolian Hieroglyphs640 583 Anatolian Hieroglyphs
 1 SMP U+16100..U+1613FGurung Khema64 58 Gurung Khema
 1 SMP U+16800..U+16A3FBamum Supplement576 569 Bamum
 1 SMP U+16A40..U+16A6FMro48 43 Mro
 1 SMP U+16A70..U+16ACFTangsa96 89 Tangsa
 1 SMP U+16AD0..U+16AFFBassa Vah48 36 Bassa Vah
 1 SMP U+16B00..U+16B8FPahawh Hmong144 127 Pahawh Hmong
 1 SMP U+16D40..U+16D7FKirat Rai64 58 Kirat Rai
 1 SMP U+16E40..U+16E9FMedefaidrin96 91 Medefaidrin
 1 SMP U+16F00..U+16F9FMiao160 149 Miao
 1 SMP U+16FE0..U+16FFFIdeographic Symbols and Punctuation32 7 Han (4 characters), Khitan Small Script (1 character), Nushu (1 character), Tangut (1 character)
 1 SMP U+17000..U+187FFTangut6,144 6,136 Tangut
 1 SMP U+18800..U+18AFFTangut Components768 768 Tangut
 1 SMP U+18B00..U+18CFFKhitan Small Script512 471 Khitan Small Script
 1 SMP U+18D00..U+18D7FTangut Supplement128 9 Tangut
 1 SMP U+1AFF0..U+1AFFFKana Extended-B16 13 Katakana
 1 SMP U+1B000..U+1B0FFKana Supplement256 256 Hiragana (255 characters), Katakana (1 character)
 1 SMP U+1B100..U+1B12FKana Extended-A48 35 Hiragana (32 characters), Katakana (3 characters)
 1 SMP U+1B130..U+1B16FSmall Kana Extension64 9 Hiragana (4 characters), Katakana (5 characters)
 1 SMP U+1B170..U+1B2FFNushu400 396 Nüshu
 1 SMP U+1BC00..U+1BC9FDuployan160 143 Duployan
 1 SMP U+1BCA0..U+1BCAFShorthand Format Controls16 4 Common
 1 SMP U+1CC00..U+1CEBFSymbols for Legacy Computing Supplement704 686 Common
 1 SMP U+1CF00..U+1CFCFZnamenny Musical Notation208 185 Common (116 characters), Inherited (69 characters)
 1 SMP U+1D000..U+1D0FFByzantine Musical Symbols256 246 Common
 1 SMP U+1D100..U+1D1FFMusical Symbols256 233 Common (211 characters), Inherited (22 characters)
 1 SMP U+1D200..U+1D24FAncient Greek Musical Notation80 70 Greek
 1 SMP U+1D2C0..U+1D2DFKaktovik Numerals32 20 Common
 1 SMP U+1D2E0..U+1D2FFMayan Numerals32 20 Common
 1 SMP U+1D300..U+1D35FTai Xuan Jing Symbols96 87 Common
 1 SMP U+1D360..U+1D37FCounting Rod Numerals32 25 Common
 1 SMP U+1D400..U+1D7FFMathematical Alphanumeric Symbols1,024 996 Common
 1 SMP U+1D800..U+1DAAFSutton SignWriting688 672 SignWriting
 1 SMP U+1DF00..U+1DFFFLatin Extended-G256 37 Latin
 1 SMP U+1E000..U+1E02FGlagolitic Supplement48 38 Glagolitic
 1 SMP U+1E030..U+1E08FCyrillic Extended-D96 63 Cyrillic
 1 SMP U+1E100..U+1E14FNyiakeng Puachue Hmong80 71 Nyiakeng Puachue Hmong
 1 SMP U+1E290..U+1E2BFToto48 31 Toto
 1 SMP U+1E2C0..U+1E2FFWancho64 59 Wancho
 1 SMP U+1E4D0..U+1E4FFNag Mundari48 42 Mundari
 1 SMP U+1E5D0..U+1E5FFOl Onal48 44 Ol Onal
 1 SMP U+1E7E0..U+1E7FFEthiopic Extended-B32 28 Ethiopic
 1 SMP U+1E800..U+1E8DFMende Kikakui224 213 Mende Kikakui
 1 SMP U+1E900..U+1E95FAdlam96 88 Adlam
 1 SMP U+1EC70..U+1ECBFIndic Siyaq Numbers80 68 Common
 1 SMP U+1ED00..U+1ED4FOttoman Siyaq Numbers80 61 Common
 1 SMP U+1EE00..U+1EEFFArabic Mathematical Alphabetic Symbols256 143 Arabic
 1 SMP U+1F000..U+1F02FMahjong Tiles48 44 Common
 1 SMP U+1F030..U+1F09FDomino Tiles112 100 Common
 1 SMP U+1F0A0..U+1F0FFPlaying Cards96 82 Common
 1 SMP U+1F100..U+1F1FFEnclosed Alphanumeric Supplement256 200 Common
 1 SMP U+1F200..U+1F2FFEnclosed Ideographic Supplement256 64 Hiragana (1 character), Common (63 characters)
 1 SMP U+1F300..U+1F5FFMiscellaneous Symbols and Pictographs768 768 Common
 1 SMP U+1F600..U+1F64FEmoticons80 80 Common
 1 SMP U+1F650..U+1F67FOrnamental Dingbats48 48 Common
 1 SMP U+1F680..U+1F6FFTransport and Map Symbols128 118 Common
 1 SMP U+1F700..U+1F77FAlchemical Symbols128 124 Common
 1 SMP U+1F780..U+1F7FFGeometric Shapes Extended128 103 Common
 1 SMP U+1F800..U+1F8FFSupplemental Arrows-C256 162 Common
 1 SMP U+1F900..U+1F9FFSupplemental Symbols and Pictographs256 256 Common
 1 SMP U+1FA00..U+1FA6FChess Symbols112 98 Common
 1 SMP U+1FA70..U+1FAFFSymbols and Pictographs Extended-A144 114 Common
 1 SMP U+1FB00..U+1FBFFSymbols for Legacy Computing256 249 Common
 2 SIPU+20000..U+2A6DFCJK Unified Ideographs Extension B42,720 42,720 Han
 2 SIP U+2A700..U+2B73FCJK Unified Ideographs Extension C4,160 4,154 Han
 2 SIP U+2B740..U+2B81FCJK Unified Ideographs Extension D224 222 Han
 2 SIP U+2B820..U+2CEAFCJK Unified Ideographs Extension E5,776 5,762 Han
 2 SIP U+2CEB0..U+2EBEFCJK Unified Ideographs Extension F7,488 7,473 Han
 2 SIP U+2EBF0..U+2EE5FCJK Unified Ideographs Extension I624 622 Han
 2 SIP U+2F800..U+2FA1FCJK Compatibility Ideographs Supplement544 542 Han
 3 TIPU+30000..U+3134FCJK Unified Ideographs Extension G4,944 4,939 Han
 3 TIP U+31350..U+323AFCJK Unified Ideographs Extension H4,192 4,192 Han
14 SSPU+E0000..U+E007FTags128 97 Common
14 SSP U+E0100..U+E01EFVariation Selectors Supplement240 240 Inherited
15 PUA-AU+F0000..U+FFFFFSupplementary Private Use Area-A65,536 65,534 Unknown
16 PUA-BU+100000..U+10FFFFSupplementary Private Use Area-B65,536 65,534 Unknown
  1. ^Code point count includes unassigned code points: noncharacter, reserved etc.
  2. ^The script has one or multiple characters in the block, as defined by the Script Property. This is independent of the block name
  3. ^"Common" and "Unknown" (Zyyy) and "Inherited" (Zinh or Qaai) refer to Scripts in ISO 15924
  4. ^Unicode Blocks data file. As of Unicode version 16.0
  5. ^UAX 24: Unicode Script Property (4 alpha code)
  6. ^UAX 24: Script data file
  7. ^Called "C0 Controls and Basic Latin" in ISO/IEC 10646
  8. ^Called "C1 Controls and Latin-1 Supplement" in ISO/IEC 10646

Moved blocks

[edit]

The Unicode Stability Policy requires that a character, once assigned, may not be moved or removed, although it may be deprecated. This applies to Unicode 2.0 and all subsequent versions.

Prior to this, the following former blocks were moved:

Former Unicode blocks from before Unicode 2.0
Block range Historical
block name
Version when added Version when removed Range now occupied by Superseded by block Code points Assigned characters Scripts
U+1000..U+105F Tibetan[5]1.0.0 1.0.1 MyanmarTibetan96 71 Tibetan
U+3400..U+3D2D Hangul[6]1.0.0 2.0 CJK Unified Ideographs Extension AHangul Syllables2350 2350 Hangul
U+3D2E..U+44B7 Hangul Supplementary-A[6]1.1 2.0 1930 1930
U+44B8..U+4DFF Hangul Supplementary-B[6]CJK Unified Ideographs Extension A and Yijing Hexagram Symbols2376 2376

References

[edit]
  1. ^ abcd"Unicode Blocks data file, Unicode version 15.1". Unicode Consortium. Retrieved 2023-09-12.
  2. ^"Glossary". www.unicode.org. Retrieved 2022-08-07.
  3. ^"Private-Use Characters, Noncharacters & Sentinels FAQ". www.unicode.org. Retrieved 2023-07-24.
  4. ^"Unicode Core Specification, Chapter 4: Character Properties"(PDF). Retrieved 2021-09-15.
  5. ^"3.8: Block-by-Block Charts"(PDF). The Unicode Standard. Version 1.0. Unicode Consortium.
  6. ^ abc"Appendix E: Block Names"(PDF). The Unicode Standard. Version 1.1. Unicode Consortium.
[edit]
close