Script (Unicode)

In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems.[1] Some scripts support one and only one writing system and language, for example, Armenian. Other scripts support many different writing systems; for example, the Latin script supports English, French, German, Italian, Vietnamese, Latin itself, and several other languages. Some languages have used more than one writing system and therefore more than one script; for example, Turkish was written in the Arabic script before the 20th century but switched to the Latin script in the early 20th century. More or less complementary to scripts are symbols and Unicode control characters.

The unified diacritical characters and unified punctuation characters frequently have the "common" or "inherited" script property. However, the individual scripts often have their own punctuation and diacritics, so that many scripts include not only letters but also diacritic and other marks, punctuation, numerals and even their own idiosyncratic symbols and space characters.

Unicode 16.0 defines 168 separate scripts, including 99 modern scripts and 69 ancient or historic scripts.[2][3] More scripts are in the process of being encoded or have been tentatively allocated for encoding in roadmaps.[4]

Definition and classification

When multiple languages make use of the same script, there are frequently some differences, particularly in diacritics and other marks. For example, Swedish and English both use the Latin script. However, Swedish includes the character å (sometimes called a Swedish O), while English has no such character. Nor does English make use of the diacritic combining ring above for any character. In general, the languages sharing the same scripts share many of the same characters. Despite these peripheral differences in the Swedish and English writing systems, they are said to use the same Latin script. Thus, the Unicode abstraction of scripts is a basic organizing technique. The differences among different alphabets or writing systems remain and are supported through Unicode’s flexible scripts, combining marks and collation algorithms.
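The relationship between å and the combining ring above can be inspected with Python's standard `unicodedata` module. This is a minimal sketch, not part of the Unicode specification text: it decomposes the precomposed letter into its base letter and combining mark, then recomposes it.

```python
import unicodedata

# Precomposed Swedish letter å (U+00E5).
precomposed = "\u00e5"

# Canonical decomposition (NFD) splits it into a base letter plus a combining mark.
decomposed = unicodedata.normalize("NFD", precomposed)

for ch in decomposed:
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")
# U+0061 LATIN SMALL LETTER A
# U+030A COMBINING RING ABOVE

# Canonical composition (NFC) gives back the single precomposed code point.
recomposed = unicodedata.normalize("NFC", decomposed)
print(recomposed == precomposed)  # True
```

Both forms represent the same Latin-script letter; only the encoding as one or two code points differs.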

Script versus writing system

"Writing system" is sometimes treated as a synonym for "script". However, it can also refer to a specific, concrete writing system supported by a script. For example, the Vietnamese writing system is supported by the Latin script. A writing system may also cover more than one script; for example, the Japanese writing system makes use of the Han, Hiragana and Katakana scripts.
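The Japanese case can be illustrated roughly in Python. The standard library does not expose the Unicode Script property, so this sketch uses a heuristic that is an assumption of the example: it takes the first word of each character's name. Han characters are named "CJK UNIFIED IDEOGRAPH-…", so they show up under the prefix "CJK". The helper name `name_prefixes` is hypothetical.

```python
import unicodedata

def name_prefixes(text):
    """Return the first word of each character's Unicode name (a rough script hint)."""
    prefixes = set()
    for ch in text:
        name = unicodedata.name(ch, "")
        if name:
            prefixes.add(name.split()[0])
    return prefixes

# A short Japanese string mixing Han, Hiragana and Katakana characters.
sample = "日本語のテキスト"
print(sorted(name_prefixes(sample)))
# ['CJK', 'HIRAGANA', 'KATAKANA']
```

A name prefix is only a hint. Code that needs the real Script property would read it from the Unicode Character Database or from a library that exposes it.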

Most writing systems can be broadly divided into several categories: logographic, syllabic, alphabetic (or segmental), abugida, abjad and featural; however, all features of any of these may be found in any given writing system in varying proportions, often making it difficult to purely categorize a system. The term complex system is sometimes used to describe those where the admixture makes classification problematic.

Unicode supports all of these types of writing systems through its numerous scripts. Unicode also adds further properties to characters to help differentiate the various characters and the ways they behave within Unicode text-processing algorithms.

Special script property values

In addition to explicit or specific script properties, Unicode uses three special values:[5]

Common
Unicode can assign a character in the UCS to a single script only. However, many characters—those that are not part of a formal natural-language writing system or are unified across many writing systems—may be used in more than one script (for example, currency signs, symbols, numerals and punctuation marks). In these cases Unicode defines them as belonging to the "common" script (ISO 15924 code "Zyyy").
Inherited
Many diacritics and non-spacing combining characters may be applied to characters from more than one script. In these cases Unicode assigns them to the "inherited" script (ISO 15924 code Zinh), which means that they have the same script class as the base character with which they combine, and so in different contexts they may be treated as belonging to different scripts. For example, U+0308 COMBINING DIAERESIS may combine either with U+0065 e LATIN SMALL LETTER E to create a Latin ë or with U+0435 е CYRILLIC SMALL LETTER IE to create the Cyrillic ё. In the former case, it inherits the Latin script of the base character, whereas in the latter case, it inherits the Cyrillic script of the base character.
Unknown
The value of "unknown" script (ISO 15924 code Zzzz) is given to unassigned, private-use, noncharacter, and surrogate code points.
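Two of these special values can be explored with Python's standard `unicodedata` module. This is a hedged sketch: the module does not report the Script property itself, so the first part only shows the same combining diaeresis composing with a Latin and a Cyrillic base, and the second part approximates "unknown" through the general categories Cn, Co and Cs. The helper names `compose` and `looks_unknown` are hypothetical, and the Cn result depends on the Unicode version bundled with the interpreter.

```python
import unicodedata

DIAERESIS = "\u0308"  # COMBINING DIAERESIS

def compose(base):
    """Attach the diaeresis to a base letter and compose to NFC."""
    return unicodedata.normalize("NFC", base + DIAERESIS)

latin = compose("\u0065")     # base: LATIN SMALL LETTER E
cyrillic = compose("\u0435")  # base: CYRILLIC SMALL LETTER IE

print(unicodedata.name(latin))     # LATIN SMALL LETTER E WITH DIAERESIS
print(unicodedata.name(cyrillic))  # CYRILLIC SMALL LETTER IO

def looks_unknown(ch):
    """True for unassigned or noncharacter (Cn), private-use (Co), surrogate (Cs)."""
    return unicodedata.category(ch) in {"Cn", "Co", "Cs"}

print(looks_unknown("\ue000"))  # True: private-use code point
print(looks_unknown("\uffff"))  # True: noncharacter
print(looks_unknown("e"))       # False: assigned Latin letter
```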

Character categories within scripts

Unicode provides a general category property for each character, so in addition to belonging to a script, every character also has a general category. Scripts typically include letter characters: uppercase letters, lowercase letters and modifier letters. A few precomposed ligatures, such as Dz (U+01F2), are classified as titlecase letters. Such titlecase ligatures are all in the Latin and Greek scripts and are all compatibility characters, and therefore Unicode discourages their use by authors. It is unlikely that new titlecase letters will be added in the future.
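The general category of a character can be looked up with `unicodedata.category` from Python's standard library. The short sketch below prints the two-letter category code for one sample of each letter type mentioned above; the sample characters are choices made for this example.

```python
import unicodedata

# One sample character per letter category discussed above.
samples = {
    "A": "uppercase letter",
    "a": "lowercase letter",
    "\u02b0": "modifier letter",   # MODIFIER LETTER SMALL H
    "\u01f2": "titlecase letter",  # LATIN CAPITAL LETTER D WITH SMALL LETTER Z
}

for ch, label in samples.items():
    print(f"U+{ord(ch):04X} {unicodedata.category(ch)} ({label})")
# U+0041 Lu (uppercase letter)
# U+0061 Ll (lowercase letter)
# U+02B0 Lm (modifier letter)
# U+01F2 Lt (titlecase letter)
```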

Most writing systems do not differentiate between uppercase and lowercase letters. For those scripts all letters are categorized as "other letter" or "modifier letter". Ideographs such as Unihan ideographs are also categorized as "other letters". A few scripts do differentiate between uppercase and lowercase, however, including Latin, Cyrillic, Greek, Armenian, Georgian, and Deseret. Even in these scripts, some letters are neither uppercase nor lowercase.

Scripts can also contain any other general category character such as marks (diacritic and otherwise), numbers (numerals), punctuation, separators (word separators such as spaces), symbols and non-graphical format characters. These are included in a particular script when they are unique to that script. Other such characters are generally unified and included in the punctuation or diacritic blocks. However, the bulk of characters in any script (other than the common and inherited scripts) are letters.
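The claim that letters dominate a script can be checked approximately in Python. This sketch rests on a loud assumption: it treats the Armenian block (U+0530 to U+058F) as a stand-in for the Armenian script, because the standard library exposes general categories but not the Script property. The exact counts depend on the Unicode version bundled with the interpreter, so none are shown.

```python
import unicodedata
from collections import Counter

# Assumption: the Armenian block U+0530..U+058F approximates the Armenian script.
ARMENIAN_BLOCK = range(0x0530, 0x0590)

def major_category_counts(code_points):
    """Count assigned characters by the first letter of their general category."""
    counts = Counter()
    for cp in code_points:
        category = unicodedata.category(chr(cp))
        if category != "Cn":  # skip unassigned code points
            counts[category[0]] += 1
    return counts

counts = major_category_counts(ARMENIAN_BLOCK)
print(counts)  # keys are major classes such as 'L' (letter), 'P' (punctuation), 'S' (symbol)
print(counts["L"] > sum(counts.values()) / 2)  # letters form the majority
```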

List of encoded scripts

As of version 16.0, Unicode defines 168 scripts (called "Alias" or "Property value alias") based on the ISO 15924 list. In addition, Unicode assigns the name "Common" to ISO 15924's Zyyy code for undetermined scripts, "Inherited" to ISO 15924's Zinh code for inherited scripts, and "Unknown" to ISO 15924's Zzzz code for uncoded scripts. Some script codes defined by ISO 15924 are not used in Unicode, including Zsym (Symbols) and Zmth (Mathematical notation).
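The correspondence between ISO 15924 codes and Unicode aliases is a simple lookup. The sketch below copies a handful of pairs from this article into a dictionary. The names `ISO_15924_TO_UNICODE_ALIAS` and `unicode_alias` are hypothetical, and a real application would load the full mapping from the published data files.

```python
# A few ISO 15924 codes and their Unicode aliases, copied from this article.
# None marks codes that ISO 15924 defines but Unicode does not use as scripts.
ISO_15924_TO_UNICODE_ALIAS = {
    "Latn": "Latin",
    "Cyrl": "Cyrillic",
    "Armn": "Armenian",
    "Zyyy": "Common",
    "Zinh": "Inherited",
    "Zzzz": "Unknown",
    "Zsym": None,
    "Zmth": None,
}

def unicode_alias(iso_code):
    """Return the Unicode alias for an ISO 15924 code, or None if there is none here."""
    return ISO_15924_TO_UNICODE_ALIAS.get(iso_code)

print(unicode_alias("Zinh"))  # Inherited
print(unicode_alias("Zsym"))  # None
```

In this sketch a code missing from the dictionary also returns None, so it cannot tell "not used in Unicode" apart from "not listed here".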

The first four columns come from ISO 15924; the remaining columns describe the script in Unicode.[e] A hyphen marks an empty cell.

Code | ISO number | ISO formal name | Directionality | Unicode Alias[f] | Version | Characters | Notes | Description
Adlm | 166 | Adlam | right-to-left | Adlam | 9.0 | 88 | - | Ch 19.9
Afak | 439 | Afaka | varies | - | - | - | Not in Unicode, proposal is explored[i] | -
Aghb | 239 | Caucasian Albanian | left-to-right | Caucasian Albanian | 7.0 | 53 | Ancient/historic | Ch 8.11
Ahom | 338 | Ahom, Tai Ahom | left-to-right | Ahom | 8.0 | 65 | Ancient/historic | Ch 15.16
Arab | 160 | Arabic | right-to-left | Arabic | 1.0 | 1,373 | - | Ch 9.2
Aran | 161 | Arabic (Nastaliq variant) | mixed | - | - | - | Typographic variant of Arabic (see § Arab) | -
Armi | 124 | Imperial Aramaic | right-to-left | Imperial Aramaic | 5.2 | 31 | Ancient/historic | Ch 10.4
Armn | 230 | Armenian | left-to-right | Armenian | 1.0 | 96 | - | Ch 7.6
Avst | 134 | Avestan | right-to-left | Avestan | 5.2 | 61 | Ancient/historic | Ch 10.7
Bali | 360 | Balinese | left-to-right | Balinese | 5.0 | 127 | - | Ch 17.3
Bamu | 435 | Bamum | left-to-right | Bamum | 5.2 | 657 | - | Ch 19.6
Bass | 259 | Bassa Vah | left-to-right | Bassa Vah | 7.0 | 36 | Ancient/historic | Ch 19.7
Batk | 365 | Batak | left-to-right | Batak | 6.0 | 56 | - | Ch 17.6
Beng | 325 | Bengali (Bangla) | left-to-right | Bengali | 1.0 | 96 | - | Ch 12.2
Berf | 258 | Beria Erfe | left-to-right | - | - | - | Not in Unicode | -
Bhks | 334 | Bhaiksuki | left-to-right | Bhaiksuki | 9.0 | 97 | Ancient/historic | Ch 14.3
Blis | 550 | Blissymbols | varies | - | - | - | Not in Unicode, proposal is explored[i] | -
Bopo | 285 | Bopomofo | left-to-right, right-to-left | Bopomofo | 1.0 | 77 | - | Ch 18.3
Brah | 300 | Brahmi | left-to-right | Brahmi | 6.0 | 115 | Ancient/historic | Ch 14.1
Brai | 570 | Braille | left-to-right | Braille | 3.0 | 256 | - | Ch 21.1
Bugi | 367 | Buginese | left-to-right | Buginese | 4.1 | 30 | - | Ch 17.2
Buhd | 372 | Buhid | left-to-right | Buhid | 3.2 | 20 | - | Ch 17.1
Cakm | 349 | Chakma | left-to-right | Chakma | 6.1 | 71 | - | Ch 13.11
Cans | 440 | Unified Canadian Aboriginal Syllabics | left-to-right | Canadian Aboriginal | 3.0 | 726 | - | Ch 20.2
Cari | 201 | Carian | left-to-right, right-to-left | Carian | 5.1 | 49 | Ancient/historic | Ch 8.5
Cham | 358 | Cham | left-to-right | Cham | 5.1 | 83 | - | Ch 16.10
Cher | 445 | Cherokee | left-to-right | Cherokee | 3.0 | 172 | - | Ch 20.1
Chis | 298 | Chisoi | left-to-right | - | - | - | Not in Unicode, proposal is mature[ii] | -
Chrs | 109 | Chorasmian | right-to-left, top-to-bottom | Chorasmian | 13.0 | 28 | Ancient/historic | Ch 10.8
Cirt | 291 | Cirth | varies | - | - | - | Not in Unicode | -
Copt | 204 | Coptic | left-to-right | Coptic | 1.0 | 137 | Ancient/historic, disunified from Greek in 4.1 | Ch 7.3
Cpmn | 402 | Cypro-Minoan | left-to-right | Cypro Minoan | 14.0 | 99 | Ancient/historic | Ch 8.4
Cprt | 403 | Cypriot syllabary | right-to-left | Cypriot | 4.0 | 55 | Ancient/historic | Ch 8.3
Cyrl | 220 | Cyrillic | left-to-right | Cyrillic | 1.0 | 508 | Includes typographic variant Old Church Slavonic (see § Cyrs) | Ch 7.4
Cyrs | 221 | Cyrillic (Old Church Slavonic variant) | varies | - | - | - | Typographic variant of Cyrillic (see § Cyrl); Ancient/historic | -
Deva | 315 | Devanagari (Nagari) | left-to-right | Devanagari | 1.0 | 164 | - | Ch 12.1
Diak | 342 | Dives Akuru | left-to-right | Dives Akuru | 13.0 | 72 | Ancient/historic | Ch 15.15
Dogr | 328 | Dogra | left-to-right | Dogra | 11.0 | 60 | Ancient/historic | Ch 15.18
Dsrt | 250 | Deseret (Mormon) | left-to-right | Deseret | 3.1 | 80 | - | Ch 20.4
Dupl | 755 | Duployan shorthand, Duployan stenography | left-to-right | Duployan | 7.0 | 143 | - | Ch 21.6
Egyd | 070 | Egyptian demotic | mixed | - | - | - | Not in Unicode | -
Egyh | 060 | Egyptian hieratic | mixed | - | - | - | Not in Unicode | -
Egyp | 050 | Egyptian hieroglyphs | right-to-left, left-to-right, bottom-to-top, top-to-bottom | Egyptian Hieroglyphs | 5.2 | 5,105 | Ancient/historic | Ch 11.4
Elba | 226 | Elbasan | left-to-right | Elbasan | 7.0 | 40 | Ancient/historic | Ch 8.10
Elym | 128 | Elymaic | right-to-left | Elymaic | 12.0 | 23 | Ancient/historic | Ch 10.9
Ethi | 430 | Ethiopic (Geʻez) | left-to-right | Ethiopic | 3.0 | 523 | - | Ch 19.1
Gara | 164 | Garay | right-to-left | Garay | 16.0 | 69 | - | -
Geok | 241 | Khutsuri (Asomtavruli and Nuskhuri) | left-to-right | Georgian | - | - | Unicode groups Khutsori, Asomtavruli and Nuskhuri into 'Georgian' (see § Geok). Similarly, Mkhedruli and Mtavruli are 'Georgian' (see § Geor) | Ch 7.7
Geor | 240 | Georgian (Mkhedruli and Mtavruli) | left-to-right | Georgian | 1.0 | 173 | In Unicode this also includes Nuskhuri (Geok) | Ch 7.7
Glag | 225 | Glagolitic | left-to-right | Glagolitic | 4.1 | 134 | Ancient/historic | Ch 7.5
Gong | 312 | Gunjala Gondi | left-to-right | Gunjala Gondi | 11.0 | 63 | - | Ch 13.15
Gonm | 313 | Masaram Gondi | left-to-right | Masaram Gondi | 10.0 | 75 | - | Ch 13.14
Goth | 206 | Gothic | left-to-right | Gothic | 3.1 | 27 | Ancient/historic | Ch 8.9
Gran | 343 | Grantha | left-to-right | Grantha | 7.0 | 85 | Ancient/historic | Ch 15.14
Grek | 200 | Greek | left-to-right | Greek | 1.0 | 518 | Directionality sometimes as boustrophedon | Ch 7.2
Gujr | 320 | Gujarati | left-to-right | Gujarati | 1.0 | 91 | - | Ch 12.4
Gukh | 397 | Gurung Khema | left-to-right | Gurung Khema | 16.0 | 58 | - | -
Guru | 310 | Gurmukhi | left-to-right | Gurmukhi | 1.0 | 80 | - | Ch 12.3
Hanb | 503 | Han with Bopomofo (alias for Han + Bopomofo) | mixed | - | - | - | See § Hani, § Bopo | -
Hang | 286 | Hangul (Hangŭl, Hangeul) | left-to-right, vertical right-to-left | Hangul | 1.0 | 11,739 | Hangul syllables relocated in 2.0 | Ch 18.6
Hani | 500 | Han (Hanzi, Kanji, Hanja) | top-to-bottom, columns right-to-left (historically) | Han | 1.0 | 99,030 | - | Ch 18.1
Hano | 371 | Hanunoo (Hanunóo) | left-to-right, bottom-to-top | Hanunoo | 3.2 | 21 | - | Ch 17.1
Hans | 501 | Han (Simplified variant) | varies | - | - | - | Subset of Han (Hanzi, Kanji, Hanja) (see § Hani) | -
Hant | 502 | Han (Traditional variant) | varies | - | - | - | Subset of § Hani | -
Hatr | 127 | Hatran | right-to-left | Hatran | 8.0 | 26 | Ancient/historic | Ch 10.12
Hebr | 125 | Hebrew | right-to-left | Hebrew | 1.0 | 134 | - | Ch 9.1
Hira | 410 | Hiragana | vertical right-to-left, left-to-right | Hiragana | 1.0 | 381 | - | Ch 18.4
Hluw | 080 | Anatolian Hieroglyphs (Luwian Hieroglyphs, Hittite Hieroglyphs) | left-to-right | Anatolian Hieroglyphs | 8.0 | 583 | Ancient/historic | Ch 11.6
Hmng | 450 | Pahawh Hmong | left-to-right | Pahawh Hmong | 7.0 | 127 | - | Ch 16.11
Hmnp | 451 | Nyiakeng Puachue Hmong | left-to-right | Nyiakeng Puachue Hmong | 12.0 | 71 | - | Ch 16.12
Hntl | 504 | Han (Traditional variant) with Latin (alias for Hant + Latn) | varies | - | - | - | See § Hant and § Latn | -
Hrkt | 412 | Japanese syllabaries (alias for Hiragana + Katakana) | vertical right-to-left, left-to-right | Katakana or Hiragana | - | - | See § Hira, § Kana | Ch 18.4
Hung | 176 | Old Hungarian (Hungarian Runic) | right-to-left | Old Hungarian | 8.0 | 108 | Ancient/historic | Ch 8.8
Inds | 610 | Indus (Harappan) | mixed | - | - | - | Not in Unicode, proposal is explored[i] | -
Ital | 210 | Old Italic (Etruscan, Oscan, etc.) | right-to-left, left-to-right | Old Italic | 3.1 | 39 | Ancient/historic | Ch 8.6
Jamo | 284 | Jamo (alias for Jamo subset of Hangul) | varies | - | - | - | Subset of § Hang | -
Java | 361 | Javanese | left-to-right | Javanese | 5.2 | 90 | - | Ch 17.4
Jpan | 413 | Japanese (alias for Han + Hiragana + Katakana) | varies | - | - | - | See § Hani, § Hira and § Kana | -
Jurc | 510 | Jurchen | left-to-right | - | - | - | Not in Unicode | -
Kali | 357 | Kayah Li | left-to-right | Kayah Li | 5.1 | 47 | - | Ch 16.9
Kana | 411 | Katakana | vertical right-to-left, left-to-right | Katakana | 1.0 | 321 | - | Ch 18.4
Kawi | 368 | Kawi | left-to-right | Kawi | 15.0 | 87 | Ancient/historic | Ch 17.9
Khar | 305 | Kharoshthi | right-to-left | Kharoshthi | 4.1 | 68 | Ancient/historic | Ch 14.2
Khmr | 355 | Khmer | left-to-right | Khmer | 3.0 | 146 | - | Ch 16.4
Khoj | 322 | Khojki | left-to-right | Khojki | 7.0 | 65 | Ancient/historic | Ch 15.7
Kitl | 505 | Khitan large script | left-to-right | - | - | - | Not in Unicode | -
Kits | 288 | Khitan small script | vertical right-to-left | Khitan Small Script | 13.0 | 472 | Ancient/historic | Ch 18.12
Knda | 345 | Kannada | left-to-right | Kannada | 1.0 | 91 | - | Ch 12.8
Kore | 287 | Korean (alias for Hangul + Han) | left-to-right | - | - | - | See § Hani, § Hang | -
Kpel | 436 | Kpelle | left-to-right | - | - | - | Not in Unicode, proposal is explored[i] | -
Krai | 396 | Kirat Rai | left-to-right | Kirat Rai | 16.0 | 58 | - | -
Kthi | 317 | Kaithi | left-to-right | Kaithi | 5.2 | 68 | Ancient/historic | Ch 15.2
Lana | 351 | Tai Tham (Lanna) | left-to-right | Tai Tham | 5.2 | 127 | - | Ch 16.7
Laoo | 356 | Lao | left-to-right | Lao | 1.0 | 83 | - | Ch 16.2
Latf | 217 | Latin (Fraktur variant) | left-to-right | - | - | - | Typographic variant of Latin (see § Latn) | -
Latg | 216 | Latin (Gaelic variant) | left-to-right | - | - | - | Typographic variant of Latin (see § Latn) | -
Latn | 215 | Latin | left-to-right | Latin | 1.0 | 1,487 | See also: Latin script in Unicode | Ch 7.1
Leke | 364 | Leke | left-to-right | - | - | - | Not in Unicode | -
Lepc | 335 | Lepcha (Róng) | left-to-right | Lepcha | 5.1 | 74 | - | Ch 13.12
Limb | 336 | Limbu | left-to-right | Limbu | 4.0 | 68 | - | Ch 13.6
Lina | 400 | Linear A | left-to-right | Linear A | 7.0 | 341 | Ancient/historic | Ch 8.1
Linb | 401 | Linear B | left-to-right | Linear B | 4.0 | 211 | Ancient/historic | Ch 8.2
Lisu | 399 | Lisu (Fraser) | left-to-right | Lisu | 5.2 | 49 | - | Ch 18.9
Loma | 437 | Loma | left-to-right | - | - | - | Not in Unicode, proposal is explored[i] | -
Lyci | 202 | Lycian | left-to-right | Lycian | 5.1 | 29 | Ancient/historic | Ch 8.5
Lydi | 116 | Lydian | right-to-left | Lydian | 5.1 | 27 | Ancient/historic | Ch 8.5
Mahj | 314 | Mahajani | left-to-right | Mahajani | 7.0 | 39 | Ancient/historic | Ch 15.6
Maka | 366 | Makasar | left-to-right | Makasar | 11.0 | 25 | Ancient/historic | Ch 17.8
Mand | 140 | Mandaic, Mandaean | right-to-left | Mandaic | 6.0 | 29 | - | Ch 9.5
Mani | 139 | Manichaean | right-to-left | Manichaean | 7.0 | 51 | Ancient/historic | Ch 10.5
Marc | 332 | Marchen | left-to-right | Marchen | 9.0 | 68 | Ancient/historic | Ch 14.5
Maya | 090 | Mayan hieroglyphs | mixed | - | - | - | Not in Unicode | -
Medf | 265 | Medefaidrin (Oberi Okaime, Oberi Ɔkaimɛ) | left-to-right | Medefaidrin | 11.0 | 91 | - | Ch 19.10
Mend | 438 | Mende Kikakui | right-to-left | Mende Kikakui | 7.0 | 213 | - | Ch 19.8
Merc | 101 | Meroitic Cursive | right-to-left | Meroitic Cursive | 6.1 | 90 | Ancient/historic | Ch 11.5
Mero | 100 | Meroitic Hieroglyphs | right-to-left | Meroitic Hieroglyphs | 6.1 | 32 | Ancient/historic | Ch 11.5
Mlym | 347 | Malayalam | left-to-right | Malayalam | 1.0 | 118 | - | Ch 12.9
Modi | 324 | Modi, Moḍī | left-to-right | Modi | 7.0 | 79 | Ancient/historic | Ch 15.12
Mong | 145 | Mongolian | vertical left-to-right, left-to-right | Mongolian | 3.0 | 168 | Mong includes Clear and Manchu scripts | Ch 13.5
Moon | 218 | Moon (Moon code, Moon script, Moon type) | mixed | - | - | - | Not in Unicode, proposal is explored[i] | -
Mroo | 264 | Mro, Mru | left-to-right | Mro | 7.0 | 43 | - | Ch 13.8
Mtei | 337 | Meitei Mayek (Meithei, Meetei) | left-to-right | Meetei Mayek | 5.2 | 79 | - | Ch 13.7
Mult | 323 | Multani | left-to-right | Multani | 8.0 | 38 | Ancient/historic | Ch 15.10
Mymr | 350 | Myanmar (Burmese) | left-to-right | Myanmar | 3.0 | 243 | - | Ch 16.3
Nagm | 295 | Nag Mundari | left-to-right | Nag Mundari | 15.0 | 42 | - | -
Nand | 311 | Nandinagari | left-to-right | Nandinagari | 12.0 | 65 | Ancient/historic | Ch 15.13
Narb | 106 | Old North Arabian (Ancient North Arabian) | right-to-left | Old North Arabian | 7.0 | 32 | Ancient/historic | Ch 10.1
Nbat | 159 | Nabataean | right-to-left | Nabataean | 7.0 | 40 | Ancient/historic | Ch 10.10
Newa | 333 | Newa, Newar, Newari, Nepāla lipi | left-to-right | Newa | 9.0 | 97 | - | Ch 13.3
Nkdb | 085 | Naxi Dongba (na²¹ɕi³³ to³³ba²¹, Nakhi Tomba) | left-to-right | - | - | - | Not in Unicode | -
Nkgb | 420 | Naxi Geba (na²¹ɕi³³ gʌ²¹ba²¹, 'Na-'Khi ²Ggŏ-¹baw, Nakhi Geba) | left-to-right | - | - | - | Not in Unicode, proposal is explored[i] | -
Nkoo | 165 | N’Ko | right-to-left | NKo | 5.0 | 62 | - | Ch 19.4
Nshu | 499 | Nüshu | vertical right-to-left | Nushu | 10.0 | 397 | - | Ch 18.8
Ogam | 212 | Ogham | bottom-to-top, left-to-right | Ogham | 3.0 | 29 | Ancient/historic | Ch 8.14
Olck | 261 | Ol Chiki (Ol Cemet’, Ol, Santali) | left-to-right | Ol Chiki | 5.1 | 48 | - | Ch 13.10
Onao | 296 | Ol Onal | left-to-right | Ol Onal | 16.0 | 44 | - | -
Orkh | 175 | Old Turkic, Orkhon Runic | right-to-left | Old Turkic | 5.2 | 73 | Ancient/historic | Ch 14.8
Orya | 327 | Oriya (Odia) | left-to-right | Oriya | 1.0 | 91 | - | Ch 12.5
Osge | 219 | Osage | left-to-right | Osage | 9.0 | 72 | - | Ch 20.3
Osma | 260 | Osmanya | left-to-right | Osmanya | 4.0 | 40 | - | Ch 19.2
Ougr | 143 | Old Uyghur | mixed | Old Uyghur | 14.0 | 26 | Ancient/historic | Ch 14.11
Palm | 126 | Palmyrene | right-to-left | Palmyrene | 7.0 | 32 | Ancient/historic | Ch 10.11
Pauc | 263 | Pau Cin Hau | left-to-right | Pau Cin Hau | 7.0 | 57 | - | Ch 16.13
Pcun | 015 | Proto-Cuneiform | left-to-right | - | - | - | Not in Unicode | -
Pelm | 016 | Proto-Elamite | left-to-right | - | - | - | Not in Unicode | -
Perm | 227 | Old Permic | left-to-right | Old Permic | 7.0 | 43 | Ancient/historic | Ch 8.13
Phag | 331 | Phags-pa | vertical left-to-right | Phags-pa | 5.0 | 56 | Ancient/historic | Ch 14.4
Phli | 131 | Inscriptional Pahlavi | right-to-left | Inscriptional Pahlavi | 5.2 | 27 | Ancient/historic | Ch 10.6
Phlp | 132 | Psalter Pahlavi | right-to-left | Psalter Pahlavi | 7.0 | 29 | Ancient/historic | Ch 10.6
Phlv | 133 | Book Pahlavi | mixed | - | - | - | Not in Unicode | -
Phnx | 115 | Phoenician | right-to-left | Phoenician | 5.0 | 29 | Ancient/historic[g] | Ch 10.3
Piqd | 293 | Klingon (KLI pIqaD) | left-to-right | - | - | - | Rejected for inclusion in Unicode[iii][iv] | -
Plrd | 282 | Miao (Pollard) | left-to-right | Miao | 6.1 | 149 | - | Ch 18.10
Prti | 130 | Inscriptional Parthian | right-to-left | Inscriptional Parthian | 5.2 | 30 | Ancient/historic | Ch 10.6
Psin | 103 | Proto-Sinaitic | mixed | - | - | - | Not in Unicode | -
Qaaa-Qabx | 900-949 | Reserved for private use (range) | - | - | - | - | Not in Unicode | -
Ranj | 303 | Ranjana | left-to-right | - | - | - | Not in Unicode | -
Rjng | 363 | Rejang (Redjang, Kaganga) | left-to-right | Rejang | 5.1 | 37 | - | Ch 17.5
Rohg | 167 | Hanifi Rohingya | right-to-left | Hanifi Rohingya | 11.0 | 50 | - | Ch 16.14
Roro | 620 | Rongorongo | mixed | - | - | - | Not in Unicode, proposal is explored[i] | -
Runr | 211 | Runic | left-to-right, boustrophedon | Runic | 3.0 | 86 | Ancient/historic | Ch 8.7
Samr | 123 | Samaritan | right-to-left, top-to-bottom | Samaritan | 5.2 | 61 | - | Ch 9.4
Sara | 292 | Sarati | mixed | - | - | - | Not in Unicode | -
Sarb | 105 | Old South Arabian | right-to-left | Old South Arabian | 5.2 | 32 | Ancient/historic | Ch 10.2
Saur | 344 | Saurashtra | left-to-right | Saurashtra | 5.1 | 82 | - | Ch 13.13
Seal | 590 | (Small) Seal | varies | - | - | - | Not in Unicode, proposal is explored[i] | -
Sgnw | 095 | SignWriting | vertical left-to-right | SignWriting | 8.0 | 672 | - | Ch 21.7
Shaw | 281 | Shavian (Shaw) | left-to-right | Shavian | 4.0 | 48 | - | Ch 8.15
Shrd | 319 | Sharada, Śāradā | left-to-right | Sharada | 6.1 | 96 | - | Ch 15.3
Shui | 530 | Shuishu | left-to-right | - | - | - | Not in Unicode | -
Sidd | 302 | Siddham, Siddhaṃ, Siddhamātṛkā | left-to-right | Siddham | 7.0 | 92 | Ancient/historic | Ch 15.5
Sidt | 180 | Sidetic | right-to-left | - | - | - | Not in Unicode, proposal is mature[ii] | -
Sind | 318 | Khudawadi, Sindhi | left-to-right | Khudawadi | 7.0 | 69 | - | Ch 15.9
Sinh | 348 | Sinhala | left-to-right | Sinhala | 3.0 | 111 | - | Ch 13.2
Sogd | 141 | Sogdian | horizontal and vertical writing in East Asian scripts, top-to-bottom | Sogdian | 11.0 | 42 | Ancient/historic | Ch 14.10
Sogo | 142 | Old Sogdian | right-to-left | Old Sogdian | 11.0 | 40 | Ancient/historic | Ch 14.9
Sora | 398 | Sora Sompeng | left-to-right | Sora Sompeng | 6.1 | 35 | - | Ch 15.17
Soyo | 329 | Soyombo | left-to-right | Soyombo | 10.0 | 83 | Ancient/historic | Ch 14.7
Sund | 362 | Sundanese | left-to-right | Sundanese | 5.1 | 72 | - | Ch 17.7
Sunu | 274 | Sunuwar | left-to-right | Sunuwar | 16.0 | 44 | - | -
Sylo | 316 | Syloti Nagri | left-to-right | Syloti Nagri | 4.1 | 45 | Ancient/historic | Ch 15.1
Syrc | 135 | Syriac | right-to-left | Syriac | 3.0 | 88 | Includes typographic variants Estrangelo (see § Syre), Western (§ Syrj), and Eastern (§ Syrn) | Ch 9.3
Syre | 138 | Syriac (Estrangelo variant) | mixed | - | - | - | Typographic variant of Syriac (see § Syrc) | -
Syrj | 137 | Syriac (Western variant) | mixed | - | - | - | Typographic variant of Syriac (see § Syrc) | -
Syrn | 136 | Syriac (Eastern variant) | mixed | - | - | - | Typographic variant of Syriac (see § Syrc) | -
Tagb | 373 | Tagbanwa | left-to-right | Tagbanwa | 3.2 | 18 | - | Ch 17.1
Takr | 321 | Takri, Ṭākrī, Ṭāṅkrī | left-to-right | Takri | 6.1 | 68 | - | Ch 15.4
Tale | 353 | Tai Le | left-to-right | Tai Le | 4.0 | 35 | - | Ch 16.5
Talu | 354 | New Tai Lue | left-to-right | New Tai Lue | 4.1 | 83 | - | Ch 16.6
Taml | 346 | Tamil | left-to-right | Tamil | 1.0 | 123 | - | Ch 12.6
Tang | 520 | Tangut | vertical right-to-left, left-to-right | Tangut | 9.0 | 6,914 | Ancient/historic | Ch 18.11
Tavt | 359 | Tai Viet | left-to-right | Tai Viet | 5.2 | 72 | - | Ch 16.8
Tayo | 380 | Tai Yo | top-to-bottom, columns right-to-left | - | - | - | Not in Unicode, proposal is mature[ii] | -
Telu | 340 | Telugu | left-to-right | Telugu | 1.0 | 100 | - | Ch 12.7
Teng | 290 | Tengwar | left-to-right | - | - | - | Not in Unicode | -
Tfng | 120 | Tifinagh (Berber) | right-to-left, left-to-right, top-to-bottom, bottom-to-top | Tifinagh | 4.1 | 59 | - | Ch 19.3
Tglg | 370 | Tagalog (Baybayin, Alibata) | left-to-right | Tagalog | 3.2 | 23 | - | Ch 17.1
Thaa | 170 | Thaana | right-to-left | Thaana | 3.0 | 50 | - | Ch 13.1
Thai | 352 | Thai | left-to-right | Thai | 1.0 | 86 | - | Ch 16.1
Tibt | 330 | Tibetan | left-to-right | Tibetan | 2.0 | 207 | Added in 1.0, removed in 1.1 and reintroduced in 2.0 | Ch 13.4
Tirh | 326 | Tirhuta | left-to-right | Tirhuta | 7.0 | 82 | - | Ch 15.11
Tnsa | 275 | Tangsa | left-to-right | Tangsa | 14.0 | 89 | - | Ch 13.18
Todr | 229 | Todhri | right-to-left | Todhri | 16.0 | 52 | Ancient/historic | -
Tols | 299 | Tolong Siki | left-to-right | - | - | - | Not in Unicode, proposal is mature[ii] | -
Toto | 294 | Toto | left-to-right | Toto | 14.0 | 31 | - | Ch 13.17
Tutg | 341 | Tulu-Tigalari | left-to-right | Tulu Tigalari | 16.0 | 80 | Ancient/historic | -
Ugar | 040 | Ugaritic | left-to-right | Ugaritic | 4.0 | 31 | Ancient/historic | Ch 11.2
Vaii | 470 | Vai | left-to-right | Vai | 5.1 | 300 | - | Ch 19.5
Visp | 280 | Visible Speech | left-to-right | - | - | - | Not in Unicode | -
Vith | 228 | Vithkuqi | left-to-right | Vithkuqi | 14.0 | 70 | Ancient/historic | Ch 8.12
Wara | 262 | Warang Citi (Varang Kshiti) | left-to-right | Warang Citi | 7.0 | 84 | - | Ch 13.9
Wcho | 283 | Wancho | left-to-right | Wancho | 12.0 | 59 | - | Ch 13.16
Wole | 480 | Woleai | mixed | - | - | - | Not in Unicode, proposal is explored[i] | -
Xpeo | 030 | Old Persian | left-to-right | Old Persian | 4.1 | 50 | Ancient/historic | Ch 11.3
Xsux | 020 | Cuneiform, Sumero-Akkadian | left-to-right | Cuneiform | 5.0 | 1,234 | Ancient/historic | Ch 11.1
Yezi | 192 | Yezidi | right-to-left | Yezidi | 13.0 | 47 | Ancient/historic | Ch 9.6
Yiii | 460 | Yi | left-to-right | Yi | 3.0 | 1,220 | - | Ch 18.7
Zanb | 339 | Zanabazar Square (Zanabazarin Dörböljin Useg, Xewtee Dörböljin Bicig, Horizontal Square Script) | left-to-right | Zanabazar Square | 10.0 | 72 | Ancient/historic | Ch 14.6
Zinh | 994 | Code for inherited script | - | Inherited | - | 657 | - | -
Zmth | 995 | Mathematical notation | - | - | - | - | Not a 'script' in Unicode | -
Zsye | 993 | Symbols (emoji variant) | - | - | - | - | Not a 'script' in Unicode | -
Zsym | 996 | Symbols | - | - | - | - | Not a 'script' in Unicode | -
Zxxx | 997 | Code for unwritten documents | - | - | - | - | Not a 'script' in Unicode | -
Zyyy | 998 | Code for undetermined script | - | Common | - | 9,053 | - | -
Zzzz | 999 | Code for uncoded script | - | Unknown | - | 959,049 | In Unicode: All other code points | -
Notes
  a. ISO 15924 publications as of 24 April 2025.
  b.
  c. ISO 15924 Changes (including Aliases for Unicode; as of 24 April 2025).
  d. Unicode version 16.0.
  e.
  f. Unicode uses the "Property Value Alias" (Alias) as the script-name. These Alias names are part of Unicode and are published informatively next to ISO 15924. An alias script name may be used in a character name: Palm, Palmyrene → U+10860 𐡠 PALMYRENE LETTER ALEPH.
  g. In Unicode, the Phoenician script is intended for the representation of text in Paleo-Hebrew, Archaic Phoenician, Phoenician, Early Aramaic, Late Phoenician cursive, Phoenician papyri, Siloam Hebrew, Hebrew seals, Ammonite, Moabite, and Punic.[v]
References
  i. "SEI List of Scripts Not Yet Encoded". Unicode Consortium. March 2023. Retrieved 2023-09-25.
  ii. "Unicode Pipeline § Code Points Provisionally Assigned for Mature Proposals". Unicode Consortium. 2023-09-12. Retrieved 2023-09-25.
  iii. Michael Everson (1997-09-18). "Proposal to encode Klingon in Plane 1 of ISO/IEC 10646-2". [dead link]
  iv. The Unicode Consortium (2001-08-14). "Approved Minutes of the UTC 87 / L2 184 Joint Meeting".
  v. "Middle East-II, Ancient Scripts" (PDF). 15.0.0. The Unicode Consortium. Retrieved 2023-09-25.

Missing scripts in Unicode

The Missing Scripts project, with contributors from the Mainz University of Applied Sciences, the Atelier national de recherche typographique (ANRT) in Nancy, and the University of California, Berkeley, has compiled a list of 131 scripts that have not yet been encoded in The Unicode Standard, out of a total of 294 recognized scripts according to the current state of research.[6]

References

  1. "Glossary". unicode.org.
  2. "Unicode Character Database: Scripts". unicode.org.
  3. "Chapter 14: Additional Ancient and Historic Scripts". The Unicode Standard, Version 15.0 (PDF). Mountain View, CA: Unicode, Inc. September 2022. ISBN 978-1-936213-32-0.
  4. "Roadmaps to Unicode". https://www.unicode.org/roadmaps/
  5. "UAX #24: Unicode Script Property". www.unicode.org.
  6. "The World's Writing Systems". www.worldswritingsystems.org. Retrieved 2024-10-04.