Unicode block

In Unicode, a block is defined as one contiguous range of code points. Blocks are named uniquely and have no overlap. They have a starting code point of the form hhh0 and an ending code point of the form hhhF. A block explicitly can include code points that are unassigned and non-characters.[1] Code points not belonging to any of the named blocks, e.g. in the unassigned planes 3–13, have the value block="No_block".

Conversely, every assigned code point has a property "Block name", which identifies the unique block containing the character. This is determined by the code point only, although a block name will have a descriptive nature: "Tibetan" or "Supplemental Arrows-A".

Subdivisions, such as "Chess symbols" in the block Miscellaneous symbols, are not a "block". The subgroup name is an informative editorial addition only.

The number of code points in a Unicode block is a multiple of 16. Unicode blocks range in size from the minimum of 16 to a maximum of 65,536 code points.

Unicode 11.0 defines 291 blocks:[2]

  • 163 in plane 0, the Basic Multilingual Plane (BMP)
  • 118 in plane 1, the Supplementary Multilingual Plane (SMP)
  • 6 in plane 2, the Supplementary Ideographic Plane (SIP)
  • 2 in plane 14 (E in hexadecimal), the Supplementary Special-purpose Plane (SSP)
  • One each in planes 15 (Fhex) and 16 (10hex), called Supplementary Private Use Area-A and -B
Unicode blocks and contained scripts
Plane Block range Block name Code points[lower-alpha 1] Assigned characters Scripts[lower-alpha 2][lower-alpha 3][lower-alpha 4][lower-alpha 5][lower-alpha 6]
 
0 BMPU+0000..U+007FBasic Latin[lower-alpha 7]128128Latin (52 characters), Common (76 characters)
U+0080..U+00FFLatin-1 Supplement[lower-alpha 8]128128Latin (64 characters), Common (64 characters)
U+0100..U+017FLatin Extended-A128128Latin
U+0180..U+024FLatin Extended-B208208Latin
U+0250..U+02AFIPA Extensions9696Latin
U+02B0..U+02FFSpacing Modifier Letters8080Bopomofo (2 characters), Latin (14 characters), Common (64 characters)
U+0300..U+036FCombining Diacritical Marks112112Inherited
U+0370..U+03FFGreek and Coptic144135Coptic (14 characters), Greek (117 characters), Common (4 characters)
U+0400..U+04FFCyrillic256256Cyrillic (254 characters), Inherited (2 characters)
U+0500..U+052FCyrillic Supplement4848Cyrillic
0 BMPU+0530..U+058FArmenian9691Armenian (90 characters), Common (1 character)
U+0590..U+05FFHebrew11288Hebrew
U+0600..U+06FFArabic256255Arabic (237 characters), Common (6 characters), Inherited (12 characters)
U+0700..U+074FSyriac8077Syriac
U+0750..U+077FArabic Supplement4848Arabic
U+0780..U+07BFThaana6450Thaana
U+07C0..U+07FFNKo6462Nko
U+0800..U+083FSamaritan6461Samaritan
U+0840..U+085FMandaic3229Mandaic
U+0860..U+086FSyriac Supplement1611Syriac
0 BMPU+08A0..U+08FFArabic Extended-A9674Arabic (73 characters), Common (1 character)
U+0900..U+097FDevanagari128128Devanagari (124 characters), Common (2 characters), Inherited (2 characters)
U+0980..U+09FFBengali12896Bengali
U+0A00..U+0A7FGurmukhi12880Gurmukhi
U+0A80..U+0AFFGujarati12891Gujarati
U+0B00..U+0B7FOriya12890Oriya
U+0B80..U+0BFFTamil12872Tamil
U+0C00..U+0C7FTelugu12897Telugu
U+0C80..U+0CFFKannada12889Kannada
U+0D00..U+0D7FMalayalam128117Malayalam
0 BMPU+0D80..U+0DFFSinhala12890Sinhala
U+0E00..U+0E7FThai12887Thai (86 characters), Common (1 character)
U+0E80..U+0EFFLao12867Lao
U+0F00..U+0FFFTibetan256211Tibetan (207 characters), Common (4 characters)
U+1000..U+109FMyanmar160160Myanmar
U+10A0..U+10FFGeorgian9688Georgian (87 characters), Common (1 character)
U+1100..U+11FFHangul Jamo256256Hangul
U+1200..U+137FEthiopic384358Ethiopic
U+1380..U+139FEthiopic Supplement3226Ethiopic
U+13A0..U+13FFCherokee9692Cherokee
0 BMPU+1400..U+167FUnified Canadian Aboriginal Syllabics640640Canadian Aboriginal
U+1680..U+169FOgham3229Ogham
U+16A0..U+16FFRunic9689Runic (86 characters), Common (3 characters)
U+1700..U+171FTagalog3220Tagalog
U+1720..U+173FHanunoo3223Hanunoo (21 characters), Common (2 characters)
U+1740..U+175FBuhid3220Buhid
U+1760..U+177FTagbanwa3218Tagbanwa
U+1780..U+17FFKhmer128114Khmer
U+1800..U+18AFMongolian176157Mongolian (154 characters), Common (3 characters)
U+18B0..U+18FFUnified Canadian Aboriginal Syllabics Extended8070Canadian Aboriginal
0 BMPU+1900..U+194FLimbu8068Limbu
U+1950..U+197FTai Le4835Tai Le
U+1980..U+19DFNew Tai Lue9683New Tai Lue
U+19E0..U+19FFKhmer Symbols3232Khmer
U+1A00..U+1A1FBuginese3230Buginese
U+1A20..U+1AAFTai Tham144127Tai Tham
U+1AB0..U+1AFFCombining Diacritical Marks Extended8015Inherited
U+1B00..U+1B7FBalinese128121Balinese
U+1B80..U+1BBFSundanese6464Sundanese
U+1BC0..U+1BFFBatak6456Batak
0 BMPU+1C00..U+1C4FLepcha8074Lepcha
U+1C50..U+1C7FOl Chiki4848Ol Chiki
U+1C80..U+1C8FCyrillic Extended-C169Cyrillic
U+1C90..U+1CBFGeorgian Extended4846Georgian
U+1CC0..U+1CCFSundanese Supplement168Sundanese
U+1CD0..U+1CFFVedic Extensions4842Common (15 characters), Inherited (27 characters)
U+1D00..U+1D7FPhonetic Extensions128128Cyrillic (2 characters), Greek (15 characters), Latin (111 characters)
U+1D80..U+1DBFPhonetic Extensions Supplement6464Greek (1 character), Latin (63 characters)
U+1DC0..U+1DFFCombining Diacritical Marks Supplement6463Inherited
U+1E00..U+1EFFLatin Extended Additional256256Latin
0 BMPU+1F00..U+1FFFGreek Extended256233Greek
U+2000..U+206FGeneral Punctuation112111Common (109 characters), Inherited (2 characters)
U+2070..U+209FSuperscripts and Subscripts4842Latin (15 characters), Common (27 characters)
U+20A0..U+20CFCurrency Symbols4832Common
U+20D0..U+20FFCombining Diacritical Marks for Symbols4833Inherited
U+2100..U+214FLetterlike Symbols8080Greek (1 character), Latin (4 characters), Common (75 characters)
U+2150..U+218FNumber Forms6460Latin (41 characters), Common (19 characters)
U+2190..U+21FFArrows112112Common
U+2200..U+22FFMathematical Operators256256Common
U+2300..U+23FFMiscellaneous Technical256256Common
0 BMPU+2400..U+243FControl Pictures6439Common
U+2440..U+245FOptical Character Recognition3211Common
U+2460..U+24FFEnclosed Alphanumerics160160Common
U+2500..U+257FBox Drawing128128Common
U+2580..U+259FBlock Elements3232Common
U+25A0..U+25FFGeometric Shapes9696Common
U+2600..U+26FFMiscellaneous Symbols256256Common
U+2700..U+27BFDingbats192192Common
U+27C0..U+27EFMiscellaneous Mathematical Symbols-A4848Common
U+27F0..U+27FFSupplemental Arrows-A1616Common
0 BMPU+2800..U+28FFBraille Patterns256256Braille
U+2900..U+297FSupplemental Arrows-B128128Common
U+2980..U+29FFMiscellaneous Mathematical Symbols-B128128Common
U+2A00..U+2AFFSupplemental Mathematical Operators256256Common
U+2B00..U+2BFFMiscellaneous Symbols and Arrows256250Common
U+2C00..U+2C5FGlagolitic9694Glagolitic
U+2C60..U+2C7FLatin Extended-C3232Latin
U+2C80..U+2CFFCoptic128123Coptic
U+2D00..U+2D2FGeorgian Supplement4840Georgian
U+2D30..U+2D7FTifinagh8059Tifinagh
0 BMPU+2D80..U+2DDFEthiopic Extended9679Ethiopic
U+2DE0..U+2DFFCyrillic Extended-A3232Cyrillic
U+2E00..U+2E7FSupplemental Punctuation12879Common
U+2E80..U+2EFFCJK Radicals Supplement128115Han
U+2F00..U+2FDFKangxi Radicals224214Han
U+2FF0..U+2FFFIdeographic Description Characters1612Common
U+3000..U+303FCJK Symbols and Punctuation6464Han (15 characters), Hangul (2 characters), Common (43 characters), Inherited (4 characters)
U+3040..U+309FHiragana9693Hiragana (89 characters), Common (2 characters), Inherited (2 characters)
U+30A0..U+30FFKatakana9696Katakana (93 characters), Common (3 characters)
U+3100..U+312FBopomofo4843Bopomofo
0 BMPU+3130..U+318FHangul Compatibility Jamo9694Hangul
U+3190..U+319FKanbun1616Common
U+31A0..U+31BFBopomofo Extended3227Bopomofo
U+31C0..U+31EFCJK Strokes4836Common
U+31F0..U+31FFKatakana Phonetic Extensions1616Katakana
U+3200..U+32FFEnclosed CJK Letters and Months256254Hangul (62 characters), Katakana (47 characters), Common (145 characters)
U+3300..U+33FFCJK Compatibility256256Katakana (88 characters), Common (168 characters)
U+3400..U+4DBFCJK Unified Ideographs Extension A6,5926,582Han
U+4DC0..U+4DFFYijing Hexagram Symbols6464Common
U+4E00..U+9FFFCJK Unified Ideographs20,99220,976Han
0 BMPU+A000..U+A48FYi Syllables1,1681,165Yi
U+A490..U+A4CFYi Radicals6455Yi
U+A4D0..U+A4FFLisu4848Lisu
U+A500..U+A63FVai320300Vai
U+A640..U+A69FCyrillic Extended-B9696Cyrillic
U+A6A0..U+A6FFBamum9688Bamum
U+A700..U+A71FModifier Tone Letters3232Common
U+A720..U+A7FFLatin Extended-D224163Latin (158 characters), Common (5 characters)
U+A800..U+A82FSyloti Nagri4844Syloti Nagri
U+A830..U+A83FCommon Indic Number Forms1610Common
0 BMPU+A840..U+A87FPhags-pa6456Phags Pa
U+A880..U+A8DFSaurashtra9682Saurashtra
U+A8E0..U+A8FFDevanagari Extended3232Devanagari
U+A900..U+A92FKayah Li4848Kayah Li (47 characters), Common (1 character)
U+A930..U+A95FRejang4837Rejang
U+A960..U+A97FHangul Jamo Extended-A3229Hangul
U+A980..U+A9DFJavanese9691Javanese (90 characters), Common (1 character)
U+A9E0..U+A9FFMyanmar Extended-B3231Myanmar
U+AA00..U+AA5FCham9683Cham
U+AA60..U+AA7FMyanmar Extended-A3232Myanmar
0 BMPU+AA80..U+AADFTai Viet9672Tai Viet
U+AAE0..U+AAFFMeetei Mayek Extensions3223Meetei Mayek
U+AB00..U+AB2FEthiopic Extended-A4832Ethiopic
U+AB30..U+AB6FLatin Extended-E6454Latin (52 characters), Greek (1 character), Common (1 character)
U+AB70..U+ABBFCherokee Supplement8080Cherokee
U+ABC0..U+ABFFMeetei Mayek6456Meetei Mayek
U+AC00..U+D7AFHangul Syllables11,18411,172Hangul
U+D7B0..U+D7FFHangul Jamo Extended-B8072Hangul
U+D800..U+DB7FHigh Surrogates8960Unknown
U+DB80..U+DBFFHigh Private Use Surrogates1280Unknown
0 BMPU+DC00..U+DFFFLow Surrogates1,0240Unknown
U+E000..U+F8FFPrivate Use Area6,4006,400Unknown
U+F900..U+FAFFCJK Compatibility Ideographs512472Han
U+FB00..U+FB4FAlphabetic Presentation Forms8058Armenian (5 characters), Hebrew (46 characters), Latin (7 characters)
U+FB50..U+FDFFArabic Presentation Forms-A688611Arabic (609 characters), Common (2 characters)
U+FE00..U+FE0FVariation Selectors1616Inherited
U+FE10..U+FE1FVertical Forms1610Common
U+FE20..U+FE2FCombining Half Marks1616Cyrillic (2 characters), Inherited (14 characters)
U+FE30..U+FE4FCJK Compatibility Forms3232Common
U+FE50..U+FE6FSmall Form Variants3226Common
U+FE70..U+FEFFArabic Presentation Forms-B144141Arabic (140 characters), Common (1 character)
U+FF00..U+FFEFHalfwidth and Fullwidth Forms240225Hangul (52 characters), Katakana (55 characters), Latin (52 characters), Common (66 characters)
U+FFF0..U+FFFFSpecials165Common
1 SMPU+10000..U+1007FLinear B Syllabary12888Linear B
U+10080..U+100FFLinear B Ideograms128123Linear B
U+10100..U+1013FAegean Numbers6457Common
U+10140..U+1018FAncient Greek Numbers8079Greek
U+10190..U+101CFAncient Symbols6413Greek (1 character), Common (12 characters)
U+101D0..U+101FFPhaistos Disc4846Common (45 characters), Inherited (1 character)
U+10280..U+1029FLycian3229Lycian
U+102A0..U+102DFCarian6449Carian
U+102E0..U+102FFCoptic Epact Numbers3228Common (27 characters), Inherited (1 character)
U+10300..U+1032FOld Italic4839Old Italic
1 SMPU+10330..U+1034FGothic3227Gothic
U+10350..U+1037FOld Permic4843Old Permic
U+10380..U+1039FUgaritic3231Ugaritic
U+103A0..U+103DFOld Persian6450Old Persian
U+10400..U+1044FDeseret8080Deseret
U+10450..U+1047FShavian4848Shavian
U+10480..U+104AFOsmanya4840Osmanya
U+104B0..U+104FFOsage8072Osage
U+10500..U+1052FElbasan4840Elbasan
U+10530..U+1056FCaucasian Albanian6453Caucasian Albanian
1 SMPU+10600..U+1077FLinear A384341Linear A
U+10800..U+1083FCypriot Syllabary6455Cypriot
U+10840..U+1085FImperial Aramaic3231Imperial Aramaic
U+10860..U+1087FPalmyrene3232Palmyrene
U+10880..U+108AFNabataean4840Nabataean
U+108E0..U+108FFHatran3226Hatran
U+10900..U+1091FPhoenician3229Phoenician
U+10920..U+1093FLydian3227Lydian
U+10980..U+1099FMeroitic Hieroglyphs3232Meroitic Hieroglyphs
U+109A0..U+109FFMeroitic Cursive9690Meroitic Cursive
1 SMPU+10A00..U+10A5FKharoshthi9668Kharoshthi
U+10A60..U+10A7FOld South Arabian3232Old South Arabian
U+10A80..U+10A9FOld North Arabian3232Old North Arabian
U+10AC0..U+10AFFManichaean6451Manichaean
U+10B00..U+10B3FAvestan6461Avestan
U+10B40..U+10B5FInscriptional Parthian3230Inscriptional Parthian
U+10B60..U+10B7FInscriptional Pahlavi3227Inscriptional Pahlavi
U+10B80..U+10BAFPsalter Pahlavi4829Psalter Pahlavi
U+10C00..U+10C4FOld Turkic8073Old Turkic
U+10C80..U+10CFFOld Hungarian128108Old Hungarian
1 SMPU+10D00..U+10D3FHanifi Rohingya6450Hanifi Rohingya
U+10E60..U+10E7FRumi Numeral Symbols3231Arabic
U+10F00..U+10F2FOld Sogdian4840Old Sogdian
U+10F30..U+10F6FSogdian6442Sogdian
U+11000..U+1107FBrahmi128109Brahmi
U+11080..U+110CFKaithi8067Kaithi
U+110D0..U+110FFSora Sompeng4835Sora Sompeng
U+11100..U+1114FChakma8070Chakma
U+11150..U+1117FMahajani4839Mahajani
U+11180..U+111DFSharada9694Sharada
1 SMPU+111E0..U+111FFSinhala Archaic Numbers3220Sinhala
U+11200..U+1124FKhojki8062Khojki
U+11280..U+112AFMultani4838Multani
U+112B0..U+112FFKhudawadi8069Khudawadi
U+11300..U+1137FGrantha12886Grantha (85 characters), Inherited (1 character)
U+11400..U+1147FNewa12893Newa
U+11480..U+114DFTirhuta9682Tirhuta
U+11580..U+115FFSiddham12892Siddham
U+11600..U+1165FModi9679Modi
U+11660..U+1167FMongolian Supplement3213Mongolian
1 SMPU+11680..U+116CFTakri8066Takri
U+11700..U+1173FAhom6458Ahom
U+11800..U+1184FDogra8060Dogra
U+118A0..U+118FFWarang Citi9684Warang Citi
U+11A00..U+11A4FZanabazar Square8072Zanabazar Square
U+11A50..U+11AAFSoyombo9681Soyombo
U+11AC0..U+11AFFPau Cin Hau6457Pau Cin Hau
U+11C00..U+11C6FBhaiksuki11297Bhaiksuki
U+11C70..U+11CBFMarchen8068Marchen
U+11D00..U+11D5FMasaram Gondi9675Masaram Gondi
1 SMPU+11D60..U+11DAFGunjala Gondi8063Gunjala Gondi
U+11EE0..U+11EFFMakasar3225Makasar
U+12000..U+123FFCuneiform1,024922Cuneiform
U+12400..U+1247FCuneiform Numbers and Punctuation128116Cuneiform
U+12480..U+1254FEarly Dynastic Cuneiform208196Cuneiform
U+13000..U+1342FEgyptian Hieroglyphs1,0721,071Egyptian Hieroglyphs
U+14400..U+1467FAnatolian Hieroglyphs640583Anatolian Hieroglyphs
U+16800..U+16A3FBamum Supplement576569Bamum
U+16A40..U+16A6FMro4843Mro
U+16AD0..U+16AFFBassa Vah4836Bassa Vah
1 SMPU+16B00..U+16B8FPahawh Hmong144127Pahawh Hmong
U+16E40..U+16E9FMedefaidrin9691Medefaidrin
U+16F00..U+16F9FMiao160133Miao
U+16FE0..U+16FFFIdeographic Symbols and Punctuation322Nushu (1 character), Tangut (1 character)
U+17000..U+187FFTangut6,1446,130Tangut
U+18800..U+18AFFTangut Components768755Tangut
U+1B000..U+1B0FFKana Supplement256256Hiragana (255 characters), Katakana (1 character)
U+1B100..U+1B12FKana Extended-A4831Hiragana
U+1B170..U+1B2FFNushu400396Nushu
U+1BC00..U+1BC9FDuployan160143Duployan
1 SMPU+1BCA0..U+1BCAFShorthand Format Controls164Common
U+1D000..U+1D0FFByzantine Musical Symbols256246Common
U+1D100..U+1D1FFMusical Symbols256231Common (209 characters), Inherited (22 characters)
U+1D200..U+1D24FAncient Greek Musical Notation8070Greek
U+1D2E0..U+1D2FFMayan Numerals3220Common
U+1D300..U+1D35FTai Xuan Jing Symbols9687Common
U+1D360..U+1D37FCounting Rod Numerals3225Common
U+1D400..U+1D7FFMathematical Alphanumeric Symbols1,024996Common
U+1D800..U+1DAAFSutton SignWriting688672SignWriting
U+1E000..U+1E02FGlagolitic Supplement4838Glagolitic
1 SMPU+1E800..U+1E8DFMende Kikakui224213Mende Kikakui
U+1E900..U+1E95FAdlam9687Adlam
U+1EC70..U+1ECBFIndic Siyaq Numbers8068Common
U+1EE00..U+1EEFFArabic Mathematical Alphabetic Symbols256143Arabic
U+1F000..U+1F02FMahjong Tiles4844Common
U+1F030..U+1F09FDomino Tiles112100Common
U+1F0A0..U+1F0FFPlaying Cards9682Common
U+1F100..U+1F1FFEnclosed Alphanumeric Supplement256192Common
U+1F200..U+1F2FFEnclosed Ideographic Supplement25664Hiragana (1 character), Common (63 characters)
U+1F300..U+1F5FFMiscellaneous Symbols and Pictographs768768Common
1 SMPU+1F600..U+1F64FEmoticons8080Common
U+1F650..U+1F67FOrnamental Dingbats4848Common
U+1F680..U+1F6FFTransport and Map Symbols128108Common
U+1F700..U+1F77FAlchemical Symbols128116Common
U+1F780..U+1F7FFGeometric Shapes Extended12889Common
U+1F800..U+1F8FFSupplemental Arrows-C256148Common
U+1F900..U+1F9FFSupplemental Symbols and Pictographs256213Common
U+1FA00..U+1FA6FChess Symbols11214Common
2 SIPU+20000..U+2A6DFCJK Unified Ideographs Extension B42,72042,711Han
U+2A700..U+2B73FCJK Unified Ideographs Extension C4,1604,149Han
U+2B740..U+2B81FCJK Unified Ideographs Extension D224222Han
U+2B820..U+2CEAFCJK Unified Ideographs Extension E5,7765,762Han
U+2CEB0..U+2EBEFCJK Unified Ideographs Extension F7,4887,473Han
U+2F800..U+2FA1FCJK Compatibility Ideographs Supplement544542Han
14 SSPU+E0000..U+E007FTags12897Common
U+E0100..U+E01EFVariation Selectors Supplement240240Inherited
15 PUA-AU+F0000..U+FFFFFSupplementary Private Use Area-A65,53665,534Unknown
16 PUA-BU+100000..U+10FFFFSupplementary Private Use Area-B65,53665,534Unknown
  1. Code point count includes unassigned code points: non-character, reserved
  2. The script has one or multiple characters in the block, as defined by the Script Property. This is independent of the block name
  3. "Common" and "Unknown" (Zyyy) and "Inherited" (Zinh or Qaai) refer to Scripts in ISO 15924
  4. Unicode Blocks data file. As of Unicode version 11.0
  5. UAX 24: Unicode Script Property (4 alpha code)
  6. UAX 24: Script data file
  7. Called "C0 Controls and Basic Latin" in ISO/IEC 10646
  8. Called "C1 Controls and Latin-1 Supplement" in ISO/IEC 10646

See also

References

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.