General Punctuation

Range: U+2000..U+206F (112 code points)
Plane: BMP
Scripts: Common (109 char.), Inherited (2 char.)
Symbol sets: Punctuation, Spaces, Format controls
Assigned: 111 code points
Unused: 1 reserved code point
Deprecated: 6 code points
Unicode version history:
  1.0.0  67 (+67)
  1.1    76 (+9)
  3.0    83 (+7)
  3.2    95 (+12)
  4.0    97 (+2)
  4.1    106 (+9)
  5.1    107 (+1)
  6.3    111 (+4)
Note: [1][2]

General Punctuation is a Unicode block containing punctuation, spacing, and formatting characters for use with all scripts and writing systems. Included are the defined-width spaces, joining formats, directional formats, smart quotes, archaic and novel punctuation such as the interrobang, and invisible mathematical operators.

Additional punctuation characters are in the Supplemental Punctuation block and scattered across dozens of other Unicode blocks.
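As a rough illustration of the block's makeup, the following sketch walks the range U+2000..U+206F and tallies general categories using Python's standard `unicodedata` module. The function name `summarize_block` is hypothetical, and the results depend on the Unicode version bundled with the Python interpreter in use, so treat it as a sketch rather than a reference tool.

```python
import unicodedata
from collections import Counter

# Range of the General Punctuation block, as given in the article.
BLOCK_START, BLOCK_END = 0x2000, 0x206F


def summarize_block(start=BLOCK_START, end=BLOCK_END):
    """Return (assigned_count, unassigned_code_points, category_counts).

    Hypothetical helper: it relies on the Unicode database shipped with
    the running Python interpreter.
    """
    categories = Counter()
    unassigned = []
    for cp in range(start, end + 1):
        category = unicodedata.category(chr(cp))
        if category == "Cn":  # Cn marks an unassigned (reserved) code point
            unassigned.append(cp)
        else:
            categories[category] += 1
    assigned = sum(categories.values())
    return assigned, unassigned, categories


assigned, unassigned, categories = summarize_block()
print("assigned:", assigned)
print("unassigned:", [f"U+{cp:04X}" for cp in unassigned])
for category, count in sorted(categories.items()):
    print(category, count)
```

Categories such as `Zs` (space separators), `Cf` (format controls), and the `P*` punctuation classes correspond to the spaces, format controls, and punctuation that the block description lists.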

Block

General Punctuation[1][2][3]
Official Unicode Consortium code chart (PDF)
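Row | Cells shown with an abbreviation label (column: label)
U+200x | 0: NQ SP, 1: MQ SP, 2: EN SP, 3: EM SP, 4: 3/M SP, 5: 4/M SP, 6: 6/M SP, 7: F SP, 8: P SP, 9: TH SP, A: H SP, B: ZW SP, C: ZW NJ, D: ZW J, E: LRM, F: RLM
U+201x | 1: NB
U+202x | 8: L SEP, 9: P SEP, A: LRE, B: RLE, C: PDF, D: LRO, E: RLO, F: NNB SP
U+203x | (no labelled cells)
U+204x | (no labelled cells)
U+205x | F: MM SP
U+206x | 0: WJ, 1: ƒ(), 2: ×, 3: ",", 4: +, 5: reserved, 6: LRI, 7: RLI, 8: FSI, 9: PDI, A: I SS, B: A SS, C: I AFS, D: A AFS, E: NA DS, F: NO DS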
Notes
1. As of Unicode version 11.0
2. Grey area indicates non-assigned code point
3. Unicode code points U+206A - U+206F are deprecated as of Unicode version 3.0
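Since U+206A..U+206F are deprecated, a text-processing pipeline may want to detect them. The sketch below is one possible way to do so in Python; the names `DEPRECATED`, `find_deprecated`, and `strip_deprecated` are hypothetical, and whether such characters should be removed or merely reported is a policy choice that the article does not prescribe.

```python
# Deprecated format characters in General Punctuation: U+206A..U+206F.
DEPRECATED = range(0x206A, 0x2070)


def find_deprecated(text):
    """Return (index, 'U+XXXX') pairs for each deprecated character in text."""
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(text)
        if ord(ch) in DEPRECATED
    ]


def strip_deprecated(text):
    """Return text with the deprecated characters removed (assumed policy)."""
    return "".join(ch for ch in text if ord(ch) not in DEPRECATED)


sample = "a\u206Ab\u206Fc"
print(find_deprecated(sample))
print(strip_deprecated(sample))
```

The neighbouring bidi isolate controls (U+2066..U+2069) are not deprecated and are deliberately left untouched by this sketch.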

Emoji

The General Punctuation block contains two emoji: U+203C and U+2049.[3][4]

The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation.[5]

Emoji variation sequences
U+ | 203C | 2049
base code point | ‼ | ⁉
base+VS15 (text) | ‼︎ | ⁉︎
base+VS16 (emoji) | ‼️ | ⁉️
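The four variation sequences can be built by appending a variation selector to each base code point. The following Python sketch shows the construction; the names `VS15`, `VS16`, `BLOCK_EMOJI`, and `variation_sequences` are hypothetical, and how a sequence is actually rendered depends on the font and platform.

```python
VS15 = "\uFE0E"  # VARIATION SELECTOR-15: requests text presentation
VS16 = "\uFE0F"  # VARIATION SELECTOR-16: requests emoji presentation

# The two emoji in the General Punctuation block, per the article.
BLOCK_EMOJI = (0x203C, 0x2049)


def variation_sequences(code_points=BLOCK_EMOJI):
    """Map each base code point to its text-style and emoji-style sequences."""
    sequences = {}
    for cp in code_points:
        base = chr(cp)
        sequences[cp] = {"text": base + VS15, "emoji": base + VS16}
    return sequences


for cp, forms in variation_sequences().items():
    for style, seq in forms.items():
        code_points = " ".join(f"U+{ord(ch):04X}" for ch in seq)
        print(f"U+{cp:04X} {style}: {code_points}")
```

Each sequence is two code points long: the base character followed by exactly one selector.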

History

The following Unicode-related documents record the purpose and process of defining specific characters in the General Punctuation block:

Version | Final code points[a] | Count | L2 ID | WG2 ID | Document
1.0.0 | U+2000..202E, 2030..203E, 2040..2044 | 67 |  |  | (to be determined)
 |  |  | L2/11-438[b][c] | N4182 | Edberg, Peter (2011-12-22), Emoji Variation Sequences (Revision of L2/11-429)
 |  |  | L2/17-086 |  | Burge, Jeremy; et al. (2017-03-27), Add ZWJ, VS-16, Keycaps & Tags to Emoji_Component
1.1 | U+203F, 2045..2046, 206A..206F | 9 |  |  | (to be determined)
3.0 | U+202F, 2048..2049 | 3 | L2/98-088 | N1711 | The Working Meeting on Mongolian Encoding Attended by Representatives of China and Mongolia, 1998-02-15
 |  |  | L2/98-104 | N1734 | Whistler, Ken (1998-03-20), Comments on the Mongolian Encoding Proposal, WG2 N1711
 |  |  | L2/98-252 | N1833 | Moore, Richard (1998-05-04), Feedback on Ken Whistler's Comments on Mongolian Encoding: N 1734
 |  |  | L2/98-251 | N1808 | Reply to "Proposal WG2 N1734" Raised at the Seattle Meeting Regarding "Proposal WG 2 N1711", 1998-07-09
 |  |  | L2/98-389R |  | Aliprand, Joan, "RESOLUTION M35.11", Consent docket re WG2 Resolutions at its Meeting #35
 |  |  | L2/99-075.1 | N1973 | Irish Comments on SC 2 N 3208, 1999-01-19
 |  |  | L2/99-075 | N1972 | Summary of Voting on SC 2 N 3208, PDAM ballot on WD for ISO/IEC 10646-1/Amd. 29: Mongolian, 1999-02-12
 |  |  | L2/99-113 |  | Text for FPDAM ballot of ISO/IEC 10646, Amd. 29 - Mongolian, 1999-04-06
 |  |  | L2/99-304 | N2126 | Paterson, Bruce (1999-10-01), Revised Text for FDAM ballot of ISO/IEC 10646-1/FDAM 29, AMENDMENT 29: Mongolian
 |  |  | L2/99-381 |  | Final text for ISO/IEC 10646-1, FDAM 29 -- Mongolian, 1999-12-07
 |  |  | L2/07-209 |  | Whistler, Ken (2007-07-05), UTR 14 and U+202F NARROW NO-BREAK SPACE
 |  |  | L2/07-225 |  | Moore, Lisa (2007-08-21), "B.11.4.1.2", UTC #112 Minutes
 |  |  | L2/11-438[b][c] | N4182 | Edberg, Peter (2011-12-22), Emoji Variation Sequences (Revision of L2/11-429)
 |  |  | L2/15-187 |  | Moore, Lisa (2015-08-11), "B.14.5", UTC #144 Minutes
 |  |  | L2/16-258 | N4752R2 | Eck, Greg (2016-09-19), Mongolian Base Forms, Positional Forms, & Variant Forms
 |  |  | L2/16-259 | N4753 | Eck, Greg; Rileke, Orlog Ou (2016-09-20), WG2 #65 Mongolian Discussion Points
 |  |  | L2/16-266 |  | Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu; Moore, Lisa (2016-09-26), "1. Mongolian", Comments on Mongolian, Small Khitan, and other WG2 #65 documents
 |  |  | L2/16-297 | N4769 | Anderson, Deborah (2016-10-27), Mongolian ad hoc report
 | U+204A..204D | 4 |  |  | (to be determined)
3.2 | U+2047, 2051 | 2 | L2/99-238 |  | Consolidated document containing 6 Japanese proposals, 1999-07-15
 |  |  |  | N2092 | Addition of forty eight characters, 1999-09-13
 |  |  | L2/99-365 |  | Moore, Lisa (1999-11-23), Comments on JCS Proposals
 |  |  | L2/00-024 |  | Shibano, Kohji (2000-01-31), JCS proposal revised
 |  |  | L2/99-260R |  | Moore, Lisa (2000-02-07), "JCS Proposals", Minutes of the UTC/L2 meeting in Mission Viejo, October 26-28, 1999
 |  |  | L2/00-098 | N2195 | Rationale for non-Kanji characters proposed by JCS committee, 2000-03-15
 |  |  | L2/00-119[d] | N2191R | Whistler, Ken; Freytag, Asmus (2000-04-19), Encoding Additional Mathematical Symbols in Unicode
 |  |  | L2/00-297 | N2257 | Sato, T. K. (2000-09-04), JIS X 0213 symbols part-1
 |  |  | L2/00-342 | N2278 | Sato, T. K.; Everson, Michael; Whistler, Ken; Freytag, Asmus (2000-09-20), Ad hoc Report on Japan feedback N2257 and N2258
 | U+204E..2050, 2057, 205F..2062 | 8 | L2/00-119[d] | N2191R | Whistler, Ken; Freytag, Asmus (2000-04-19), Encoding Additional Mathematical Symbols in Unicode
 | U+2052, 2063 | 2 | L2/01-142[d] | N2336 | Beeton, Barbara; Freytag, Asmus; Ion, Patrick (2001-04-02), Additional Mathematical Symbols
 |  |  | L2/01-156 | N2356 | Freytag, Asmus (2001-04-03), Additional Mathematical Characters (Draft 10)
 |  |  | L2/01-344 | N2353 | Umamaheswaran, V. S. (2001-09-09), Minutes from SC2/WG2 meeting #40 -- Mountain View, April 2001
4.0 | U+2053..2054 | 2 | L2/02-141 | N2419 | Everson, Michael; et al. (2002-03-20), Uralic Phonetic Alphabet characters for the UCS
 |  |  | L2/02-192 |  | Everson, Michael (2002-05-02), Everson's Reply on UPA
 |  |  |  | N2442 | Everson, Michael; Kolehmainen, Erkki I.; Ruppel, Klaas; Trosterud, Trond (2002-05-21), Justification for placing the Uralic Phonetic Alphabet in the BMP
 |  |  | L2/02-291 |  | Whistler, Ken (2002-05-31), WG2 report from Dublin
 |  |  | L2/02-292 |  | Whistler, Ken (2002-06-03), Early look at WG2 consent docket
 |  |  | L2/02-166R2 |  | Moore, Lisa (2002-08-09), "Scripts and New Characters - UPA", UTC #91 Minutes
 |  |  | L2/02-253 |  | Moore, Lisa (2002-10-21), UTC #92 Minutes
4.1 | U+2055 | 1 | L2/03-151R |  | Constable, Peter; Lloyd-Williams, James; Lloyd-Williams, Sue; Chowdhury, Shamsul Islam; Ali, Asaddar; Sadique, Mohammed; Chowdhury, Matiar Rahman (2003-05-10), Revised Proposal for Encoding Syloti Nagri Script in the BMP
 |  |  | L2/03-136 |  | Moore, Lisa (2003-08-18), "Scripts and New Characters - Syloti Nagri Script", UTC #95 Minutes
 | U+2056, 2058..2059 | 3 | L2/03-282R | N2610R | Everson, Michael; Cleminson, Ralph (2003-09-04), Final proposal for encoding the Glagolitic script in the UCS
 |  |  | L2/03-324 | N2642 | Pantelia, Maria (2003-10-06), Proposal to encode additional Greek editorial and punctuation characters in the UCS
 | U+205A..205C | 3 | L2/03-157 |  | Pantelia, Maria (2003-05-19), Additional Beta Code Characters not in Unicode (WIP)
 |  |  | L2/03-193R | N2612-7 | Pantelia, Maria (2003-06-11), Proposal to encode additional Punctuation Characters in the UCS
 | U+205D | 1 | L2/02-312R |  | Pantelia, Maria (2002-11-07), Proposal to encode additional Greek editorial and punctuation characters in the UCS
 |  |  | L2/03-324 | N2642 | Pantelia, Maria (2003-10-06), Proposal to encode additional Greek editorial and punctuation characters in the UCS
 | U+205E | 1 | L2/03-354 | N2655 | Freytag, Asmus (2003-10-10), Proposal -- Symbols used in Dictionaries
 |  |  | L2/03-356R2 |  | Moore, Lisa (2003-10-22), "97-C15", UTC #97 Minutes
5.1 | U+2064 | 1 | L2/07-011R | N3198R | Freytag, Asmus; Beeton, Barbara; Ion, Patrick; Sargent, Murray; Carlisle, David; Pournader, Roozbeh (2007-01-15), 29 Additional Mathematical and Symbol Characters
6.3 | U+2066..2069 | 4 | L2/12-186R |  | Lanin, Aharon; Davis, Mark; Pournader, Roozbeh (2012-07-24), A Proposal for Bidi Isolates in Unicode
 |  |  | L2/12-290 | N4310 | Lanin, Aharon; Davis, Mark; Pournader, Roozbeh (2012-07-31), Proposal for Four Characters for Bidi
 |  |  | L2/12-239 |  | Moore, Lisa (2012-08-14), UTC #132 Minutes
 |  |  | L2/13-040 |  | Pournader, Roozbeh; Lanin, Aharon (2013-01-29), Fasttracking Arabic Letter Mark (ALM)
 |  |  | L2/13-125 | N4447 | Constable, Peter (2013-06-10), Unicode Liaison Report to WG2
  a. Proposed code points and character names may differ from final code points and names
  b. See also L2/10-458, L2/11-414, L2/11-415, and L2/11-429
  c. Refer to the history section of the Miscellaneous Symbols and Pictographs block for additional emoji-related documents
  d. Refer to the history section of the Miscellaneous Mathematical Symbols-B block for additional math-related documents

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
  3. "UTR #51: Unicode Emoji". Unicode Consortium. 2018-05-21.
  4. "UCD: Emoji Data for UTR #51". Unicode Consortium. 2018-05-22.
  5. "UTS #51 Emoji Variation Sequences". The Unicode Consortium.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.