KPS 9566

KPS 9566
Alias(es)	ISO-IR-202 (1997 version)
Language(s)	Korean, English, Russian; Partial support:; Greek, Japanese
Standard	KPS 9566
Current status	Used only in North Korea.
Classification	ISO-2022-compatible DBCS, CJK encoding
Encoding formats	UHC-style encoding,[1] ISO 2022.
Other related encoding(s)	KS X 1001

KPS 9566 ("DPRK Standard Korean Graphic Character Set for Information Interchange")[2] is a North Korean standard specifying a character encoding for the Chosŏn'gŭl (Hangul) writing system used for the Korean language. The edition of 1997 specified an ISO 2022-compliant 94×94 two-byte coded character set. Subsequent editions have added additional encoded characters outside of the 94×94 plane, in a manner comparable to UHC or GBK.[3]

KPS 9566 differs in approach from KS X 1001, its South Korean counterpart, in using a different ordering of chosŏn'gŭl,[4] in encoding explicit vertical presentation forms of punctuation, in not encoding duplicate hanja for multiple readings, and in including several characters specific to the North Korean political system, including special encodings for the names of the country's past and present leaders (Kim Il-sung, Kim Jong-il and Kim Jong-un).[1][3][2]

Although KPS 9566 was the original source of several characters added to Unicode,[5] not all KPS 9566 characters have Unicode equivalents. Those which do not are mapped to similar Unicode characters or to the Private Use Area.[6]

Background and other standards

The ASCII character set originated in the United States in 1963, and was revised in 1967 to the form it has today.[7] ASCII also became accepted as an international standard in 1967, becoming ECMA-6,[7] designated ISO/IEC 646 by the International Organisation for Standardization.[8] It is presently designated ANSI X3.4-1986 and ISO 646:1991.[9] ASCII was a 7-bit, single-byte encoding including 94 graphical characters, the space, and 33 control codes, which provided basic support for representing American English text as a series of bytes.[7][9]

The next edition of ISO 646, published in 1972, revised the standard to introduce the concept of national versions of the code, allowing countries to replace a few less commonly used codes with their own required characters. At the same time, work on defining extension mechanisms for ASCII was underway, with the intention of being applicable to both 7-bit and 8-bit environments. This was completed in 1973 and published as JIS X 0202, ECMA-35 and ISO 2022.[10] ISO 2022 specifies mechanisms for using single-byte and multiple-byte character sets with a certain structure in both 7-bit and 8-bit environments, and for declaring and switching between them in a standard fashion using shift codes and escape sequences.[11]

Countries in East Asia, due to using large repertoires of Chinese characters, introduced standardised double-byte encodings (DBCS) for their writing systems, since the number of characters representable in a single-byte code was not sufficient. In an ISO 2022 compliant DBCS, every character can be represented with two ASCII printing character bytes; the location of a character can be referenced by these byte values, or by two numbers from 1 to 94 (a kuten), equal to the respective bytes minus 32.[12] The first ISO 2022 compliant DBCS was the first edition of JIS X 0208 (Japan), published in 1978. This was followed by GB 2312 (Mainland China) in 1980, and by Wansung code (South Korea; first designated KS C 5601-1987) in 1987.[13][14] Big5 (Taiwan), defined in 1984, did not follow the ISO 2022 structure.[13]

Although the Korean writing system includes individual symbols (jamo) for consonants and vowels, serving as an alphabet, Korean text is properly typeset with these symbols composed into blocks for each syllable. Wansung code included individual Korean syllable blocks separately, treating them as a large set of characters similarly to hanja, and was first defined by the third edition of the South Korean standard KS C 5601. The first edition had defined an encoding of individual jamo which allowed syllable blocks to be encoded as sequences, which was named N-byte Hangul, and had not been adopted as widely as intended.[15][16] When used on an 8-bit system, Wansung code was usually used with the eighth bit set, with ASCII or a similar SBCS used with the eight bit unset; this arrangement is known as EUC-KR.[17]

Wansung code did not encode all possible modern Korean syllables, only a selection of the 2350 most common,[2] although it allowed them to be specified using combining sequences, which often were not supported.[18] An alternative encoding, also South Korean, named Johab did, and served as a competitor to Wansung for some time.[15] Unified Hangul Code (UHC), introduced by Microsoft with Windows 95, extended EUC-KR, allowing the use of invalid EUC double-byte codes to represent all other syllables available in Johab.[18] A similar approach was taken by the Mainland Chinese GBK encoding, extending GB 2312 with support for Traditional Chinese and for less common Chinese characters by encoding them to double-byte codes invalid in EUC-CN.[13]

South Korea was not the only country developing an ISO 2022 DBCS for Korean: the Mainland Chinese GB 12052 was published in 1989. This was not closely related to Wansung code, and instead mostly matched GB 2312 except for the inclusion of a dollar sign in place of a yuan sign, and the inclusion of Korean syllables (and 94 hanja) replacing the Chinese characters. It may have been developed for use by the Korean minority in north-eastern China.[2]

Although North Korea and South Korea both use Korean Chosŏn'gŭl (Hangul) as their primary writing system, they use different lexicographical orders.[19] Hence, character ordering differs between Wansung code and KPS 9566.[4]

KPS 9566 has undergone several revisions, including editions of 1997 and 2003,[20] mainly to enhance compatibility with Unicode. These are commonly indicated by specifying the year (e.g. KPS 9566-97, 9566-2003). The current edition as of the release of Red Star OS 3.0 appears to be KPS 9566-2011, which adds Kim Jong-un to the list of leaders.[3] The publicly available code chart for the 1997 edition of KPS 9566 shows a ISO 2022 94×94 plane.[21] The more recent editions, from what sources of information are available outside of North Korea itself, appear to define additional allocations outside of the EUC plane (similarly to GBK or UHC).[3]

Due to the interoperability issues arising from the use of multiple national standard and platform- or font-specific proprietary character encodings, the Unicode standard was developed with the intent of allowing all representable text to be interchanged in a single, universal format. The first edition of Unicode was published in 1991 and 1992,[22] and ISO/IEC 10646 was established in sync with Unicode in 1993.[23] Unicode formats are preferred for international use on the World Wide Web, where legacy character encodings are treated as partial encodings of Unicode by means of mapping files.[24][25]

Design

In principle, KPS 9566 is similar to the Wansung character set defined by the South Korean KS X 1001 standard, although the two are not compatible. Both encode a section of punctuation, symbols, jamo, kana and alphabetical characters, followed by a subset of the possible modern chosŏn'gŭl syllables, followed by a section of hanja.[2] However, KPS 9566 uses a different ordering of jamo and syllables to conform with North Korean lexicographical ordering standards.[4] KPS 9566 also includes 28 explicitly rotated punctuation characters for vertical typography, which KS X 1001 does not, and encodes each hanja only once, whereas KS X 1001 encodes several hanja with multiple readings multiple times.[2]

KPS 9566 encodes a total of 2679 chosŏn'gŭl syllables and 4653 hanja. This provides better coverage than the 2350 syllables encoded by Wansung code: for instance, the 똠 character used in the name of 똠방각하, a noted Korean literary work, does not have an assigned Wansung codepoint, but has one (38-02) in KPS 9566.[2] The hanja section includes 4652 characters from the Unified Repertoire and Ordering and one from CJK Unified Ideographs Extension A. The entirety of row 15, the latter half of row 44 (after the syllables block) and the latter half of row 94 (after the hanja block) may be used for user-defined purposes.[2]

KPS 9566 is especially distinguished by its inclusion of several special characters from North Korean political life. Specifically, it includes the hammer, sickle and brush emblem of the Workers' Party of Korea, both uncircled and circled[6] (code points 12-01 and 12-02),[21] and two groups of three special-purpose characters which spell out the names of the North Korean leaders Kim Il-sung (김일성) and Kim Jong-il (김정일) in a special decorative font (code points 04-72 to 04-74 and 04-75 to 04-77, respectively).[26] The syllables for Kim and Il, which are identical in the spelling of both names, are encoded twice. KPS 9566-2011 additionally includes the name of Kim Jong-un (김정은) as code points 04-78 to 04-80.[3]

Due to these special characters, there is currently no full round-trip compatibility between KPS 9566 and Unicode, unless unsupported characters are mapped to the Private Use Area.[1]

KPS 10721

North Korea also developed a second character set, KPS 10721 "Code of the supplementary Korean Hanja Set for Information Interchange", which was published in 2000. KPS 10721 encodes a set of at least 19469 hanja[2] additional to those included in KPS 9566. As of 2009, these did not all have mappings to Unicode, but included 10358 from the Unified Repertoire and Ordering, 3187 from CJK Unified Ideographs Extension A and 107 from CJK Compatibility Ideographs (all in the Basic Multilingual Plane), as well as 5767 from CJK Unified Ideographs Extension B and 50 from CJK Compatibility Ideographs Supplement (in the Supplementary Ideographic Plane).[2]

Besides the mapping of these hanja to Unicode, little is known about the KPS 10721 standard outside of North Korea.[2] North Korean reference glyphs are not provided for these hanja in the Unicode code charts, due to a lack of suitable font data available to the Unicode Consortium.[27] Unicode hanja characters with KPS 9566 or KPS 10721 sources are nonetheless cross-referenced to their KPS codes in the Unihan database with the key kIRG_KPSource.[28]

Documentation and relationship to Unicode

Unicode's initial coverage of Korean syllables, added in version 1.0, was based on Wansung code. In Unicode version 2.0, a new block of Korean syllables (the present Hangul Syllables block) was added, based on Johab, and the previous block was deleted (it is now occupied by CJK Unified Ideographs Extension A). This was done under the assumption that no Unicode-encoded Korean data existed yet, but became known as the "Korean mess", and the responsible committees pledged not to make such an incompatible change in the future,[29] a pledge codified by the Unicode Stability Policy.[30]

The code chart for KPS 9566-97, published April 1997,[2] was submitted to the ISO International Register of Coded Character Sets for registration for use with ISO/IEC 2022. It was registered in June 1998 with the number ISO-IR-202. This code chart is publicly available from the Information Processing Society of Japan.[21]

In August 1999, the North Korean national body submitted a document to WG2 (ISO/IEC JTC 1/SC 2 Working Group 2), the ISO body responsible for ISO/IEC 10646, the international standard corresponding to Unicode. This document requested the addition of the KPS 9566 codes to the existing cross-references from the CJK Unified Ideographs charts, the addition of 80 symbol characters from KPS 9566 which did not have existing Unicode mappings, a resolution to the difference in collation order between KPS 9566 and Unicode (due to the order of the characters in Unicode following the South Korean encodings) and the addition of 8 combining jamo. It also requested for WG2 to edit the existing Unicode character and block names to use the term "Korean character" rather than "Hangul".[31] An expanded version of this proposal, broken into several documents, was submitted as a work item in December 1999.[32]

A detailed response was submitted by the Swedish representative in March 2000, opposing several of the points and elaborating on Sweden's vote against the proposal. This response stated that changing the encoding of the Korean characters again would cause major disruption, even more so than the first time, which was done when comparatively few implementations existed, but which in retrospect should not have been done. It explained that that few or no languages can be collated correctly by code point value, and that a tailoring for the Unicode Collation Algorithm or ISO/IEC 14651 (then being drafted) should be used for that purpose, and that normative names of characters already assigned cannot be changed, due to the stability policy, although non-normative translations to other languages can be employed. It suggested that a machine-readable mapping file between Unicode and KPS 9566 could be provided by the North Korean body itself, and would be more useful than a printed cross-reference in the standard document. Regarding the proposed additional characters, the response stated that characters which would have compatibility decompositions in Unicode should not be added and that logos, including those of political parties, and special characters for names of particular persons should not be added.[33]

In July 2000, the North Korean body wrote to WG2, accusing them of developing both versions of the Unicode encoding for Korean on the basis of South Korean proposals only, without consulting North Korea, accusing them putting the commercial interests of companies and fears of international confusion over respect to North Korea's sovereignty, and stating that North Korea would regard further refusal to change the name and order of the Korean characters in Unicode as an insult to their sovereign dignity and as compromising the ISO's claims to impartiality. They re-iterated their demand for WG2 and Unicode to "correct" the order of the Korean characters, and to "correct" the names "Hangul Jamo" and "Hangul Syllable" to "Korean Alphabet" and "Korean Syllable".[4]

In August 2000, the North Korean national body submitted a more detailed version of their requests in a series of five consecutive proposals. These requested the addition of 14 additional jamo characters,[34] the addition of 82 symbol characters,[35] and the use of the term "Korean alphabet" instead of "Hangul",[36] provided supporting evidence for the North Korean collation order,[19] and requested addition of the North Korean hanja repertoire.[37] These proposals were discussed in two meetings between North Korean, South Korean, Swedish and other WG2 representatives in September 2000, in which the North Korean body was asked to provide manuscript evidence for the additional jamo characters, to resubmit their symbols proposal with symbols which had already been accepted into Unicode removed, and to consider using ISO/IEC 14651, then at final draft stage, for collation purposes.[38]

In September 2001, the North Korean national body submitted a revised series of proposals requesting the addition of several KPS 9566 and KPS 10721 characters, including 70 symbol characters, to Unicode.[39][40] In this version of the proposal, a section of document excerpts demonstrating use of several characters and short explanations of their purpose was included. The Workers' Party of Korea symbol was named the "Hammer and Sickle and Brush",[39] renamed from "Mark of the Workers' Party of Korea" in earlier versions of the proposal,[35] and justified as being used as an identifying symbol on maps.[39] As justification for the proposed characters for leaders' names, they explained that the leaders' names often appear with a different size and font weight in North Korean publications for the purpose of emphasis.[39] A follow-up by South Korean WG2 representatives requested evidence, names in Korean and justifications for adding certain of these characters, and noted that non-emphasised versions of the characters for the leaders' names already existed.[41] A meeting of North and South Korean representatives from WG2 was convened in October 2001, which recommended 47 of the symbol characters for adding to Unicode, and suggested that the leaders' names and WPK symbols be raised for further discussion by WG2.[42]

A subsequent feedback document from February 2002 regarding the North Korean proposed additions requested that the "tea" symbol for a tea house be accepted as a more general "hot beverage" symbol, equating it with symbols used in guidebooks to denote hot or non-alcoholic beverages. It also recommended that the reference glyph for the existing codepoint for an umbrella without rain be modified to harmonise with the proposed reference glyph for the umbrella with rain, equating them to the "keep dry" symbols used on packaging, and raised the question of which lightning bolt and high voltage warning symbols in existing symbol collections could be unified with the proposed "high voltage" character.[43] All three of these characters were accepted into Unicode in version 4.0.[44] It also recommended that the horizontal-barred fractions and the left-up pointing scissors be encoded using a variation selector, since the scissors did not accompany a differently-oriented pair of scissors, and since the existing Unicode fraction codepoints unified the skewed and horizontal forms.[43]

In August 2004, a pair of mapping tables between KPS 9566-2003 and Unicode were submitted to the OpenOffice.org project by an individual using the name "ooprojlover", who stated that they represented the updated version of the KPS 9566 standard and requested that support be added.[20] These files mapped the characters unavailable in Unicode to the Private Use Area, and included additional encoded forms for other syllable blocks outside of the main ISO-IR-202 plane. A mapping table was later published by the Unicode Consortium in 2011, based on this mapping data but with errors corrected with reference to the ISO-IR chart.[1]

Copies of Red Star OS 3.0 include fonts for an more recent edition of KPS 9566, appearing to be KPS 9566-2011. The mapping table used by Red Star OS internally has been successfully extracted. Besides adding Kim Jong-un to the list of leaders, KPS 9566-2011 amends the mappings of certain vertical forms compared to the 2003 mappings (taking advantage of the Vertical Forms block added in Unicode 4.1), and also includes several additional hanja and symbols encoded outside of the ISO-IR-202 plane. Several of these additional symbols are also mapped to the Private Use Area; however, their identity is not known, since no names or reference glyphs for those characters are known outside of North Korea.[3]

Impact on Unicode today

Several current Unicode characters were added to Unicode 4.0 as a result of the North Korean proposals, although not always at the original proposed codepoints. These include HOT BEVERAGE (☕, proposed as TEA SYMBOL), which was proposed as a map symbol for marking a tea house, and the flag symbols WHITE FLAG (⚐) and BLACK FLAG (⚑), which were proposed as map symbols for sites of battles and military victories.[5] These characters were proposed for the provisional code points U+270A, U+268E and U+268F respectively,[42] but encoded at the final code points U+2615, U+2690 and U+2691 respectively.[45] They also include a series of directional bold arrows in the range U+2B05 through U+2B0D,[42] excluding a rightward arrow, which was mapped to an existing character in the Dingbats block,[46] which were added at the same code points they were proposed for, besides the north-east and north-west arrows being swapped compared to the proposal.[47]

Other pictographic characters which were included in the North Korean proposal include the umbrella with raindrops (☔), the lightning bolt for high voltage (⚡) and the warning triangle (⚠), which were proposed for the provisional code points U+2618, U+267F and U+267E respectively.[42] Following some discussion about which other high voltage symbol glyphs in use represented the same character as the one from the North Korean proposal,[43] and which glyph would be best to include for it in the Unicode code chart,[48] and following modification of the code chart glyph of the existing umbrella character without rain (U+2602, ☂) to harmonise with the new umbrella with raindrops from the North Korean proposal,[43][50] these characters were also added in Unicode 4.0, at the same time as the flags and the beverage symbol.[44][45][48] They were given the final code points U+2614, U+26A1 and U+26A0 respectively.[45]

Of these characters, the hot beverage, umbrella with raindrops, lightning bolt and warning triangle, and the upward, downward and leftward arrows were subsequently selected as mappings from the Japanese cellular emoji sets,[51] making a total of seven current Unicode emoji which were originally added to Unicode at the request of North Korea. The umbrella with raindrops and the upward, downward and leftward arrows were also unified with characters from the ARIB extensions used in Japanese broadcasting,[52] which include several characters now classified as emoji,[53] and was mapped to Unicode in Unicode 5.2.[54] However, the pair of white and black flags used as emoji or in emoji regional and identity flag sequences is a different, "waving" set added in Unicode 7.0 (U+1F3F3 🏳 and U+1F3F4 🏴),[55][56] not the North Korean pair.

As of 2018, several KPS 9566 characters remained which are not mapped to Unicode. These include the WPK symbol, four triangular marks, a leftward-pointing pair of scissors (excluded on the rationale that contrastive use with the rightward scissors in the Dingbats block had not been demonstrated), an upward-pointing manicule in a circle, vertical presentation forms of punctuation marks, variants of closing brackets incorporating full stops, horizontal-barred variants of vulgar fractions encoded separately from their slanted versions, and the leaders' names.[57]

A Japanese postal mark with a downward pointing triangle was included in KPS 9566-97 but removed in KPS 9566-2003[1] after the North Korean body had withdrawn it from their Unicode proposal for review[58] in response to requests from the South Korean body for evidence of the symbol's use in North Korea.[41] This mark was re-proposed in 2018 on the basis of KPS 9566 compatibility, and identified as an electrical conformity mark used in Japan prior to its replacement by the PSE diamond.[59] It was added to Unicode in version 13.0, published in 2020.

Encoded forms

The 1997 edition of KPS 9566 was registered with the International Register of Coded Character Sets for Use with Escape Sequences as ISO-IR-202,[21] and can therefore be encoded using ISO/IEC 2022. It is a 94ⁿ multiple-byte G-set, i.e. if it is used in a 7-bit ISO 2022 code (analogous to ISO-2022-JP or ISO-2022-KR), characters will be encoded with pairs of bytes between 0x21 and 0x7E when in the appropriate mode.

The documented mappings between KPS 9566 and Unicode for the 2003[20][1] and 2011[3] editions of KPS 9566 use an encoding resembling an adaptation of Unified Hangul Code (UHC) to encode KPS 9566 rather than Wansung code, with their updated versions of the ISO-IR-202 plane being encoded using pairs of bytes between 0xA1 and 0xFE, and with other two-byte codes used for syllables not present in ISO-IR-202. The order of the extended syllables follows usual KPS 9566 order. Similarly to UHC, they use lead bytes 0x81 and above, and trail bytes from the ranges 0x41–0x5A, 0x61–0x7A and 0x81–0xFE, excluding the range 0xA1–0xFE if the lead byte is 0xA1 or above.[3]

The 2011 edition also includes several additional hanja and symbols encoded outside of the ISO-IR-202 plane, after the range used for the extended syllable blocks.[3] This approach is similar to that taken by GBK, but with the trail bytes remaining in the UHC-style ranges: like the extended syllables with lead bytes 0xA1 and above, these all use the trail byte ranges 0x41–0x5A, 0x61–0x7A and 0x81–0xA0. Extended hanja are encoded with lead bytes between 0xC8 and 0xDC, extended symbols are encoded using lead bytes between 0xE0 and 0xEA, and extended codes with lead bytes between 0xEC and 0xFE are mapped, without gaps, to the Private Use Area[3] (compare the user-defined ranges in GBK). Several of the characters in the extended symbols section and three in the hanja section are also mapped to the Unicode Private Use Area; unlike the PUA-mapped symbols in the main ISO-IR-202 plane, the identity of these characters is not documented.[3]

Lead byte

This chart details the overall layout of the main plane of the KPS 9566 character set by lead byte.[21] For lead bytes used for characters other than composed chosŏn'gŭl syllables or hanja, links are provided to charts on this page listing the characters encoded under that lead byte. For lead bytes used for hanja, links are provided to the appropriate section of Wiktionary's hanja index.

Where two hexadecimal numbers are given, the value below 0x7F is used in a 7-bit encoding,[lower-alpha 1] and the larger value (between 0xA1 and 0xFE) is used in an 8-bit EUC-style encoding.[17] The extended UHC-style 8-bit encodings defined by the 2003 edition onwards likewise use the larger byte values, between 0xA1 and 0xFE inclusive, for the main ISO-IR-202-based plane.[1][3]

KPS 9566 (lead bytes)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_	SP[lower-alpha 2] 0020	Punct. LEAD 1-_	Symbol LEAD 2-_	Alnum. LEAD 3-_	Jamo LEAD 4-_	Cyrillic LEAD 5-_	Greek LEAD 6-_	List LEAD 7-_	Units LEAD 8-_	Box LEAD 9-_	Hiragana LEAD 10-_	Katakana LEAD 11-_	Misc. LEAD 12-_	13-_	Ext Latn LEAD 14-_	15-_
3_/B_	Syllable LEAD 16-_	Syllable LEAD 17-_	Syllable LEAD 18-_	Syllable LEAD 19-_	Syllable LEAD 20-_	Syllable LEAD 21-_	Syllable LEAD 22-_	Syllable LEAD 23-_	Syllable LEAD 24-_	Syllable LEAD 25-_	Syllable LEAD 26-_	Syllable LEAD 27-_	Syllable LEAD 28-_	Syllable LEAD 29-_	Syllable LEAD 30-_	Syllable LEAD 31-_
4_/C_	Syllable LEAD 32-_	Syllable LEAD 33-_	Syllable LEAD 34-_	Syllable LEAD 35-_	Syllable LEAD 36-_	Syllable LEAD 37-_	Syllable LEAD 38-_	Syllable LEAD 39-_	Syllable LEAD 40-_	Syllable LEAD 41-_	Syllable LEAD 42-_	Syllable LEAD 43-_	Syllable LEAD 44-_	Hanja LEAD 45-_	Hanja LEAD 46-_	Hanja LEAD 47-_
5_/D_	Hanja LEAD 48-_	Hanja LEAD 49-_	Hanja LEAD 50-_	Hanja LEAD 51-_	Hanja LEAD 52-_	Hanja LEAD 53-_	Hanja LEAD 54-_	Hanja LEAD 55-_	Hanja LEAD 56-_	Hanja LEAD 57-_	Hanja LEAD 58-_	Hanja LEAD 59-_	Hanja LEAD 60-_	Hanja LEAD 61-_	Hanja LEAD 62-_	Hanja LEAD 63-_
6_/E_	Hanja LEAD 64-_	Hanja LEAD 65-_	Hanja LEAD 66-_	Hanja LEAD 67-_	Hanja LEAD 68-_	Hanja LEAD 69-_	Hanja LEAD 70-_	Hanja LEAD 71-_	Hanja LEAD 72-_	Hanja LEAD 73-_	Hanja LEAD 74-_	Hanja LEAD 75-_	Hanja LEAD 76-_	Hanja LEAD 77-_	Hanja LEAD 78-_	Hanja LEAD 79-_
7_/F_	Hanja LEAD 80-_	Hanja LEAD 81-_	Hanja LEAD 82-_	Hanja LEAD 83-_	Hanja LEAD 84-_	Hanja LEAD 85-_	Hanja LEAD 86-_	Hanja LEAD 87-_	Hanja LEAD 88-_	Hanja LEAD 89-_	Hanja LEAD 90-_	Hanja LEAD 91-_	Hanja LEAD 92-_	Hanja LEAD 93-_	Hanja LEAD 94-_	DEL[lower-alpha 2] 007F

Letter Number Punctuation Symbol Other Lead byte Undefined

Non-Hanja, non-composed sets in the main plane

Character set 0x21/0xA1 (row number 1, punctuation and vertical forms)

This set contains common sentence punctuation such as brackets, quotation marks, commas and so forth, as well as presentation forms for use in vertical writing. ASCII punctuation (shown here with a heavy green border) is shown below mapped to Basic Latin codepoints (consistent with articles on other CJK character sets, such as KS X 1001 or JIS X 0208), but is mapped to the Halfwidth and Fullwidth Forms block when used in an encoding which combines KPS 9566 with ASCII (as defined by, for example, the 2003 edition).[1]

Compared to the 2003 mapping, the 2011 mapping changes the Unicode mappings of three vertical presentation forms to take advantage of the Vertical Forms block introduced with Unicode 4.1.[3]

KPS 9566 (prefixed with 0x21/0xA1)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		IDSP 3000 1-1	、 3001 1-2	。 3002 1-3	, 002C 1-4	. 002E 1-5	· 00B7 1-6	: 003A 1-7	; 003B 1-8	? 003F 1-9	! 0021 1-10	‥ 2025 1-11	… 2026 1-12	~ 007E 1-13	〃 3003 1-14	― 2015 1-15
3_/B_	‐ 2010 1-16	_ 005F 1-17	￣[lower-alpha 3] FFE3 1-18	/ 002F 1-19	\ 005C 1-20	\| 007C 1-21	∥ 2225 1-22	∕ 2215 1-23	∖ 2216 1-24	゛ 309B 1-25	゜ 309C 1-26	´ 00B4 1-27	` 0060 1-28	¨ 00A8 1-29	^ 005E 1-30	ˇ 02C7 1-31
4_/C_	˙ 02D9 1-32	ʼ/︐ 02BC/FE10 1-33	˚/︒ 02DA/FE12 1-34	ˊ/︑ 02CA/FE11 1-35	⋮ 22EE 1-36	[lower-alpha 4] 2E2F/F104 1-37	‘ 2018 1-38	’ 2019 1-39	“ 201C 1-40	” 201D 1-41	( 0028 1-42	) 0029 1-43	〔 3014 1-44	〕 3015 1-45	[ 005B 1-46	] 005D 1-47
5_/D_	{ 007B 1-48	} 007D 1-49	〈 3008 1-50	〉 3009 1-51	《 300A 1-52	》 300B 1-53	「 300C 1-54	」 300D 1-55	『 300E 1-56	』 300F 1-57	【 3010 1-58	】 3011 1-59	.) F105[lower-alpha 5] 1-60	.⟫ F106[lower-alpha 5] 1-61	‚ 201A 1-62	‛ 201B 1-63
6_/E_	„ 201E 1-64	‟ 201F 1-65	︵ FE35 1-66	︶ FE36 1-67	︹ FE39 1-68	︺ FE3A 1-69	﹇ FE47 1-70	﹈ FE48 1-71	︷ FE37 1-72	︸ FE38 1-73	︿ FE3F 1-74	﹀ FE40 1-75	︽ FE3D 1-76	︾ FE3E 1-77	﹁ FE41 1-78	﹂ FE42 1-79
7_/F_	﹃ FE43 1-80	﹄ FE44 1-81	︻ FE3B 1-82	︼ FE3C 1-83	1-84	1-85	1-86	1-87	1-88	1-89	1-90	1-91	1-92	1-93	1-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x22/0xA2 (row number 2, symbols and operators)

This set includes mathematical operators, and some other symbols such as the ampersand, pilcrow, musical note and so forth. ASCII punctuation (shown here with a heavy green border) is shown below mapped to Basic Latin codepoints (consistent with articles on other CJK character sets), but is mapped to the Halfwidth and Fullwidth Forms block when used in an encoding which combines KPS 9566 with ASCII.[1]

Several triangular "road mark" symbols denoting upcoming mountains or inclines ahead or to one side are included in this row, but not presently included in Unicode. They are mapped to the Private Use Area.[39]

KPS 9566 (prefixed with 0x22/0xA2)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		+ 002B 2-1	- 002D 2-2	± 00B1 2-3	× 00D7 2-4	÷ 00F7 2-5	= 003D 2-6	≠ 2260 2-7	< 003C 2-8	> 003E 2-9	≦ 2266 2-10	≧ 2267 2-11	∞ 221E 2-12	∴ 2234 2-13	♂ 2642 2-14	♀ 2640 2-15
3_/B_	∠ 2220 2-16	⊥ 22A5 2-17	⌒ 2312 2-18	∂ 2202 2-19	∇ 2207 2-20	≡ 2261 2-21	≒ 2252 2-22	≈ 2248 2-23	≪ 226A 2-24	≫ 226B 2-25	√ 221A 2-26	∽ 223D 2-27	∝ 221D 2-28	∵ 2235 2-29	∫ 222B 2-30	∬ 222C 2-31
4_/C_	∮ 222E 2-32	∈ 2208 2-33	∋ 220B 2-34	⊆ 2286 2-35	⊇ 2287 2-36	⊂ 2282 2-37	⊃ 2283 2-38	∉ 2209 2-39	∌ 220C 2-40	⊈ 2288 2-41	⊉ 2289 2-42	⊄ 2284 2-43	⊅ 2285 2-44	∪ 222A 2-45	∩ 2229 2-46	∧ 2227 2-47
5_/D_	∨ 2228 2-48	￢[lower-alpha 3] FFE2 2-49	⇒ 21D2 2-50	⇔ 21D4 2-51	∀ 2200 2-52	∃ 2203 2-53	∑ 2211 2-54	# 0023 2-55	& 0026 2-56	* 002A 2-57	@ 0040 2-58	§ 00A7 2-59	※ 203B 2-60	☆ 2606 2-61	★ 2605 2-62	○ 25CB 2-63
6_/E_	● 25CF 2-64	◎ 25CE 2-65	◇ 25C7 2-66	◆ 25C6 2-67	□ 25A1 2-68	■ 25A0 2-69	△ 25B3 2-70	▲ 25B2 2-71	▽ 25BD 2-72	▼ 25BC 2-73	▷ 25B7 2-74	◁ 25C1 2-75	▶ 25B6 2-76	◀ 25C0 2-77	∘ 2218 2-78	∙ 2219 2-79
7_/F_	❖ 2756 2-80	F10D[lower-alpha 6][lower-alpha 7] 2-81	F10E[lower-alpha 6] 2-82	F10F[lower-alpha 6] 2-83	F110[lower-alpha 6] 2-84	⚐ 2690 2-85	⚑ 2691 2-86	♯ 266F 2-87	♭ 266D 2-88	♪ 266A 2-89	† 2020 2-90	‡ 2021 2-91	¶ 00B6 2-92	⊕ 2295 2-93	⊖ 2296 2-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x23/0xA3 (row number 3, digits and Roman)

This set includes a subset of ASCII, minus punctuation and symbols, comprising western Arabic numerals and both cases of the Basic Latin alphabet. Compare row 3 of JIS X 0208, which this row exactly matches. Compare and contrast row 3 of KS X 1001 and GB 2312, which include their entire national variants of ISO 646 in this row, rather than only the alphanumeric subset.

The characters in this row are shown below mapped to Basic Latin codepoints (consistent with articles on the other character sets), but is mapped to the Halfwidth and Fullwidth Forms block when used in an encoding which combines KPS 9566 with ASCII.[1]

KPS 9566 (prefixed with 0x23/0xA3)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		3-1	3-2	3-3	3-4	3-5	3-6	3-7	3-8	3-9	3-10	3-11	3-12	3-13	3-14	3-15
3_/B_	0 0030 3-16	1 0031 3-17	2 0032 3-18	3 0033 3-19	4 0034 3-20	5 0035 3-21	6 0036 3-22	7 0037 3-23	8 0038 3-24	9 0039 3-25	3-26	3-27	3-28	3-29	3-30	3-31
4_/C_	3-32	A 0041 3-33	B 0042 3-34	C 0043 3-35	D 0044 3-36	E 0045 3-37	F 0046 3-38	G 0047 3-39	H 0048 3-40	I 0049 3-41	J 004A 3-42	K 004B 3-43	L 004C 3-44	M 004D 3-45	N 004E 3-46	O 004F 3-47
5_/D_	P 0050 3-48	Q 0051 3-49	R 0052 3-50	S 0053 3-51	T 0054 3-52	U 0055 3-53	V 0056 3-54	W 0057 3-55	X 0058 3-56	Y 0059 3-57	Z 005A 3-58	3-59	3-60	3-61	3-62	3-63
6_/E_	3-64	a 0061 3-65	b 0062 3-66	c 0063 3-67	d 0064 3-68	e 0065 3-69	f 0066 3-70	g 0067 3-71	h 0068 3-72	i 0069 3-73	j 006A 3-74	k 006B 3-75	l 006C 3-76	m 006D 3-77	n 006E 3-78	o 006F 3-79
7_/F_	p 0070 3-80	q 0071 3-81	r 0072 3-82	s 0073 3-83	t 0074 3-84	u 0075 3-85	v 0076 3-86	w 0077 3-87	x 0078 3-88	y 0079 3-89	z 007A 3-90	3-91	3-92	3-93	3-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x24/0xA4 (row number 4, Chosŏn'gŭl jamo and leaders' names)

This set contains Chosŏn'gŭl jamo, as well as special encodings for the names of (as of 2003) the North Korean Leaders Kim Il-sung and Kim Jong-il. The name of Kim Jong-un is also included as of the 2011 edition.[3] Compare with row 4 of KS X 1001.

The jamo in this row which exist in the Unicode Hangul Compatibility Jamo block (which contains the position-independent characters mapped from KS X 1001) are mapped to that block. The obsolete jamo distinguishing palatised sibilants map to the position-specific characters in the Hangul Jamo block.[1] Conversely, not all of the obsolete jamo encoded by KS X 1001 are encoded in the main plane of KPS 9566. In the 2011 edition of KPS 9566, the remainder of the historic jamo from KS X 1001 are included outside of the main plane, with the lead byte 0xEA.[3]

The special encodings of the leaders' names are not present in Unicode and are mapped to the Private Use Area. They are shown below simulated with markup.

KPS 9566 (prefixed with 0x24/0xA4)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		ㄱ 3131 4-1	ㄴ 3134 4-2	ㄷ 3137 4-3	ㄹ 3139 4-4	ㅁ 3141 4-5	ㅂ 3142 4-6	ㅅ 3145 4-7	ㅇ 3147 4-8	ㅈ 3148 4-9	ㅊ 314A 4-10	ㅋ 314B 4-11	ㅌ 314C 4-12	ㅍ 314D 4-13	ㅎ 314E 4-14	ㄲ 3132 4-15
3_/B_	ㄸ 3138 4-16	ㅃ 3143 4-17	ㅆ 3146 4-18	ㅉ 3149 4-19	ㅏ 314F 4-20	ㅑ 3151 4-21	ㅓ 3153 4-22	ㅕ 3155 4-23	ㅗ 3157 4-24	ㅛ 315B 4-25	ㅜ 315C 4-26	ㅠ 3160 4-27	ㅡ 3161 4-28	ㅣ 3163 4-29	ㅐ 3150 4-30	ㅒ 3152 4-31
4_/C_	ㅔ 3154 4-32	ㅖ 3156 4-33	ㅚ 315A 4-34	ㅟ 315F 4-35	ㅢ 3162 4-36	ㅘ 3158 4-37	ㅝ 315D 4-38	ㅙ 3159 4-39	ㅞ 315E 4-40	ㄳ 3133 4-41	ㄵ 3135 4-42	ㄶ 3136 4-43	ㄺ 313A 4-44	ㄻ 313B 4-45	ㄼ 313C 4-46	ㄽ 313D 4-47
5_/D_	ㄾ 313E 4-48	ㄿ 313F 4-49	ㅀ 3140 4-50	ㅄ 3144 4-51	ㅿ 317F 4-52	ㆁ 3181 4-53	ㆆ 3186 4-54	ㆍ 318D 4-55	ᄼ 113C 4-56	ᄽ 113D 4-57	ᄾ 113E 4-58	ᄿ 113F 4-59	ᅎ 114E 4-60	ᅏ 114F 4-61	ᅐ 1150 4-62	ᅑ 1151 4-63
6_/E_	ᅔ 1154 4-64	ᅕ 1155 4-65	4-66	4-67	4-68	4-69	4-70	4-71	김 F113[lower-alpha 8] 4-72	일 F114[lower-alpha 8] 4-73	성 F115[lower-alpha 8] 4-74	김 F116[lower-alpha 8] 4-75	정 F117[lower-alpha 8] 4-76	일 F118[lower-alpha 8] 4-77	김 F120[lower-alpha 8] 4-78	정 F121[lower-alpha 8] 4-79
7_/F_	은 F122[lower-alpha 8] 4-80	4-81	4-82	4-83	4-84	4-85	4-86	4-87	4-88	4-89	4-90	4-91	4-92	4-93	4-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x25/0xA5 (row number 5, Cyrillic)

This set includes both cases of 33 letters from the Cyrillic script, sufficient to write the modern Russian alphabet and Bulgarian alphabet, although other forms of Cyrillic require additional letters.[62]

Compare row 12 of KS X 1001 and row 7 of JIS X 0208, which use the same layout (but in a different row).

KPS 9566 (prefixed with 0x25/0xA5)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		А 0410 5-1	Б 0411 5-2	В 0412 5-3	Г 0413 5-4	Д 0414 5-5	Е 0415 5-6	Ё 0401 5-7	Ж 0416 5-8	З 0417 5-9	И 0418 5-10	Й 0419 5-11	К 041A 5-12	Л 041B 5-13	М 041C 5-14	Н 041D 5-15
3_/B_	О 041E 5-16	П 041F 5-17	Р 0420 5-18	С 0421 5-19	Т 0422 5-20	У 0423 5-21	Ф 0424 5-22	Х 0425 5-23	Ц 0426 5-24	Ч 0427 5-25	Ш 0428 5-26	Щ 0429 5-27	Ъ 042A 5-28	Ы 042B 5-29	Ь 042C 5-30	Э 042D 5-31
4_/C_	Ю 042E 5-32	Я 042F 5-33	5-34	5-35	5-36	5-37	5-38	5-39	5-40	5-41	5-42	5-43	5-44	5-45	5-46	5-47
5_/D_	5-48	а 0430 5-49	б 0431 5-50	в 0432 5-51	г 0433 5-52	д 0434 5-53	е 0435 5-54	ё 0451 5-55	ж 0436 5-56	з 0437 5-57	и 0438 5-58	й 0439 5-59	к 043A 5-60	л 043B 5-61	м 043C 5-62	н 043D 5-63
6_/E_	о 043E 5-64	п 043F 5-65	р 0440 5-66	с 0441 5-67	т 0442 5-68	у 0443 5-69	ф 0444 5-70	х 0445 5-71	ц 0446 5-72	ч 0447 5-73	ш 0448 5-74	щ 0449 5-75	ъ 044A 5-76	ы 044B 5-77	ь 044C 5-78	э 044D 5-79
7_/F_	ю 044E 5-80	я 044F 5-81	5-82	5-83	5-84	5-85	5-86	5-87	5-88	5-89	5-90	5-91	5-92	5-93	5-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x26/0xA6 (row number 6, Greek letters and Roman numerals)

This set contains Roman numerals and basic support for the Greek alphabet, without diacritics or the final sigma.

Compare and contrast row 5 of KS X 1001 (which uses the same characters but in a different layout and a different row) and row 6 of JIS X 0208 (which uses the same layout for the Greek letters, but without the Roman numerals).

KPS 9566 (prefixed with 0x26/0xA6)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		Α 0391 6-1	Β 0392 6-2	Γ 0393 6-3	Δ 0394 6-4	Ε 0395 6-5	Ζ 0396 6-6	Η 0397 6-7	Θ 0398 6-8	Ι 0399 6-9	Κ 039A 6-10	Λ 039B 6-11	Μ 039C 6-12	Ν 039D 6-13	Ξ 039E 6-14	Ο 039F 6-15
3_/B_	Π 03A0 6-16	Ρ 03A1 6-17	Σ 03A3 6-18	Τ 03A4 6-19	Υ 03A5 6-20	Φ 03A6 6-21	Χ 03A7 6-22	Ψ 03A8 6-23	Ω 03A9 6-24	6-25	6-26	6-27	6-28	6-29	6-30	6-31
4_/C_	6-32	α 03B1 6-33	β 03B2 6-34	γ 03B3 6-35	δ 03B4 6-36	ε 03B5 6-37	ζ 03B6 6-38	η 03B7 6-39	θ 03B8 6-40	ι 03B9 6-41	κ 03BA 6-42	λ 03BB 6-43	μ 03BC 6-44	ν 03BD 6-45	ξ 03BE 6-46	ο 03BF 6-47
5_/D_	π 03C0 6-48	ρ 03C1 6-49	σ 03C3 6-50	τ 03C4 6-51	υ 03C5 6-52	φ 03C6 6-53	χ 03C7 6-54	ψ 03C8 6-55	ω 03C9 6-56	6-57	6-58	6-59	6-60	6-61	6-62	6-63
6_/E_	6-64	Ⅰ 2160 6-65	Ⅱ 2161 6-66	Ⅲ 2162 6-67	Ⅳ 2163 6-68	Ⅴ 2164 6-69	Ⅵ 2165 6-70	Ⅶ 2166 6-71	Ⅷ 2167 6-72	Ⅸ 2168 6-73	Ⅹ 2169 6-74	6-75	6-76	6-77	6-78	6-79
7_/F_	6-80	ⅰ 2170 6-81	ⅱ 2171 6-82	ⅲ 2172 6-83	ⅳ 2173 6-84	ⅴ 2174 6-85	ⅵ 2175 6-86	ⅶ 2176 6-87	ⅷ 2177 6-88	ⅸ 2178 6-89	ⅹ 2179 6-90	6-91	6-92	6-93	6-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x27/0xA7 (row number 7, encircled, superscript, subscript, fractions)

Several circled numbers in this row were mapped to Unicode incorrectly in the 2003 edition, due to using non-final proposed code points.[1] They were corrected in the 2011 edition.[3]

KPS 9566 (prefixed with 0x27/0xA7)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		① 2460 7-1	② 2461 7-2	③ 2462 7-3	④ 2463 7-4	⑤ 2464 7-5	⑥ 2465 7-6	⑦ 2466 7-7	⑧ 2467 7-8	⑨ 2468 7-9	⑩ 2469 7-10	⑪ 246A 7-11	⑫ 246B 7-12	⑬ 246C 7-13	⑭ 246D 7-14	⑮ 246E 7-15
3_/B_	⑯ 246F 7-16	⑰ 2470 7-17	⑱ 2471 7-18	⑲ 2472 7-19	⑳ 2473 7-20	㉑ 3251 7-21	㉒ 3252 7-22	㉓ 3253 7-23	㉔ 3254 7-24	㉕ 3255 7-25	㉖ 3256 7-26	㉗ 3257 7-27	㉘ 3258 7-28	㉙ 3259 7-29	㉚ 325A 7-30	7-31
4_/C_	7-32	㉠ 3260 7-33	㉡ 3261 7-34	㉢ 3262 7-35	㉣ 3263 7-36	㉤ 3264 7-37	㉥ 3265 7-38	㉦ 3266 7-39	㉧ 3267 7-40	㉨ 3268 7-41	㉩ 3269 7-42	㉪ 326A 7-43	㉫ 326B 7-44	㉬ 326C 7-45	㉭ 326D 7-46	7-47
5_/D_	7-48	㉮ 326E 7-49	㉯ 326F 7-50	㉰ 3270 7-51	㉱ 3271 7-52	㉲ 3272 7-53	㉳ 3273 7-54	㉴ 3274 7-55	㉵ 3275 7-56	㉶ 3276 7-57	㉷ 3277 7-58	㉸ 3278 7-59	㉹ 3279 7-60	㉺ 327A 7-61	㉻ 327B 7-62	7-63
6_/E_	⁰ 2070 7-64	¹ 00B9 7-65	² 00B2 7-66	³ 00B3 7-67	⁴ 2074 7-68	⁵ 2075 7-69	⁶ 2076 7-70	⁷ 2077 7-71	⁸ 2078 7-72	⁹ 2079 7-73	½ 00BD 7-74	⅓ 2153 7-75	⅔ 2154 7-76	¼ 00BC 7-77	¾ 00BE 7-78	7-79
7_/F_	₀ 2080 7-80	₁ 2081 7-81	₂ 2082 7-82	₃ 2083 7-83	₄ 2084 7-84	₅ 2085 7-85	₆ 2086 7-86	₇ 2087 7-87	₈ 2088 7-88	₉ 2089 7-89	1/2 F119[lower-alpha 9] 7-90	1/3 F11A[lower-alpha 9] 7-91	2/3 F11B[lower-alpha 9] 7-92	1/4 F11C[lower-alpha 9] 7-93	3/4 F11D[lower-alpha 9] 7-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x28/0xA8 (row number 8, unit, quantity and currency symbols)

This set contains symbols for units of measure and currency. Those present in ASCII (shown here with a heavy green border) are shown below mapped to Basic Latin codepoints (consistent with articles on other CJK character sets), but are mapped to the Halfwidth and Fullwidth Forms block when used in an encoding which combines KPS 9566 with ASCII.[1]

The Kelvin sign was replaced with a euro sign in the 2003 edition.[1] The 2011 edition includes an alternative encoding of the Kelvin sign at 0xE988.[3]

KPS 9566 (prefixed with 0x28/0xA8)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		° 00B0 8-1	′ 2032 8-2	″ 2033 8-3	℃ 2103 8-4	℉ 2109 8-5	€/K[lower-alpha 10] 20AC/212A 8-6	￦ FFE6 8-7	$ 0024 8-8	￠[lower-alpha 3] FFE0 8-9	￡[lower-alpha 3] FFE1 8-10	￥[lower-alpha 3] FFE5 8-11	% 0025 8-12	‰ 2030 8-13	Å 212B 8-14	㏄ 33C4 8-15
3_/B_	㎡ 33A1 8-16	㎥ 33A5 8-17	㎝ 339D 8-18	㎠ 33A0 8-19	㎤ 33A4 8-20	㎜ 339C 8-21	㎟ 339F 8-22	㎣ 33A3 8-23	㍷ 3377 8-24	㍸ 3378 8-25	㍹ 3379 8-26	㎞ 339E 8-27	㎢ 33A2 8-28	㎦ 33A6 8-29	㎙ 3399 8-30	㎚ 339A 8-31
4_/C_	㎛ 339B 8-32	㎧ 33A7 8-33	㎨ 33A8 8-34	㎍ 338D 8-35	㎎ 338E 8-36	㎏ 338F 8-37	㎴ 33B4 8-38	㎵ 33B5 8-39	㎶ 33B6 8-40	㎷ 33B7 8-41	㎸ 33B8 8-42	㎹ 33B9 8-43	㎀ 3380 8-44	㎁ 3381 8-45	㎂ 3382 8-46	㎃ 3383 8-47
5_/D_	㎄ 3384 8-48	㎺ 33BA 8-49	㎻ 33BB 8-50	㎼ 33BC 8-51	㎽ 33BD 8-52	㎾ 33BE 8-53	㎿ 33BF 8-54	Ω 2126 8-55	㏀ 33C0 8-56	㏁ 33C1 8-57	㎐ 3390 8-58	㎑ 3391 8-59	㎒ 3392 8-60	㎓ 3393 8-61	㎔ 3394 8-62	㏞ 33DE 8-63
6_/E_	㏟ 33DF 8-64	㎰ 33B0 8-65	㎱ 33B1 8-66	㎲ 33B2 8-67	㎳ 33B3 8-68	㎊ 338A 8-69	㎋ 338B 8-70	㎌ 338C 8-71	㎩ 33A9 8-72	㎪ 33AA 8-73	㎫ 33AB 8-74	㎬ 33AC 8-75	ℓ 2113 8-76	㎕ 3395 8-77	㎖ 3396 8-78	㎗ 3397 8-79
7_/F_	㎘ 3398 8-80	㏿ 33FF 8-81	㎈ 3388 8-82	㎉ 3389 8-83	㎭ 33AD 8-84	㎮ 33AE 8-85	㎯ 33AF 8-86	㋌ 32CC 8-87	㏝ 33DD 8-88	㏈ 33C8 8-89	㋍ 32CD 8-90	㋎ 32CE 8-91	㏖ 33D6 8-92	㏋ 33CB 8-93	㏊ 33CA 8-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x29/0xA9 (row number 9, box drawing)

KPS 9566 (prefixed with 0x29/0xA9)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		─ 2500 9-1	│ 2502 9-2	┌ 250C 9-3	┐ 2510 9-4	┘ 2518 9-5	└ 2514 9-6	├ 251C 9-7	┬ 252C 9-8	┤ 2524 9-9	┴ 2534 9-10	┼ 253C 9-11	━ 2501 9-12	┃ 2503 9-13	┏ 250F 9-14	┓ 2513 9-15
3_/B_	┛ 251B 9-16	┗ 2517 9-17	┣ 2523 9-18	┳ 2533 9-19	┫ 252B 9-20	┻ 253B 9-21	╋ 254B 9-22	┠ 2520 9-23	┯ 252F 9-24	┨ 2528 9-25	┷ 2537 9-26	┿ 253F 9-27	┝ 251D 9-28	┰ 2530 9-29	┥ 2525 9-30	┸ 2538 9-31
4_/C_	╂ 2542 9-32	┒ 2512 9-33	┑ 2511 9-34	┚ 251A 9-35	┙ 2519 9-36	┖ 2516 9-37	┕ 2515 9-38	┎ 250E 9-39	┍ 250D 9-40	┞ 251E 9-41	┟ 251F 9-42	┡ 2521 9-43	┢ 2522 9-44	┦ 2526 9-45	┧ 2527 9-46	┩ 2529 9-47
5_/D_	┪ 252A 9-48	┭ 252D 9-49	┮ 252E 9-50	┱ 2531 9-51	┲ 2532 9-52	┵ 2535 9-53	┶ 2536 9-54	┹ 2539 9-55	┺ 253A 9-56	┽ 253D 9-57	┾ 253E 9-58	╀ 2540 9-59	╁ 2541 9-60	╃ 2543 9-61	╄ 2544 9-62	╅ 2545 9-63
6_/E_	╆ 2546 9-64	╇ 2547 9-65	╈ 2548 9-66	╉ 2549 9-67	╊ 254A 9-68	9-69	9-70	9-71	9-72	9-73	9-74	9-75	9-76	9-77	9-78	9-79
7_/F_	9-80	9-81	9-82	9-83	9-84	9-85	9-86	9-87	9-88	9-89	9-90	9-91	9-92	9-93	9-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x2A/0xAA (row number 10, Hiragana)

This row contains Hiragana for use in the Japanese language.

Compare row 10 of KS X 1001, which uses the same layout. Compare and contrast row 4 of JIS X 0208, which also uses the same layout, but in a different row.

KPS 9566 (prefixed with 0x2A/0xAA)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		ぁ 3041 10-1	あ 3042 10-2	ぃ 3043 10-3	い 3044 10-4	ぅ 3045 10-5	う 3046 10-6	ぇ 3047 10-7	え 3048 10-8	ぉ 3049 10-9	お 304A 10-10	か 304B 10-11	が 304C 10-12	き 304D 10-13	ぎ 304E 10-14	く 304F 10-15
3_/B_	ぐ 3050 10-16	け 3051 10-17	げ 3052 10-18	こ 3053 10-19	ご 3054 10-20	さ 3055 10-21	ざ 3056 10-22	し 3057 10-23	じ 3058 10-24	す 3059 10-25	ず 305A 10-26	せ 305B 10-27	ぜ 305C 10-28	そ 305D 10-29	ぞ 305E 10-30	た 305F 10-31
4_/C_	だ 3060 10-32	ち 3061 10-33	ぢ 3062 10-34	っ 3063 10-35	つ 3064 10-36	づ 3065 10-37	て 3066 10-38	で 3067 10-39	と 3068 10-40	ど 3069 10-41	な 306A 10-42	に 306B 10-43	ぬ 306C 10-44	ね 306D 10-45	の 306E 10-46	は 306F 10-47
5_/D_	ば 3070 10-48	ぱ 3071 10-49	ひ 3072 10-50	び 3073 10-51	ぴ 3074 10-52	ふ 3075 10-53	ぶ 3076 10-54	ぷ 3077 10-55	へ 3078 10-56	べ 3079 10-57	ぺ 307A 10-58	ほ 307B 10-59	ぼ 307C 10-60	ぽ 307D 10-61	ま 307E 10-62	み 307F 10-63
6_/E_	む 3080 10-64	め 3081 10-65	も 3082 10-66	ゃ 3083 10-67	や 3084 10-68	ゅ 3085 10-69	ゆ 3086 10-70	ょ 3087 10-71	よ 3088 10-72	ら 3089 10-73	り 308A 10-74	る 308B 10-75	れ 308C 10-76	ろ 308D 10-77	ゎ 308E 10-78	わ 308F 10-79
7_/F_	ゐ 3090 10-80	ゑ 3091 10-81	を 3092 10-82	ん 3093 10-83	10-84	10-85	10-86	10-87	10-88	10-89	10-90	10-91	10-92	10-93	10-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x2B/0xAB (row number 11, Katakana)

This row contains Katakana for use in the Japanese language.

Compare row 11 of KS X 1001, which uses the same layout. Compare and contrast row 5 of JIS X 0208, which also uses the same layout, but in a different row.

KPS 9566 (prefixed with 0x2B/0xAB)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		ァ 30A1 11-1	ア 30A2 11-2	ィ 30A3 11-3	イ 30A4 11-4	ゥ 30A5 11-5	ウ 30A6 11-6	ェ 30A7 11-7	エ 30A8 11-8	ォ 30A9 11-9	オ 30AA 11-10	カ 30AB 11-11	ガ 30AC 11-12	キ 30AD 11-13	ギ 30AE 11-14	ク 30AF 11-15
3_/B_	グ 30B0 11-16	ケ 30B1 11-17	ゲ 30B2 11-18	コ 30B3 11-19	ゴ 30B4 11-20	サ 30B5 11-21	ザ 30B6 11-22	シ 30B7 11-23	ジ 30B8 11-24	ス 30B9 11-25	ズ 30BA 11-26	セ 30BB 11-27	ゼ 30BC 11-28	ソ 30BD 11-29	ゾ 30BE 11-30	タ 30BF 11-31
4_/C_	ダ 30C0 11-32	チ 30C1 11-33	ヂ 30C2 11-34	ッ 30C3 11-35	ツ 30C4 11-36	ヅ 30C5 11-37	テ 30C6 11-38	デ 30C7 11-39	ト 30C8 11-40	ド 30C9 11-41	ナ 30CA 11-42	ニ 30CB 11-43	ヌ 30CC 11-44	ネ 30CD 11-45	ノ 30CE 11-46	ハ 30CF 11-47
5_/D_	バ 30D0 11-48	パ 30D1 11-49	ヒ 30D2 11-50	ビ 30D3 11-51	ピ 30D4 11-52	フ 30D5 11-53	ブ 30D6 11-54	プ 30D7 11-55	ヘ 30D8 11-56	ベ 30D9 11-57	ペ 30DA 11-58	ホ 30DB 11-59	ボ 30DC 11-60	ポ 30DD 11-61	マ 30DE 11-62	ミ 30DF 11-63
6_/E_	ム 30E0 11-64	メ 30E1 11-65	モ 30E2 11-66	ャ 30E3 11-67	ヤ 30E4 11-68	ュ 30E5 11-69	ユ 30E6 11-70	ョ 30E7 11-71	ヨ 30E8 11-72	ラ 30E9 11-73	リ 30EA 11-74	ル 30EB 11-75	レ 30EC 11-76	ロ 30ED 11-77	ヮ 30EE 11-78	ワ 30EF 11-79
7_/F_	ヰ 30F0 11-80	ヱ 30F1 11-81	ヲ 30F2 11-82	ン 30F3 11-83	ヴ 30F4 11-84	ヵ 30F5 11-85	ヶ 30F6 11-86	11-87	11-88	11-89	11-90	11-91	11-92	11-93	11-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x2C/0xAC (row number 12, miscellaneous symbols and arrows)

For the purpose of mapping this row to Unicode, the bold rightward arrow was unified with the bold rightward arrow from Zapf Dingbats (U+27A1). Since corresponding arrows in other directions were not included in the Dingbats block, additional arrows were encoded between U+2B05 and U+2B0D for compatibility with KPS 9566. These were incorporated into the Unicode code charts using the reference glyphs proposed by the North Korean national body, while U+27A1 retained its reference glyph based on Zapf Dingbats.[46] These arrows (U+2B05 through U+2B07, plus U+27A1) were chosen in Unicode 6.0 as the mappings for the arrow characters in cellular emoji sets.[51] Subsequently, during the addition of the Wingdings 3 repertoire in Unicode 7.0, the Unicode coverage of arrow characters was reviewed, resulting in an additional rightward arrow being added at U+2B95 with the intent of harmonising with characters U+2B05 through U+2B0D (in text presentation), since changing the reference glyph for the Zapf Dingbats character was not considered appropriate.[46]

In earlier editions of KPS 9566, such as the 1997 edition, this row included both the simple Japanese-style postal mark (〒) and a version in a downward-pointing triangle,[39][21] which was proposed by the North Korean national body for addition to Unicode alongside the other missing KPS 9566 characters.[39] A response by a South Korean representative, amongst other requests, requested evidence for the symbol's use in North Korea, noting that the Japanese-style postal mark is not used in South Korea, which uses a circled 우 (i.e. ㉾) for a similar purpose, and enquiring whether a Japanese-style postal mark was in use in North Korea.[41] A subsequent meeting was held to discuss this proposal, attended by North and South Korean WG2 representatives; the meeting report notes that the North Korean body had decided to review the character before discussing it further, and therefore did not recommend it for consideration by WG2 as a whole.[58] The postal mark triangle was subsequently removed from KPS 9566 in 2003, leaving only the unenclosed postal mark.[1]

The postal mark triangle was eventually added to Unicode in version 13.0, both for compatibilty with the legacy KPS 9566-97 character, and subsequent to the mark being identified as a symbol which had been used for certification for electrical appliances in Japan (as a predecessor to the PSE diamond).[59]

Certain KPS 9566 characters in this row, namely two forms of the emblem of the Workers' Party of Korea, a pair of scissors pointing in a different direction to those in the Dingbats block, and a circled upward-pointing manicule, remain mapped to the Private Use Area.[1]

The north-east and north-west white arrows used incorrect swapped Unicode mappings in the 2003 edition.[1] This was corrected in the 2011 edition mappings.[3]

KPS 9566 (prefixed with 0x2C/0xAC)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		F127[lower-alpha 11] 12-1	F128[lower-alpha 11] 12-2	⍟ 235F 12-3	☀ 2600 12-4	☂ 2602 12-5	☔︎ 2614 12-6	☁ 2601 12-7	❄ 2744 12-8	⚡︎ 26A1 12-9	⚠ 26A0 12-10	№ 2116 12-11	→ 2192 12-12	← 2190 12-13	↑ 2191 12-14	↓ 2193 12-15
3_/B_	↗ 2197 12-16	↖ 2196 12-17	↘ 2198 12-18	↙ 2199 12-19	↔ 2194 12-20	↕ 2195 12-21	⇨ 21E8 12-22	⇦ 21E6 12-23	⇧ 21E7 12-24	⇩ 21E9 12-25	⬀ 2B00 12-26	⬁ 2B01 12-27	⬂ 2B02 12-28	⬃ 2B03 12-29	⬄ 2B04 12-30	⇳ 21F3 12-31
4_/C_	➡ 27A1 12-32	⬅ 2B05 12-33	⬆ 2B06 12-34	⬇ 2B07 12-35	⬈ 2B08 12-36	⬉ 2B09 12-37	⬊ 2B0A 12-38	⬋ 2B0B 12-39	⬌ 2B0C 12-40	⬍ 2B0D 12-41	♣ 2663 12-42	♥ 2665 12-43	♠ 2660 12-44	♦ 2666 12-45	〒 3012 12-46	⮗[lower-alpha 12] 2B97 12-47
5_/D_	☏ 260F 12-48	☎ 260E 12-49	⏎ 23CE 12-50	☞ 261E[lower-alpha 13] 12-51	F13C[lower-alpha 14] 12-52	F13D[lower-alpha 15] 12-53	☕︎ 2615 12-54	㉼ 327C 12-55	㉽ 327D 12-56	㈝ 321D 12-57	㈞ 321E 12-58	㏇ 33C7 12-59	㋏ 32CF 12-60	㉐ 3250 12-61	℡ 2121 12-62	℻ 213B 12-63
6_/E_	㍺ 337A 12-64	® 00AE 12-65	12-66	12-67	12-68	12-69	12-70	12-71	12-72	12-73	12-74	12-75	12-76	12-77	12-78	12-79
7_/F_	12-80	12-81	12-82	12-83	12-84	12-85	12-86	12-87	12-88	12-89	12-90	12-91	12-92	12-93	12-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x2D/0xAD (row number 13, empty)

KPS 9566 (prefixed with 0x2D/0xAD)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		13-1	13-2	13-3	13-4	13-5	13-6	13-7	13-8	13-9	13-10	13-11	13-12	13-13	13-14	13-15
3_/B_	13-16	13-17	13-18	13-19	13-20	13-21	13-22	13-23	13-24	13-25	13-26	13-27	13-28	13-29	13-30	13-31
4_/C_	13-32	13-33	13-34	13-35	13-36	13-37	13-38	13-39	13-40	13-41	13-42	13-43	13-44	13-45	13-46	13-47
5_/D_	13-48	13-49	13-50	13-51	13-52	13-53	13-54	13-55	13-56	13-57	13-58	13-59	13-60	13-61	13-62	13-63
6_/E_	13-64	13-65	13-66	13-67	13-68	13-69	13-70	13-71	13-72	13-73	13-74	13-75	13-76	13-77	13-78	13-79
7_/F_	13-80	13-81	13-82	13-83	13-84	13-85	13-86	13-87	13-88	13-89	13-90	13-91	13-92	13-93	13-94

Character set 0x2E/0xAE (row number 14, Latin-1 subset)

The characters in this set were not present in the 1997 version of the character set, but were added in the 2003 version.[1] They constitute a subset of the Latin-1 Supplement block of Unicode (equivalent to the upper half of the ISO 8859-1 (Latin-1) character set). This includes accented Roman letters and symbols. Some of the symbols which were already included are omitted, while some others are duplicated as halfwidth counterparts to the earlier fullwidth forms: for example, the not sign (¬, U+00AC) is represented as 0xAEAC, while its fullwidth form (￢, U+FFE2) is represented as 0xA2D1 (in row 2).[1]

This row is omitted from the mapping for the 2011 edition of the standard,[3] indicating it may have been removed at some point after the 2003 edition. The halfwidth yen sign is instead encoded at 0xE98E in the 2011 edition.[3]

The required space would fall outside of the 94-character range, colliding with the area used for extended chosŏn'gŭl syllables when a UHC-style encoding is used (specifically, with the syllable 쁲),[1] and is omitted. Although the y with trema also falls outside the 94-character range, and the trail byte 0xFF is otherwise unused, the code 0xAEFF is mapped to it in KPS 9566-2003.[1]

KPS 9566-2003 (prefixed with 0x2E/0xAE)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_/A_		¡ 00A1 14-1	¢ 00A2 14-2	£ 00A3 14-3	¤ 00A4 14-4	¥ 00A5 14-5	¦ 00A6 14-6	14-7	14-8	© 00A9 14-9	ª 00AA 14-10	« 00AB 14-11	¬ 00AC 14-12	SHY 00AD 14-13	14-14	¯ 00AF 14-15
3_/B_	14-16	14-17	14-18	14-19	14-20	µ 00B5 14-21	14-22	14-23	¸ 00B8 14-24	14-25	º 00BA 14-26	» 00BB 14-27	14-28	14-29	14-30	¿ 00BF 14-31
4_/C_	À 00C0 14-32	Á 00C1 14-33	Â 00C2 14-34	Ã 00C3 14-35	Ä 00C4 14-36	Å 00C5 14-37	Æ 00C6 14-38	Ç 00C7 14-39	È 00C8 14-40	É 00C9 14-41	Ê 00CA 14-42	Ë 00CB 14-43	Ì 00CC 14-44	Í 00CD 14-45	Î 00CE 14-46	Ï 00CF 14-47
5_/D_	Ð 00D0 14-48	Ñ 00D1 14-49	Ò 00D2 14-50	Ó 00D3 14-51	Ô 00D4 14-52	Õ 00D5 14-53	Ö 00D6 14-54	14-55	Ø 00D8 14-56	Ù 00D9 14-57	Ú 00DA 14-58	Û 00DB 14-59	Ü 00DC 14-60	Ý 00DD 14-61	Þ 00DE 14-62	ß 00DF 14-63
6_/E_	à 00E0 14-64	á 00E1 14-65	â 00E2 14-66	ã 00E3 14-67	ä 00E4 14-68	å 00E5 14-69	æ 00E6 14-70	ç 00E7 14-71	è 00E8 14-72	é 00E9 14-73	ê 00EA 14-74	ë 00EB 14-75	ì 00EC 14-76	í 00ED 14-77	î 00EE 14-78	ï 00EF 14-79
7_/F_	ð 00F0 14-80	ñ 00F1 14-81	ò 00F2 14-82	ó 00F3 14-83	ô 00F4 14-84	õ 00F5 14-85	ö 00F6 14-86	14-87	ø 00F8 14-88	ù 00F9 14-89	ú 00FA 14-90	û 00FB 14-91	ü 00FC 14-92	ý 00FD 14-93	þ 00FE 14-94	ÿ 00FF

Letter Number Punctuation Symbol Other Undefined

Composed Chosŏn'gŭl sets (rows number 16 through 44)

Composed Chosŏn'gŭl syllable clusters are allocated code points in a continuous sorted block between code points 16-01 and 44-47 inclusive. Not all possible clusters are allocated code points; the chart below indicates, for each initial+vowel pair, which initial+vowel+final syllable clusters are assigned code points. Vowels, initials and finals are displayed in KPS sorting order.[63] The "ø" character is used here to denote the empty final. Compare the different ordering and availability in KS X 1001.

The encoded form documented for KPS 9566-2003 encodes the KPS 9566 plane on GR (0xA1-0xFE) and additionally encodes the remaining syllable clusters using lead bytes in the range 0x80-0xC2 and trail bytes in the ranges 0x41-0x5A, 0x61-0x7A and 0x81-0xFE (where at most one byte is in the range 0xA1-0xFE),[1] similarly to Unified Hangul Code but with the omitted clusters from and sorting order of KPS 9566, not KS X 1001.

Initials	Vowels
Initials	ㅏ	ㅑ	ㅓ	ㅕ	ㅗ	ㅛ	ㅜ	ㅠ	ㅡ	ㅣ	ㅐ	ㅒ	ㅔ	ㅖ	ㅚ	ㅟ	ㅢ	ㅘ	ㅝ	ㅙ	ㅞ
ㄱ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅄ, ㅅ, ㅇ, ㅈ, ㅊ, ㅌ, ㅍ, ㅎ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅌ, ㅍ, ㅎ, ㅆ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㄲ, ㅆ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㄽ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅍ	ø, ㄴ, ㄹ, ㅂ, ㅅ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ	ø, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ, ㅌ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㄹ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㄹ, ㅂ, ㅅ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ	ø	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅂ, ㅇ, ㅆ	ø, ㄱ, ㅅ
ㄴ	ø, ㄱ, ㄳ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ, ㅌ, ㅎ, ㄲ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄱ, ㄳ, ㄴ, ㄹ, ㄻ, ㄼ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ, ㅎ, ㄲ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅋ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ, ㅎ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄴ, ㄹ, ㅅ, ㅆ	ø, ㄹ, ㅅ, ㅆ	ø	ø
ㄷ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㄼ, ㄿ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ, ㅍ, ㅎ, ㄲ, ㅆ	ø	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㄼ, ㅁ, ㅂ, ㅅ, ㅇ, ㅊ, ㅍ, ㅎ, ㄲ, ㅆ	ø, ㄴ, ㄹ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㄽ, ㅁ, ㅂ, ㅅ, ㅇ, ㅊ, ㅌ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㄼ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅍ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅂ, ㅇ	ø, ㄴ, ㄹ	ø, ㄴ, ㄹ, ㅁ, ㅅ, ㅆ	ø, ㄴ, ㄹ, ㅆ	ø, ㅇ
ㄹ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅍ, ㅎ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ, ㅎ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅌ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㄹ, ㅂ, ㅅ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ	ø, ㄴ, ㅂ, ㅇ	ø, ㅆ	ø, ㅆ	ø
ㅁ	ø, ㄱ, ㄴ, ㄶ, ㄷ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ, ㅌ, ㅎ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅎ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅊ, ㅆ	ø, ㄱ, ㄳ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅎ	ø, ㄴ, ㄹ, ㅂ, ㅅ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㅎ, ㄲ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅊ, ㅌ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ	ø, ㄴ, ㅁ	ø, ㄴ, ㅇ, ㅆ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅆ	ø	ø
ㅂ	ø, ㄱ, ㄳ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㄼ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㄲ	ø, ㄱ, ㄴ, ㅁ, ㅂ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅋ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅊ, ㅌ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅎ, ㄲ	ø, ㄴ, ㄹ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㅍ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㅆ	ø	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅇ	ø, ㄴ, ㄹ, ㅇ	ø, ㄴ, ㅂ, ㅅ, ㅆ	ø, ㄹ, ㅅ, ㅋ, ㅆ	ø, ㄴ, ㅆ	ø
ㅅ	ø, ㄱ, ㄳ, ㄴ, ㄷ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄳ, ㄴ, ㄷ, ㄹ, ㄻ, ㄼ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ, ㄲ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㄲ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅊ, ㅌ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄺ, ㄿ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㅍ, ㅆ	ø, ㄴ, ㄹ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ	ø, ㄱ, ㄴ, ㄹ, ㅅ, ㅇ	ø, ㅆ	ø, ㄴ, ㄹ, ㅁ, ㅅ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ
ㅈ	ø, ㄱ, ㄴ, ㄶ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅆ	ø, ㄱ, ㄴ, ㄶ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ, ㅎ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅌ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㄹ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ, ㅆ	ø, ㄴ, ㄹ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ	ø	ø, ㄱ, ㄴ, ㄹ, ㅂ, ㅅ, ㅇ	ø, ㅆ	ø, ㅇ, ㅆ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅆ
ㅊ	ø, ㄱ, ㄴ, ㄶ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅆ	ø, ㄴ, ㄶ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø	ø, ㄱ, ㄴ, ㄹ, ㅇ	ø, ㅅ, ㅆ	ø, ㄴ	ø, ㄴ
ㅋ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ, ㅆ	ø, ㄱ, ㄴ, ㅁ, ㅂ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㅍ, ㅆ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø	ø, ㄹ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄴ, ㄹ, ㅇ	ø, ㄴ, ㄹ, ㅇ	ø, ㄱ, ㄹ, ㅇ
ㅌ	ø, ㄱ, ㄴ, ㄹ, ㄺ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ, ㅆ	ø, ㄴ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ	ø, ㄴ	ø, ㄴ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅂ	ø, ㄴ	ø, ㅆ	ø	ø, ㄴ, ㅆ
ㅍ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㄲ, ㅆ	ø, ㄱ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅂ, ㅅ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅍ	ø, ㄴ, ㄹ, ㅂ, ㅅ	ø, ㄴ	ø, ㄴ, ㄹ, ㅁ, ㅅ	ø, ㄴ	ø, ㅇ	ø, ㅇ
ㅎ	ø, ㄱ, ㄴ, ㄹ, ㄾ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㄾ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄾ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ	ø, ㄴ, ㄹ, ㅂ, ㅅ	ø, ㄱ, ㄴ, ㄹ, ㄾ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄶ, ㄷ, ㄹ, ㄺ, ㄾ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㄹ, ㅂ	ø, ㄱ, ㄴ, ㄹ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅇ
ㄲ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㄲ, ㅆ	ø, ㄱ, ㄴ, ㄹ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㄲ, ㅆ	ø, ㄴ, ㄹ, ㅅ, ㅌ, ㅆ	ø, ㄱ, ㄴ, ㄵ, ㄶ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ	ø	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ	ø	ø, ㄱ, ㄴ, ㄶ, ㄹ, ㄻ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ		ø, ㄱ, ㄴ, ㄹ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㄹ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ, ㅆ
ㄸ	ø, ㄱ, ㄴ, ㄹ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅎ, ㄲ, ㅆ	ø, ㄹ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㄼ, ㅁ, ㅂ, ㅅ, ㅇ, ㅎ, ㅆ	ø, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø	ø, ㄱ, ㄴ, ㄹ, ㅀ, ㅁ, ㅂ, ㅇ	ø	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ		ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ		ø, ㄴ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄴ, ㄹ	ø, ㅆ	ø, ㄱ	ø
ㅃ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㅎ, ㅆ	ø, ㄱ, ㄴ, ㅁ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅂ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅇ		ø	ø	ø	ø	ø
ㅆ	ø, ㄱ, ㄳ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅎ, ㅆ	ø, ㄴ, ㄹ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ, ㄲ, ㅆ	ø	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄴ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄻ, ㅀ, ㅁ, ㅂ, ㅇ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø, ㄴ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅆ	ø, ㄴ	ø, ㄴ, ㄹ, ㅁ	ø, ㄱ, ㄴ, ㄹ, ㅅ, ㅇ, ㅆ	ø, ㅆ	ø, ㄱ, ㅆ	ø, ㄴ, ㅂ
ㅉ	ø, ㄱ, ㄴ, ㄶ, ㄹ, ㄼ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄼ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ, ㅎ	ø, ㄴ, ㅁ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄹ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅅ, ㅇ, ㅈ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅍ, ㅎ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅇ	ø	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅆ	ø	ø, ㄴ	ø, ㄱ, ㄴ, ㄹ, ㅇ, ㅆ	ø, ㅇ, ㅆ	ø, ㅆ	ø
ㅇ	ø, ㄱ, ㄴ, ㄵ, ㄶ, ㄹ, ㄺ, ㄻ, ㄿ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㅍ, ㅎ, ㅆ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄼ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ, ㅎ, ㅆ	ø, ㄱ, ㄴ, ㄵ, ㄷ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅄ, ㅅ, ㅇ, ㅈ, ㅋ, ㅍ, ㅎ, ㅆ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄻ, ㄼ, ㅁ, ㅂ, ㅄ, ㅅ, ㅇ, ㅌ, ㅍ, ㅎ, ㄲ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㄺ, ㄻ, ㄽ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅊ, ㅍ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㄺ, ㄻ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅊ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㄺ, ㄿ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ, ㅋ, ㅌ, ㅍ, ㅎ	ø, ㄱ, ㄴ, ㄹ, ㄺ, ㄻ, ㅀ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅍ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㄹ, ㅂ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ	ø, ㄴ, ㄹ, ㅁ, ㅅ	ø, ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅆ	ø, ㄱ, ㄴ, ㅁ, ㅅ, ㅇ	ø, ㄱ, ㄴ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅌ
Initials	ㅏ	ㅑ	ㅓ	ㅕ	ㅗ	ㅛ	ㅜ	ㅠ	ㅡ	ㅣ	ㅐ	ㅒ	ㅔ	ㅖ	ㅚ	ㅟ	ㅢ	ㅘ	ㅝ	ㅙ	ㅞ
Initials	Vowels

Hanja sets (rows number 45 through 94)

Extended non-syllable, non-hanja sets in KPS 9566:2011

Following are charts for the non-syllable, non-hanja section of KPS 9566-2011 outside of the main plane.[3]

Extension set 0xE0 (symbols and pictographs)

KPS 9566-2011 (prefixed with 0xE0)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_		� E100	� E101	� E102	� E103	◑ 25D1	� E104	� E105	� E106	� E107	⊘ 2298	✉ 2709	☛ 261B	☞ 261E	✌ 270C	� E108
5_	✍ 270D	✏ 270F	✎ 270E	✐ 2710	� E109	✓ 2713	✔ 2714	⊡ 22A1	⎔ 2394	� E10B	� E10C
6_		⊙ 2299
7_	� E11A	� E11B	⚓︎ 2693	� E11D	� E11E	� E11F	� E120	� E121	� E122	☼ 263C	� E123
8_		� E124	� E125	� E126	◉ 25C9	� E127	� E128
9_	� E12E	� E12F	✪ 272A	� E130	✯ 272F	✬ 272C	� E131	✫ 272B	� E132	✮ 272E	✭ 272D	✰ 2730	� E133	✩ 2729
A_		Part of main plane.

Letter Number Punctuation Symbol Other Undefined � Not in Unicode

Extension sets 0xE1, 0xE2, 0xE3 (unknown)

These extension sets map to the private use area. Their purpose is not documented.[3]

KPS 9566-2011 (prefixed with 0xE1)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_		� E138	� E139	� E13A	� E13B	� E13C	� E13D	� E13E	� E13F	� E140	� E141	� E142	� E143	� E144	� E145	� E146
5_	� E147	� E148	� E149	� E14A	� E14B	� E14C	� E14D	� E14E	� E14F	� E150	� E151
6_		� E152	� E153	� E154	� E155	� E156	� E157	� E158	� E159	� E15A	� E15B	� E15C	� E15D	� E15E	� E15F	� E160
7_	� E161	� E162	� E163	� E164	� E165	� E166	� E167	� E168	� E169	� E16A	� E16B
8_		� E16C	� E16D	� E16E	� E16F	� E170	� E171	� E172	� E173	� E174	� E175	� E176	� E177	� E178	� E179	� E17A
9_	� E17B	� E17C	� E17D	� E17E	� E17F	� E180	� E181	� E182	� E183	� E184	� E185	� E186	� E187	� E188	� E189	� E18A
A_	� E18B	Part of main plane.

KPS 9566-2011 (prefixed with 0xE2)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_		� E18C	� E18D	� E18E	� E18F	� E190	� E191	� E192	� E193	� E194	� E195	� E196	� E197	� E198	� E199	� E19A
5_	� E19B	� E19C	� E19D	� E19E	� E19F	� E1A0	� E1A1	� E1A2	� E1A3	� E1A4
6_
7_
8_
9_
A_		Part of main plane.

KPS 9566-2011 (prefixed with 0xE3)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_		� E1E0	� E1E1	� E1E2	� E1E3	� E1E4	� E1E5	� E1E6	� E1E7	� E1E8	� E1E9	� E1EA	� E1EB	� E1EC	� E1ED	� E1EE
5_	� E1EF	� E1F0	� E1F1	� E1F2	� E1F3	� E1F4	� E1F5	� E1F6	� E1F7	� E1F8	� E1F9
6_		� E1FA	� E1FB	� E1FC	� E1FD	� E1FE	� E1FF	� E200	� E201	� E202	� E203	� E204	� E205	� E206
7_
8_		� E214	� E215	� E216	� E217	� E218	� E219
9_
A_		Part of main plane.

Extension set 0xE4 (arrows)

This set includes several, mostly rightward arrows mapping to the Unicode Dingbats block and elsewhere.[3]

KPS 9566-2011 (prefixed with 0xE4)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_		➔ 2794	➘ 2798	➙ 2799	➚ 279A	➛ 279B	➜ 279C	➝ 279D	➟ 279F	➠ 27A0	➢ 27A2	➣ 27A3	➤ 27A4	➥ 27A5	➦ 27A6	➧ 27A7
5_	➨ 27A8	➩ 27A9	➪ 27AA	➫ 27AB	➬ 27AC	➭ 27AD	➮ 27AE	➯ 27AF	➱ 27B1	➲ 27B2	➳ 27B3
6_		➴ 27B4	➵ 27B5	➶ 27B6	➷ 27B7	➸ 27B8	➹ 27B9	➺ 27BA	➻ 27BB	➾ 27BE	➼ 27BC	➽ 27BD	� E234
7_
8_		⟷ 27F7	⇌ 21CC	� E243	� E244	� E245	� E246	⥫ 296B	� E247	⥬ 296C	⇐ 21D0	� E248	⟹ 27F9
9_													� E258	� E259	� E25A	� E25B
A_		Part of main plane.

Letter Number Punctuation Symbol Other Undefined � Not in Unicode

Extension set 0xE5 (Roman superscripts and subscripts)

This row includes several lowercase Roman superscripts with trail bytes corresponding to their uppercase ASCII equivalents, and lowercase Roman subscripts with trail bytes corresponding to their lowercase ASCII equivalents.[3]

KPS 9566-2011 (prefixed with 0xE5)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_		ᵃ 1D43	ᵇ 1D47	ᶜ 1D9C	ᵈ 1D48	ᵉ 1D49	ᶠ 1DA0	ᵍ 1D4D	� E264	� E265	� E266	ᵏ 1D4F	� E268	ᵐ 1D50	ⁿ 207F	ᵒ 1D52
5_	ᵖ 1D56	� E26D	� E26E	� E26F	ᵗ 1D57	ᵘ 1D58	ᵛ 1D5B	� E273	� E274	� E275	� E276
6_		ₐ 2090	� E278	� E279	� E27A	ₑ 2091	� E27C	� E27D	� E27E	ᵢ 1D62	ⱼ 2C7C	� E281	� E282	� E283	� E284	ₒ 2092
7_	� E286	� E287	ᵣ 1D63	� E289	� E28A	ᵤ 1D64	ᵥ 1D65	� E28D	ₓ 2093	� E28F	� E290
8_		� E291	� E292	� E293	� E294	� E295	� E296
9_		� E2A1	� E2A2	� E2A3	� E2A4	� E2A5	� E2A6
A_		Part of main plane.

Letter Number Punctuation Symbol Other Undefined � Not in Unicode

Extension set 0xE6 (Greek and symbol superscripts and subscripts)

KPS 9566-2011 (prefixed with 0xE6)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_		ᵅ 1D45	ᵝ 1D5D	ᵞ 1D5E	ᵟ 1D5F	ᵋ 1D4B	� E2B6	� E2B7	ᶿ 1DBF	� E2B9	� E2BA	� E2BB	� E2BC	ᶹ 1DB9	� E2BE	� E2BF
5_	� E2C0	� E2C1	� E2C2	� E2C3	� E2C4	ᵠ 1D60	ᵡ 1D61	� E2C7	� E2C8
6_		� E2CB	ᵦ 1D66	ᵧ 1D67	� E2CE	� E2CF	� E2D0	� E2D1	� E2D2	� E2D3	� E2D4	� E2D5	� E2D6	� E2D7	� E2D8	� E2D9
7_	� E2DA	ᵨ 1D68	� E2DC	� E2DD	� E2DE	ᵩ 1D69	ᵪ 1D6A	� E2E1	� E2E2
8_		⁺ 207A	⁻ 207B	� E2E5	� E2E6	� E2E7
9_		₊ 208A	₋ 208B	� E2F3	� E2F4	� E2F5
A_		Part of main plane.

Letter Number Punctuation Symbol Other Undefined � Not in Unicode

Extension set 0xE7 (further list markers)

KPS 9566-2011 (prefixed with 0xE7)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_		㉛ 325B	㉜ 325C	㉝ 325D	㉞ 325E	㉟ 325F	㊱ 32B1	㊲ 32B2	㊳ 32B3	㊴ 32B4	㊵ 32B5	㊶ 32B6	㊷ 32B7	㊸ 32B8	㊹ 32B9	㊺ 32BA
5_	㊻ 32BB	㊼ 32BC	㊽ 32BD	㊾ 32BE	㊿ 32BF	� E301	� E302	� E303	� E304	� E305	� E306
6_		� E307	� E308	� E309	� E30A	� E30B	� E30C	� E30D	� E30E	� E30F	� E310	� E311	� E312	� E313	� E314	� E315
7_	� E316	� E317	� E318	� E319	� E31A	� E31B	� E31C	� E31D	� E31E	� E31F	� E320
8_		� E321	� E322	� E323	� E324	� E325	� E326	� E327	� E328
9_
A_		Part of main plane.

Letter Number Punctuation Symbol Other Undefined � Not in Unicode

Extension set 0xE8

KPS 9566-2011 (prefixed with 0xE8)
	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E
4_	� E341	� E342	� E343	� E344	� E345	� E346	� E347	� E348	� E349	� E34A	� E34B	� E34C	� E34D	� E34E
5_
6_	� E35B	� E35C	� E35D	� E35E	� E35F	� E360	� E361	� E362	� E363	� E364	� E365	� E366	� E367	� E368
7_
8_	� E375	� E376	� E377	︰ FE30
9_
A_	Part of main plane.

Letter Number Punctuation Symbol Other Undefined � Not in Unicode

Extension set 0xE9 (additional symbols and punctuation)

This set contains playing card suit symbols, various miscellaneous symbols, and halfwidth counterparts for some of the currency symbols in row 8. The Kelvin sign is also included,[3] having been replaced in row 8 by the euro sign.[1]

KPS 9566-2011 (prefixed with 0xE9)
	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_	∅ 2205	⊗ 2297	� E395	〓 3013		� E397	♧ 2667	♡ 2661	♤ 2664	♢ 2662	� E398	◯ 25EF	⦾ 29BE	� E39A	� E39B
5_
6_	� E3A8	� E3A9	� E3AA
7_
8_	� E3C2	� E3C3	� E3C4	� E3C5	� E3C6	� E3C7	� E3C8	K 212A	₩ 20A9	� E3C9	� E3CA	� E3CB	₤ 20A4	¥ 00A5
9_
A_	Part of main plane.

Letter Number Punctuation Symbol Other Undefined � Not in Unicode

Extension set 0xEA (Japanese punctuation and additional jamo)

This set contains several punctuation marks used in Japan, and characters from the Hangul Compatibility Jamo Unicode block which are not already included in row 4.[3] This amounts to the jamo characters present in KS X 1001, but previously absent in KPS 9566.

KPS 9566-2011 (prefixed with 0xEA)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
4_		ヽ 30FD	ヾ 30FE	ゝ 309D	ゞ 309E	々 3005	〆 3006	〇 3007	ー 30FC
5_		� E3E8	� E3E9
6_		ㅥ 3165	ㅭ 316D	ㅱ 3171	ㅲ 3172	ㅳ 3173	ㅴ 3174	ㅵ 3175	ㅶ 3176	ㅷ 3177	ㅸ 3178	ㅹ 3179	ㅺ 317A	ㅻ 317B	ㅽ 317D	ㅾ 317E
7_	ㆀ 3180	ㆄ 3184	ㆅ 3185	ㆇ 3187	ㆈ 3188	ㆉ 3189	ㆊ 318A	ㆋ 318B	ㆌ 318C	ᆞ 119E	ㆎ 318E
8_
9_
A_		Part of main plane.

Letter Number Punctuation Symbol Other Undefined � Not in Unicode

Footnotes

For instance, the headings of the ISO-IR-202 chart show 7-bit binary codes, as well as kuten/hang-yol codes, for the characters).[21]
As a ISO 2022 compatible 94ⁿ-character set, the plain space and delete character are always available as single-byte codes at 0x20 and 0x7F (not 0xA0 and 0xFF) respectively.
A halfwidth such character is present in row 14, this is specifically a fullwidth character.
A vertical form of the tilde dash. The mapping file provided by the Unicode Consortium acknowledges by-name mapping to U+2E2F,[1] which is used by Red Star OS,[6] but notes that the Unicode character is intended for a significantly different character (a spacing vertical-tilde high diacritic) and also lists the mapping U+F104 (in the Private Use Area),[1] based on mapping data which had been submitted to the OpenOffice.org project in 2004.[20] Shown here using an image.
A character combining a period with a closing bracket, mapped to Private Use Area, shown here substituted.
Mapped to Private Use Area, shown here using an image.
Mac OS Korean (HangulTalk), an encoding of Wansung code plus extension sets, encodes a visually similar character at 0xA79B,[60] which Apple maps to the Unicode sequence U+25B4+20E4 (▴⃤).[61] There is no documented use of this mapping for the KPS 9566 character, however.
An emboldened/emphasised character from the name of a North Korean leader, mapped to Private Use Area, shown here simulated with markup.
Form of a fraction with a horizontal bar and vertical arrangement, mapped to Private Use Area, shown here simulated.
Degrees Kelvin in 1997 version (some versions of the code chart include a degree sign in the unit symbol). Euro as of 2003 version.
Emblem of the Workers' Party of Korea, mapped to Private Use Area, shown here using an image.
Listed in 1997 version charts and in Unicode proposal N2374 from 2001. Removed in 2003 version.
Mapped to U+261E (☞) in the 2003 edition.[1] The 2011 edition instead maps it to the Private Use Area character U+F13B.[3] The reference glyph is a backhand manicule,[21][3] i.e. matching U+1F449 (👉︎). Compare 0xE04D in KPS 9566-2011.
Circled upward-pointing manicule, mapped to Private Use Area,[1] shown here using an image. One possible non-PUA mapping would be to the sequence U+1F446+20DD (👆︎⃝).[6]
Up-left pointing scissors, mapped to Private Use Area, shown here using an image.

References

"KPS 9566-2003 to Unicode". Unicode Consortium.
Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 148–151. ISBN 978-0-596-51447-1.
Chung, Jaemin (2018-01-05). "Information on the most recent version of KPS 9566 (KPS 9566-2011?)" (PDF). UTC L2/18-011.
Cho, Chun-Hui (2000-07-05). "DPRK letter on character names and ordering in 10646-1: 2000" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2231.
Ewell, Doug (2002-08-15). "Re: Scripts in Unicode 4.0". Unicode Mail List Archive.
West, Andrew (2015-05-29). "KPS 9566 mappings (was Re: Arrow dingbats)". Unicode Mailing List Archive.
Jennings, Thomas Daniel (2020-03-17) [1999]. "An annotated history of some character codes or ASCII: American Standard Code for Information Infiltration". Sensitive research (SR-IX). Archived from the original on 2016-05-22. Retrieved 2020-03-17.
"Standard ECMA-6: 7-bit Coded Character Set". Ecma International.
Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. p. 89. ISBN 978-0-596-51447-1.
ECMA/TC 1 (1973). "Brief History". 7-bit Input/Output Coded Character Set (PDF) (4th ed.). ECMA. ECMA-6:1973.
ECMA (1994). Character Code Structure and Extension Techniques (PDF) (6th ed.). ECMA-35:1994.
Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 19–20, 581–582. ISBN 978-0-596-51447-1.
Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 94–147. ISBN 978-0-596-51447-1.
"2.4: Multiple byte graphic character sets". International Register of Coded Character Sets to be Used With Escape Sequences (ISO-IR) (PDF). ITSCJ/IPSJ. p. 14.
Hwang, Jinsang (2005). The Social Shaping of ICTs Standards: A Case of National Coded Character Set Standards Controversy in Korea (PDF). University of Edinburgh.
Lunde, Ken (1995-12-18). "3.3.6: N-byte Hangul". CJK.INF Version 1.9.
Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 242–255. ISBN 978-0-596-51447-1.
Shin, Jungshik. "What are KS X 1001(KS C 5601) and other Hangul codes?". Hangul & Internet in Korea FAQ.
Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Evidence for arrangement of Korean characters proposed by CSK" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2246.
"Conversion tables between KPS 9566-2003(N. Korean) & Unicode". Apache OpenOffice (AOO) Bugzilla. 2004-08-27.
Committee for Standardization of D. P. R. of Korea (1998-06-22). DPRK Standard Korean Graphic Character Set for Information Interchange (PDF). ITSCJ/IPSJ. ISO-IR-202.
Unicode Consortium. "History of Unicode Release and Publication Dates".
West, Andrew (2019-06-17) [2007-06-05]. "Unicode and ISO/IEC 10646".
Murata, Makoto (14 April 2000). "XML Japanese Profile". W3C Notes. W3C.
van Kesteren, Anne. Encoding Standard. WHATWG.
Lunde, Ken (1999). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing. Sebastopol, CA: O'Reilly. p. 116. ISBN 1-56592-224-7.
Cook, Richard. "Q: Why are DPRK (North Korean == kIRG_KPSource) glyphs missing from some CJK code charts?". FAQ - Chinese and Japanese. Unicode Consortium.
Jenkins, John H.; Cook, Richard; Lunde, Ken (2020-03-05). "Unicode Han Database (Unihan)". kIRG_KPSource. Unicode Standard Annex #38.
Yergeau, F. (1998). "UTF-8, a transformation format of ISO 10646". IETF. doi:10.17487/rfc2279. RFC 2279.
"Unicode Character Encoding Stability Policies". Unicode Consortium. 2017-06-23.
Jo, Chun-Hui (1999-08-10). "Amendment of the part containing the Korean characters in ISO/IEC 10646-1:1998 amendment 5" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2056.
"New Work item proposal (NP) for an amendment of the Korean part of ISO/IEC 10646-1:1993". 1999-12-07. L2/99-380, ISO/IEC JTC 1 N5999.
Karlsson, Kent (2000-03-02). "Comments on DPRK New Work Item proposal on Korean characters". ISO/IEC JTC 1/SC 2/WG 2 N2167.
Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Proposal for the addition of 14 Korean alphabets to ISO/IEC 10646-1" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2243.
Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Proposal for the addition of 82 symbols to ISO/IEC 10646-1" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2244.
Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Proposal to change the existing name of Korean characters in ISO/IEC 10646-1" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2245.
Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Proposal to add the hanja column of D. P. R. of Korea in ISO/IEC 10646-1 (14938 ideographs to CJK Unified Ideographs and 3181 ideographs to its Extention A)" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2247.
Korean script ad hoc group (2000-09-21). "Report of the meeting of the Korean script ad hoc group". ISO/IEC JTC 1/SC 2/WG 2 N2282.
Committee for Standardization of the D P R of Korea (CSK) (2001-09-03). Proposal to add of 70 symbols to ISO/IEC 10646-1:2000 (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2374.
Committee for Standardization of the D P R of Korea (CSK) (2001-09-03). Proposal to add the 160 Compatibility Hanja code table of D P R of Korea into CJK Compatibility Ideographs (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2375.
Gim, Gyeongseog (2001-10-13). ROK's Comments about DPRK's proposal, WG2 N 2374, to add 70 symbols to ISO/IEC 10646-1:2000 (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2390.
Korean Script ad hoc group (2001-10-16). A Report of Korean Script ad hoc group meeting on Oct. 15, 2001 (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2392, UTC L2/01-388.
Freytag, Asmus (2002-02-13). "Notes on proposed Symbols from DPRK" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2417, UTC L2/02-102.
Emojipedia. "Unicode 4.0 Emoji". Emojipedia.
"Miscellaneous Symbols" (PDF). Unicode 4.0.0 Delta Code Charts. Unicode Consortium.
Whistler, Ken (2015-05-28). "Re: Arrow dingbats". Unicode Mail List Archive.
"Miscellaneous Symbols and Arrows" (PDF). Unicode 4.0.0 Delta Code Charts. Unicode Consortium.
Overington, William (2003-02-24). "Unicode 4.0 beta characters".
"Miscellaneous Symbols" (PDF). Unicode 3.2.0 Delta Code Charts. Unicode Consortium.
The Unicode 4.0 code chart shows the modified glyph,[45] whereas the Unicode 3.2 code chart shows the previous glyph.[49]
Scherer, Markus; Davis, Mark; Momoi, Kat; Tong, Darick; Kida, Yasuo; Edberg, Peter. "Emoji Symbols: Background Data—Background data for Proposal for Encoding Emoji Symbols" (PDF). UTC L2/10-132.
Suignard, Michel (2007-09-18). "Japanese TV Symbols" (PDF). UTC L2/07-391, ISO/IEC JTC 1/SC 2/WG 2 N3341.
Unicode Consortium (2020). "Emoji Versions & Sources, v13.0".
Emojipedia. "Unicode 5.2 Emoji List". Emojipedia.
Emojipedia. "Waving White Flag Emoji". Emojipedia.
Emojipedia. "Waving Black Flag Emoji". Emojipedia.
Marin Silva, Eduardo (2018). Proposal to reconsider compatibility symbols and punctuation used in the DPRK (PDF). UTC L2/18-004.
Korean Script ad hoc group (2001-10-16). A Report of Korean Script ad hoc group meeting on Oct. 15, 2001 (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2392, UTC L2/01-388. D P R of Korea suggested that they would review this character more carefully before it is discussed again at Korean Script ad hoc group or WG2.
Marín Silva, Eduardo (2018). Proposal to encode: SYMBOL FOR TYPE A ELECTRONICS (PDF). UTC L2/18-184R.
Lunde, Ken (2009). "Appendix E: Vendor Character Set Standards" (PDF). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. ISBN 978-0-596-51447-1.
Apple (2005-04-05). "Map (external version) from Mac OS Korean encoding to Unicode 3.2 and later". Unicode Consortium.
Czyborra, Roman (1998-11-30) [1998-05-25]. "The Cyrillic Charset Soup". Archived from the original on 2016-12-03. Retrieved 2016-12-03.
This table is generated from KPS9566.TXT.[1]

External links

KPS 9566-97 code table from ISO-IR registry
KPS 9566-2003 to Unicode mapping
KPS 9566-2011 code table and mapping reverse engineered from Red Star OS

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[60] For instance, the headings of the ISO-IR-202 chart show 7-bit binary codes, as well as kuten/hang-yol codes, for the characters).[21]

[iso2022fixed-61] As a ISO 2022 compatible 94ⁿ-character set, the plain space and delete character are always available as single-byte codes at 0x20 and 0x7F (not 0xA0 and 0xFF) respectively.

[zenk-62] A halfwidth such character is present in row 14, this is specifically a fullwidth character.

[63] A vertical form of the tilde dash. The mapping file provided by the Unicode Consortium acknowledges by-name mapping to U+2E2F,[1] which is used by Red Star OS,[6] but notes that the Unicode character is intended for a significantly different character (a spacing vertical-tilde high diacritic) and also lists the mapping U+F104 (in the Private Use Area),[1] based on mapping data which had been submitted to the OpenOffice.org project in 2004.[20] Shown here using an image.

[dotbrack-64] A character combining a period with a closing bracket, mapped to Private Use Area, shown here substituted.

[triangle-65] Mapped to Private Use Area, shown here using an image.

[68] Mac OS Korean (HangulTalk), an encoding of Wansung code plus extension sets, encodes a visually similar character at 0xA79B,[60] which Apple maps to the Unicode sequence U+25B4+20E4 (▴⃤).[61] There is no documented use of this mapping for the KPS 9566 character, however.

[leader-69] An emboldened/emphasised character from the name of a North Korean leader, mapped to Private Use Area, shown here simulated with markup.

[vertfrac-71] Form of a fraction with a horizontal bar and vertical arrangement, mapped to Private Use Area, shown here simulated.

[72] Degrees Kelvin in 1997 version (some versions of the code chart include a degree sign in the unit symbol). Euro as of 2003 version.

[wpkemblem-73] Emblem of the Workers' Party of Korea, mapped to Private Use Area, shown here using an image.

[74] Listed in 1997 version charts and in Unicode proposal N2374 from 2001. Removed in 2003 version.

[75] Mapped to U+261E (☞) in the 2003 edition.[1] The 2011 edition instead maps it to the Private Use Area character U+F13B.[3] The reference glyph is a backhand manicule,[21][3] i.e. matching U+1F449 (👉︎). Compare 0xE04D in KPS 9566-2011.

[76] Circled upward-pointing manicule, mapped to Private Use Area,[1] shown here using an image. One possible non-PUA mapping would be to the sequence U+1F446+20DD (👆︎⃝).[6]

[77] Up-left pointing scissors, mapped to Private Use Area, shown here using an image.

[kps9566txt-1] "KPS 9566-2003 to Unicode". Unicode Consortium.

[lunde2009-2] Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 148–151. ISBN 978-0-596-51447-1.

[utc-L2-18-011-3] Chung, Jaemin (2018-01-05). "Information on the most recent version of KPS 9566 (KPS 9566-2011?)" (PDF). UTC L2/18-011.

[wg2-n2231-4] Cho, Chun-Hui (2000-07-05). "DPRK letter on character names and ordering in 10646-1: 2000" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2231.

[ewellflags-5] Ewell, Doug (2002-08-15). "Re: Scripts in Unicode 4.0". Unicode Mail List Archive.

[westtilde-6] West, Andrew (2015-05-29). "KPS 9566 mappings (was Re: Arrow dingbats)". Unicode Mailing List Archive.

[jennings-7] Jennings, Thomas Daniel (2020-03-17) [1999]. "An annotated history of some character codes or ASCII: American Standard Code for Information Infiltration". Sensitive research (SR-IX). Archived from the original on 2016-05-22. Retrieved 2020-03-17.

[8] "Standard ECMA-6: 7-bit Coded Character Set". Ecma International.

[lunde2009-ascii-9] Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. p. 89. ISBN 978-0-596-51447-1.

[10] ECMA/TC 1 (1973). "Brief History". 7-bit Input/Output Coded Character Set (PDF) (4th ed.). ECMA. ECMA-6:1973.

[ecma35-11] ECMA (1994). Character Code Structure and Extension Techniques (PDF) (6th ed.). ECMA-35:1994.

[lunde2009-kuten-12] Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 19–20, 581–582. ISBN 978-0-596-51447-1.

[lunde2009-othercjksets-13] Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 94–147. ISBN 978-0-596-51447-1.

[14] "2.4: Multiple byte graphic character sets". International Register of Coded Character Sets to be Used With Escape Sequences (ISO-IR) (PDF). ITSCJ/IPSJ. p. 14.

[Hwang-15] Hwang, Jinsang (2005). The Social Shaping of ICTs Standards: A Case of National Coded Character Set Standards Controversy in Korea (PDF). University of Edinburgh.

[cjkinf336-16] Lunde, Ken (1995-12-18). "3.3.6: N-byte Hangul". CJK.INF Version 1.9.

[lunde2009-euc-17] Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 242–255. ISBN 978-0-596-51447-1.

[shin-18] Shin, Jungshik. "What are KS X 1001(KS C 5601) and other Hangul codes?". Hangul & Internet in Korea FAQ.

[wg2-n2246-19] Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Evidence for arrangement of Korean characters proposed by CSK" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2246.

[openoffice-20] "Conversion tables between KPS 9566-2003(N. Korean) & Unicode". Apache OpenOffice (AOO) Bugzilla. 2004-08-27.

[ir202-21] Committee for Standardization of D. P. R. of Korea (1998-06-22). DPRK Standard Korean Graphic Character Set for Information Interchange (PDF). ITSCJ/IPSJ. ISO-IR-202.

[22] Unicode Consortium. "History of Unicode Release and Publication Dates".

[23] West, Andrew (2019-06-17) [2007-06-05]. "Unicode and ISO/IEC 10646".

[24] Murata, Makoto (14 April 2000). "XML Japanese Profile". W3C Notes. W3C.

[25] van Kesteren, Anne. Encoding Standard. WHATWG.

[lunde1999-26] Lunde, Ken (1999). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing. Sebastopol, CA: O'Reilly. p. 116. ISBN 1-56592-224-7.

[27] Cook, Richard. "Q: Why are DPRK (North Korean == kIRG_KPSource) glyphs missing from some CJK code charts?". FAQ - Chinese and Japanese. Unicode Consortium.

[28] Jenkins, John H.; Cook, Richard; Lunde, Ken (2020-03-05). "Unicode Han Database (Unihan)". kIRG_KPSource. Unicode Standard Annex #38.

[29] Yergeau, F. (1998). "UTF-8, a transformation format of ISO 10646". IETF. doi:10.17487/rfc2279. RFC 2279.

[30] "Unicode Character Encoding Stability Policies". Unicode Consortium. 2017-06-23.

[31] Jo, Chun-Hui (1999-08-10). "Amendment of the part containing the Korean characters in ISO/IEC 10646-1:1998 amendment 5" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2056.

[l2-99-380-32] "New Work item proposal (NP) for an amendment of the Korean part of ISO/IEC 10646-1:1993". 1999-12-07. L2/99-380, ISO/IEC JTC 1 N5999.

[wg2-n2167-33] Karlsson, Kent (2000-03-02). "Comments on DPRK New Work Item proposal on Korean characters". ISO/IEC JTC 1/SC 2/WG 2 N2167.

[wg2-n2243-34] Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Proposal for the addition of 14 Korean alphabets to ISO/IEC 10646-1" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2243.

[wg2-n2244-35] Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Proposal for the addition of 82 symbols to ISO/IEC 10646-1" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2244.

[wg2-n2245-36] Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Proposal to change the existing name of Korean characters in ISO/IEC 10646-1" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2245.

[wg2-n2247-37] Committee for Standardization of the D P R of Korea (CSK) (2000-08-10). "Proposal to add the hanja column of D. P. R. of Korea in ISO/IEC 10646-1 (14938 ideographs to CJK Unified Ideographs and 3181 ideographs to its Extention A)" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2247.

[wg2-n2282-38] Korean script ad hoc group (2000-09-21). "Report of the meeting of the Korean script ad hoc group". ISO/IEC JTC 1/SC 2/WG 2 N2282.

[wg2-n2374-39] Committee for Standardization of the D P R of Korea (CSK) (2001-09-03). Proposal to add of 70 symbols to ISO/IEC 10646-1:2000 (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2374.

[wg2-n2375-40] Committee for Standardization of the D P R of Korea (CSK) (2001-09-03). Proposal to add the 160 Compatibility Hanja code table of D P R of Korea into CJK Compatibility Ideographs (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2375.

[wg2-n2390-41] Gim, Gyeongseog (2001-10-13). ROK's Comments about DPRK's proposal, WG2 N 2374, to add 70 symbols to ISO/IEC 10646-1:2000 (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2390.

[wg2-n2392-42] Korean Script ad hoc group (2001-10-16). A Report of Korean Script ad hoc group meeting on Oct. 15, 2001 (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2392, UTC L2/01-388.

[utc-L2-02-102-43] Freytag, Asmus (2002-02-13). "Notes on proposed Symbols from DPRK" (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2417, UTC L2/02-102.

[emojipediaU4.0-44] Emojipedia. "Unicode 4.0 Emoji". Emojipedia.

[U40-2600-45] "Miscellaneous Symbols" (PDF). Unicode 4.0.0 Delta Code Charts. Unicode Consortium.

[whistlerarrows-46] Whistler, Ken (2015-05-28). "Re: Arrow dingbats". Unicode Mail List Archive.

[U40-2B00-47] "Miscellaneous Symbols and Arrows" (PDF). Unicode 4.0.0 Delta Code Charts. Unicode Consortium.

[overington-48] Overington, William (2003-02-24). "Unicode 4.0 beta characters".

[U32-2600-49] "Miscellaneous Symbols" (PDF). Unicode 3.2.0 Delta Code Charts. Unicode Consortium.

[50] The Unicode 4.0 code chart shows the modified glyph,[45] whereas the Unicode 3.2 code chart shows the previous glyph.[49]

[utcL210132-51] Scherer, Markus; Davis, Mark; Momoi, Kat; Tong, Darick; Kida, Yasuo; Edberg, Peter. "Emoji Symbols: Background Data—Background data for Proposal for Encoding Emoji Symbols" (PDF). UTC L2/10-132.

[52] Suignard, Michel (2007-09-18). "Japanese TV Symbols" (PDF). UTC L2/07-391, ISO/IEC JTC 1/SC 2/WG 2 N3341.

[53] Unicode Consortium (2020). "Emoji Versions & Sources, v13.0".

[emojipediaU5.2-54] Emojipedia. "Unicode 5.2 Emoji List". Emojipedia.

[55] Emojipedia. "Waving White Flag Emoji". Emojipedia.

[56] Emojipedia. "Waving Black Flag Emoji". Emojipedia.

[57] Marin Silva, Eduardo (2018). Proposal to reconsider compatibility symbols and punctuation used in the DPRK (PDF). UTC L2/18-004.

[wg2-n2392-postal-58] Korean Script ad hoc group (2001-10-16). A Report of Korean Script ad hoc group meeting on Oct. 15, 2001 (PDF). ISO/IEC JTC 1/SC 2/WG 2 N2392, UTC L2/01-388. D P R of Korea suggested that they would review this character more carefully before it is discussed again at Korean Script ad hoc group or WG2.

[utc-L2-18-184r-59] Marín Silva, Eduardo (2018). Proposal to encode: SYMBOL FOR TYPE A ELECTRONICS (PDF). UTC L2/18-184R.

[lunde2009appE-66] Lunde, Ken (2009). "Appendix E: Vendor Character Set Standards" (PDF). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. ISBN 978-0-596-51447-1.

[67] Apple (2005-04-05). "Map (external version) from Mac OS Korean encoding to Unicode 3.2 and later". Unicode Consortium.

[Czyborra_1998_Cyrillic-70] Czyborra, Roman (1998-11-30) [1998-05-25]. "The Cyrillic Charset Soup". Archived from the original on 2016-12-03. Retrieved 2016-12-03.

[78] This table is generated from KPS9566.TXT.[1]

Character encodings
Early telecommunications	Telegraph code Needle Morse Non-Latin Wabun/Kana Chinese Cyrillic Korean Baudot and Murray FIELDATA ASCII ISO/IEC 646 BCDIC EBCDIC Teletex and Videotex/Teletext ISO/IEC 6937 / ITU T.51 ITU T.61 ITU T.101 World System Teletext background sets
ISO/IEC 8859	Approved -1 -2 -3 -4 -5 -6 -7 -8 -9 -10 -11 -13 -14 -15 -16 Abandoned -12 Adaptations ISO-IR-182 ISO-IR-200 ISO-IR-201 Proposed but not approved ISO-IR-111 ISO-IR-197 French/Dutch/Turkish draft
Bibliographic use	ANSEL ISO 5426 / 5426-2 / 5427 / 5428 / 6438 / 6861 / 6862 / 10585 / 10586 / 10754 / 11822 MARC-8
National standards	ArmSCII BraSCII CNS 11643 ELOT 927 GOST 10859 GB 18030 HKSCS I.S. 434 ISCII JIS X 0201 JIS X 0208 JIS X 0212 JIS X 0213 KOI-7 KPS 9566 KS X 1001 LST 1284 LST 1564 LST 1590-1 LST 1590-2 LST 1590-3 LST 1590-4 PASCII RUSCII SI 960 TIS-620 TSCII VISCII VSCII YUSCII
ISO/IEC 2022	7-bit CN CN-EXT JP JP-EXT JP-1 JP-2 JP-3 KR ISO/IEC 4873 ISO/IEC 8859 ISO/IEC 10367 Extended Unix Code / EUC CN KR JP TW CCCII
MacOS code pages ("scripts")	Armenian Arabic Barents Cyrillic Celtic CentEuro ChineseSimp / EUC-CN ChineseTrad / Big5 Croatian Cyrillic Devanagari Dingbats Farsi (Persian) Gaelic Georgian Greek Gujarati Gurmukhi Hebrew Iceland Inuit Japanese / ShiftJIS Keyboard Korean / EUC-KR Latin (Kermit) Maltese/Esperanto Ogham / I.S. 434 Roman Romanian Sámi Symbol Thai / TIS-620 Turkish Turkic Cyrillic Ukrainian VT100
DOS code pages	100 111 112 113 151 152 161 162 163 164 165 166 210 220 301 437 449 489 620 667 668 707 708 709 710 711 714 715 720 721 737 768 770 771 772 773 774 775 776 777 778 790 850 851 852 853 854 855/872 856 857 858 859 860 861 862 863 864 865 866/808 867 868 869 874/1161/1162 876 877 878 881 882 883 884 885 891 895 896 897 898 899 900 903 904 906 907 909 910 911 926 927 928 929 932 934 936 938 941 942 943 944 946 947 948 949 950/1370 951 966 991 1034 1039 1040 1041 1042 1043 1044 1046 1086 1088 1092 1093 1098 1108 1109 1114 1115 1116 1117 1118 1119 1125/848 1126 1127 1131/849 1139 1167 1168 1300 1351 1361 1362 1363 1372 1373 1374 1375 1380 1381 1385 1386 1391 1392 1393 1394 3012 3021 3843 3844 3845 3846 3847 3848 30000 30001 30002 30003 30004 30005 30006 30007 30008 30009 30010 30011 30012 30013 30014 30015 30016 30017 30018 30019 30020 30021 30022 30023 30024 30025 30026 30027 30028 30029 30030 30031 30032 30033 30034 30039 30040 58152 58210 58335 59234 59829 60258 60853 61282 62306 CS Indic CSX Indic CSX+ Indic CWI-2 Iran System Kamenický KOI8 Mazovia MIK
IBM AIX code pages	367 371 806 813 819 895 896 912 913 914 915 916 919 920 921/901 922/902 923 952 953 954 955 956 957 958 959 960 961 963 964 965 970 971 1004 1006 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1029 1036 1089 1111 1124 1129/1163 1133 1350 1382 1383
IBM Apple Macintosh emulations	1275 1280 1281 1282 1283 1284 1285 1286
IBM Adobe emulations	1038 1276 1277
IBM DEC emulations	1020 1021 1023 1090 1100 1101 1102 1103 1104 1105 1106 1107 1287 1288
IBM HP emulations	1050 1051 1052 1053 1054 1055 1056 1057 1058
Windows code pages	CER-GS 874/1162 (TIS-620) 932/943 (Shift JIS) 936/1386 (GBK) 950/1370 (Big5) 949/1363 (EUC-KR) 1169 1174 Extended Latin-8 1200 (UTF-16LE) 1201 (UTF-16BE) 1250 1251 1252 1253 1254 1255 1256 1257 1258 1261 1270 54936 (GB18030) Armenian Cyrillic + Finnish Cyrillic + French Cyrillic + German Georgian Polytonic Greek 65001 (UTF-8)
EBCDIC code pages	1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37/1140 37-2 38 39 40 251 252 254 256 257 258 259 260 264 273/1141 274 275 276 277/1142 278/1143 279 280/1144 281 282 283 284/1145 285/1146 286 287 288 289 290 293 297/1147 298 300 310 320 321 322 330 351 352 353 355 357 358 359 360 361 363 382 383 384 385 386 387 388 389 390 391 392 393 394 395 410 420 421 423 424 425 435 500/1148 803 829 833 834 835 836 837 838/1160 839 870/1110/1153 871/1149 875 880 881 882 883 884 885 886 887 888 889 890 892 893 905 918 924 930/1390 931 933/1364 935/1388 937/1371 939/1399 1001 1002 1003 1005 1007 1024 1025/1154 1026/1155 1027 1028 1030 1031 1032 1033 1037 1047 1068 1069 1070 1071 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1087 1091 1097 1112/1156 1113 1122/1157 1123/1158 1130/1164 1132 1136 1137 1150 1151 1152 1159 1165 1166 1278 1279 1303 1364 1376 1377 JEF KEIS
DEC terminals (VTx)	Multinational (MCS) National Replacement (NRCS) French Canadian Swiss Spanish United Kingdom Dutch Finnish French Norwegian and Danish Swedish Norwegian and Danish (alternative) 8-bit Greek 8-bit Turkish 7-bit Hebrew 8-bit Hebrew Special Graphics Technical (TCS)
Platform specific	Acorn Adobe Standard Adobe Latin 1 Amstrad CPC Apple I Apple II Apple III ATASCII Atari ST BICS Casio calculators CDC Compucolor II CP/M+ DEC RADIX 50 DEC MCS/NRCS DG International ELWRO-Junior FIELDATA GEM GEOS GSM 03.38 HP Roman Extension HP Roman-8 HP Roman-9 HP FOCAL HP RPL LICS LMBCS Mattel Aquarius Minitel MSX NEC APC NeXT OricSCII PCW PETSCII Sega SC-3000 Sharp calculators Sharp MZ Sinclair QL SQUOZE Teletext TI calculators TRS-80 Ventura International Ventura Symbol WISCII XCCS ZX80 ZX81 ZX Spectrum
Unicode / ISO/IEC 10646	UTF-1 UTF-7 UTF-8 UTF-16 (UTF-16LE/UTF-16BE) / UCS-2 UTF-32 (UTF-32LE/UTF-32BE) / UCS-4 UTF-EBCDIC GB 18030 BOCU-1 CESU-8 SCSU
TeX typesetting system	Cork IL1 IL2 IL3 L7X LGR LY1 OML OMS OMX OT1 OT2 OT3 OT4 PL0 QX T2A T2B T2C T2D T3 T4 T5 TS1 TS3 U X2
Miscellaneous code pages	ABICOMP APL 293 310 (Graphic Escape) 351 (GDDM) 907 (OEM) ISO-IR-68 ARIB STD-B24 HZ IEC-P27-1 INIS 7-bit 8-bit Cyrillic ISO-IR-169 ISO 2033 Johab SEASCII Stanford/ITS TACE16 TRON UTF-5 UTF-6 WTF-8
Related topics	Code page Control character (C0 C1) CCSID Character encodings in HTML Charset detection Han unification Hardware ISO 6429/IEC 6429/ANSI X3.64 Mojibake
Character sets