ISO/IEC 8859-14

ISO/IEC 8859-14:1998, Information technology — 8-bit single-byte coded graphic character sets — Part 14: Latin alphabet No. 8 (Celtic), is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1998. It is informally referred to as Latin-8 or Celtic. It was designed to cover the Celtic languages, such as Irish, Manx, Scottish Gaelic, Welsh, Cornish, and Breton.

ISO/IEC 8859-14
MIME / IANAISO-8859-14
Alias(es)iso-ir-199, latin8, iso-celtic, l8[1]
Language(s)Irish, Manx, Scottish Gaelic, Welsh, Cornish, Breton, English
StandardISO/IEC 8859-14:1998
ClassificationISO/IEC 8859 (Extended ASCII, ISO/IEC 4873 level 1)
Based onISO-IR-182

ISO-8859-14 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. CeltScript made an extension for Windows called Extended Latin-8. Microsoft has assigned code page 28604 a.k.a. Windows-28604 to ISO-8859-14.[2]

History

ISO-8859-14 was originally proposed for the Sami languages.[3] ISO 8859-12 was proposed for Celtic.[4] Later, ISO 8859-12 was proposed for Devanagari, so the Celtic proposal was changed to ISO 8859-14. The Sami proposal was changed to ISO 8859-15,[5] but it got rejected as an ISO/IEC 8859 part, although it was registered as ISO-IR-197.[6]

The original proposal used a different arrangement of points 0xA1–BF.[4] At the committee draft stage of the specification, a dotless i was included at 0xAE,[7] which was changed to a registered trademark sign (matching ISO-8859-1) in the final publication.

ISO-IR-182, an earlier (registered in 1994) modification of ISO-8859-1, had added the letters Ẁ, Ẃ, Ẅ, Ỳ, Ÿ, Ŵ, Ŷ and their lowercase forms (except for ÿ, which was already included) for Welsh language use.[8] The final published version of ISO-8859-14 includes these letters in the same positions which they appear at in ISO-IR-182.

Codepage layout

ISO/IEC 8859-14[9][10]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
0_
0
1_
16
2_
32
SP
0020
!
0021
"
0022
#
0023
$
0024
%
0025
&
0026
'
0027
(
0028
)
0029
*
002A
+
002B
,
002C
-
002D
.
002E
/
002F
3_
48
0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039
:
003A
;
003B
<
003C
=
003D
>
003E
?
003F
4_
64
@
0040
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
5_
80
P
0050
Q
0051
R
0052
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A
[
005B
\
005C
]
005D
^
005E
_
005F
6_
96
`
0060
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
7_
112
p
0070
q
0071
r
0072
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A
{
007B
|
007C
}
007D
~
007E
8_
128
9_
144
A_
160
NBSP
00A0

1E02

1E03
£
00A3
Ċ
010A
ċ
010B

1E0A
§
00A7

1E80
©
00A9

1E82

1E0B

1EF2
SHY
00AD
®
00AE
Ÿ
0178
B_
176

1E1E

1E1F
Ġ
0120
ġ
0121

1E40

1E41

00B6

1E56

1E81

1E57

1E83

1E60

1EF3

1E84

1E85

1E61
C_
192
À
00C0
Á
00C1
Â
00C2
Ã
00C3
Ä
00C4
Å
00C5
Æ
00C6
Ç
00C7
È
00C8
É
00C9
Ê
00CA
Ë
00CB
Ì
00CC
Í
00CD
Î
00CE
Ï
00CF
D_
208
Ŵ
0174
Ñ
00D1
Ò
00D2
Ó
00D3
Ô
00D4
Õ
00D5
Ö
00D6

1E6A
Ø
00D8
Ù
00D9
Ú
00DA
Û
00DB
Ü
00DC
Ý
00DD
Ŷ
0176
ß
00DF
E_
224
à
00E0
á
00E1
â
00E2
ã
00E3
ä
00E4
å
00E5
æ
00E6
ç
00E7
è
00E8
é
00E9
ê
00EA
ë
00EB
ì
00EC
í
00ED
î
00EE
ï
00EF
F_
240
ŵ
0175
ñ
00F1
ò
00F2
ó
00F3
ô
00F4
õ
00F5
ö
00F6

1E6B
ø
00F8
ù
00F9
ú
00FA
û
00FB
ü
00FC
ý
00FD
ŷ
0177
ÿ
00FF

  Letter  Number  Punctuation  Symbol  Other  Undefined   Differences from ISO-8859-1

Draft

The first draft had positions A0-BF different. It did not include the pilcrow sign, but included the cent sign instead at its Latin-1 position. Later, it was ruled that the pilcrow sign was more common, so the pilcrow sign remains at its Latin-1 position, and the cent sign was removed instead.

Draft layout

Differences from the final, published version of ISO/IEC 8859-14 are boxed. Only A0-BF is shown, the rest corresponding to the current ISO 8859-14.

ISO/IEC 8859-14 draft proposal[4]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
A_
160
NBSP
00A0

1E02
¢
00A2
£
00A3

1E03
Ċ
010A
ċ
010B
§
00A7

1E80
©
00A9

1E82

1E60

1EF2
SHY
00AD
®
00AE
Ÿ
0178
B_
176

1E0A

1E0B

1E1E

1E1F
Ġ
0120
ġ
0121

1E40

1E41

1E81

1E56

1E83

1E61

1EF3

1E84

1E85

1E57

References

  1. Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12
  2. "SheetJS/js-codepage". GitHub.
  3. Everson, Michael. "Proposed ISO 8859-14 (later 15)".
  4. Everson, Michael. "Proposed ISO 8859-12 (later 14)".
  5. Everson, Michael (1996-06-19). Proposal for a new part of ISO/IEC 8859: Latin alphabet No. 9 (Sámi).
  6. Swedish Institute for Standards (1997-01-24). ISO-IR-197: Sami supplementary Latin set (PDF). ITSCJ/IPSJ.
  7. Everson, Michael (1997-05-05). "ISO/IEC CD 8859-14:1997 — Latin alphabet No. 8 (Celtic)" (Committee Draft).
  8. British Standards Institution (1994-03-16). Welsh variant of Latin Alphabet No. 1 (right-hand part) (PDF). ITSCJ/IPSJ. ISO-IR-182.
  9. Kuhn, Markus; Whistler, Ken (1999-07-27). "ISO/IEC 8859-14:1998 to Unicode". 8859 to Unicode mapping tables. Unicode, Inc.
  10. International Components for Unicode (ICU), iso-8859_14-1998.ucm, 1999-07-27
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.