ISO/IEC 8859-6

ISO/IEC 8859-6:1999, Information technology — 8-bit single-byte coded graphic character sets — Part 6: Latin/Arabic alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987. It is informally referred to as Latin/Arabic. It was designed to cover Arabic. Only nominal letters are encoded, no preshaped forms of the letters, so shaping processing is required for display. It does not include the extra letters needed to write most Arabic-script languages other than Arabic itself (such as Persian, Urdu, etc.).
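The coverage limit described above can be checked with a short sketch. It assumes Python's bundled `iso8859_6` codec is a faithful stand-in for the standard; the helper name `encodable` and the sample strings are illustrative only.

```python
def encodable(text: str, codec: str = "iso8859_6") -> bool:
    """Return True if every character of text has a byte in the given codec."""
    try:
        text.encode(codec)
    except UnicodeEncodeError:
        return False
    return True


arabic_word = "\u0633\u0644\u0627\u0645"  # seen, lam, alef, meem
persian_peh = "\u067e"                    # PEH, used in Persian
urdu_tteh = "\u0679"                      # TTEH, used in Urdu

# Print each sample (escaped) with whether the codec can represent it.
for sample in (arabic_word, persian_peh, urdu_tteh):
    print(ascii(sample), encodable(sample))
```

The Arabic word encodes, while the Persian and Urdu letters do not, because the chart has no positions for them.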

ISO-8859-6 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. Text is stored in logical order, so bidi processing is required for display. Nominally, ISO-8859-6 (code page 28596) denotes “visual order” and ISO-8859-6-I (code page 38596) denotes logical order. In practice, however, ISO-8859-6 is also used for logical-order text, and HTML and XML documents require this interpretation. A third variant, ISO-8859-6-E, supposedly requires directionality to be specified explicitly with special control characters; it is unused in practice. IBM has assigned code page 1089 to ISO 8859-6;[1] it is an emulation for IBM's AIX operating system.
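As a minimal sketch of what logical order means for stored bytes, the following decodes a five-byte Arabic word. It assumes Python accepts the charset name `iso-8859-6` for its bundled codec; the byte values are taken from the code chart, and the variable names are illustrative.

```python
import unicodedata

# Bytes for the word "marhaba" in reading order: meem, reh, hah, beh, alef.
raw = bytes([0xE5, 0xD1, 0xCD, 0xC8, 0xC7])
text = raw.decode("iso-8859-6")

# List each byte next to the character it decodes to.
for byte, ch in zip(raw, text):
    print(f"0x{byte:02X} -> U+{ord(ch):04X} {unicodedata.name(ch)}")

# In logical order the first stored character is the first letter read,
# even though a bidi-aware renderer draws it at the right-hand end.
first_letter = unicodedata.name(text[0])
```

Decoding only recovers the nominal letters; joining them into connected shapes and laying them out right to left is left to the display layer.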

History

ASMO 708 was devised by the now-defunct Arab Standardization and Metrology Organization[2] in 1986 as the 8-bit standard for use in Arabic-speaking countries. Its design was inspired by the earlier 7-bit standard, ASMO 449, but it is not simply the 7-bit character set moved to the upper part; there are some differences.

ASMO 708 is a bidirectional character set. The lower part of the character set differs from standard ISO 646 in the digits and in some punctuation. Depending on the context (whether the numbers are within Latin script or Arabic script), the digits are rendered either as Latin digits or as Arabic digits. Symmetrical punctuation marks are likewise mirrored depending on the context: an opening punctuation mark, for example, is rendered with a different shape according to the direction of the script.
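The context-dependent digit rendering can be illustrated with a rough sketch. The exact rule is not given here, so the code assumes one simple interpretation: a digit takes the Arabic shape when the most recent letter before it is Arabic. The function name `render_digits` and that rule are hypothetical, not taken from the standard.

```python
import unicodedata

ARABIC_INDIC_ZERO = 0x0660  # code point of the Arabic-Indic digit zero


def render_digits(text: str) -> str:
    """Swap ASCII digits for Arabic-Indic shapes when they follow Arabic letters."""
    out = []
    in_arabic = False  # script of the most recent strongly directional character
    for ch in text:
        bidi_class = unicodedata.bidirectional(ch)
        if bidi_class in ("AL", "R"):
            in_arabic = True
        elif bidi_class == "L":
            in_arabic = False
        if in_arabic and "0" <= ch <= "9":
            ch = chr(ARABIC_INDIC_ZERO + int(ch))
        out.append(ch)
    return "".join(out)


latin_sample = render_digits("abc 123")
arabic_sample = render_digits("\u0633 12")  # seen, space, digits one and two
```

Under this assumption the stored bytes stay the same in both cases; only the shape chosen at display time differs.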

The upper part of the character set contains only the Arabic letters, the Arabic punctuation that differs from Latin punctuation, and a few other characters.

ASMO 708 was designed in close cooperation[3] with ECMA, which adopted it as its own ECMA-114 standard in 1986. It was approved as the ISO standard ISO 8859-6,[4] and in 1986 it was registered in the International Register of Coded Character Sets as IR 127.[5]

Relationship with other character sets

Some other character sets are related to ASMO 708.

  • ASMO 708/French 1[6] adds French lower case characters;
  • French 1/ASMO 708[6] adds French lower case characters in their ISO 8859-1 code points and relocates the Arabic ones;
  • ISO/IR 167[7] adds French and German characters;
  • Microsoft’s code page 708, for MS-DOS, adds French characters in their typical code points from code page 437 and adds box-drawing characters;
  • Microsoft’s code page 710 (Transparent Arabic) and code page 720 (Transparent ASMO), both for MS-DOS, add French characters in their typical code points from code page 437 but relocate the Arabic characters so that the box-drawing characters from code page 437 keep their original code points;
  • Microsoft’s Windows 1256 adds French lower case characters in their Windows 1252 code points and relocates the Arabic ones.
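The effect of relocating characters can be seen by decoding the same byte under two of these encodings. This is a sketch that assumes Python's bundled `iso8859_6` and `cp1256` codecs match ISO/IEC 8859-6 and Windows 1256 respectively; the variable names are illustrative.

```python
byte = b"\xe9"

# The same byte is an Arabic letter in one encoding and a French letter in the other.
as_iso_8859_6 = byte.decode("iso8859_6")   # U+0649 ARABIC LETTER ALEF MAKSURA
as_windows_1256 = byte.decode("cp1256")    # U+00E9 LATIN SMALL LETTER E WITH ACUTE

# Conversely, the letter TAH (U+0637) sits at different bytes in the two encodings.
tah = "\u0637"
tah_iso_8859_6 = tah.encode("iso8859_6")
tah_windows_1256 = tah.encode("cp1256")

print(hex(ord(as_iso_8859_6)), hex(ord(as_windows_1256)))
print(tah_iso_8859_6.hex(), tah_windows_1256.hex())
```

Text in one of these encodings therefore cannot be read as another without conversion, even though they share most of the Arabic repertoire.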

Code chart


ISO/IEC 8859-6

Rows 0_, 1_, 8_ and 9_ contain no graphic characters. In rows 2_ to 7_ the Unicode code point of each character equals its byte value (for example, byte 41 is U+0041, decimal 65). In row 3_ each digit is shown in both its Latin and its Arabic form.

     _0   _1   _2   _3   _4   _5   _6   _7   _8   _9   _A   _B   _C   _D   _E   _F
2_   SP   !    "    #    $    %    &    '    (    )    *    +    ,    -    .    /
3_   0/٠  1/١  2/٢  3/٣  4/٤  5/٥  6/٦  7/٧  8/٨  9/٩  :    ;    <    =    >    ?
4_   @    A    B    C    D    E    F    G    H    I    J    K    L    M    N    O
5_   P    Q    R    S    T    U    V    W    X    Y    Z    [    \    ]    ^    _
6_   `    a    b    c    d    e    f    g    h    i    j    k    l    m    n    o
7_   p    q    r    s    t    u    v    w    x    y    z    {    |    }    ~

Rows A_ to F_ are listed one character per line. Byte values in these rows that are not listed are undefined.

Byte   Decimal   Unicode   Character
A0     160       00A0      NBSP
A4     164       00A4      ¤
AC     172       060C      ،
AD     173       00AD      SHY
BB     187       061B      ؛
BF     191       061F      ؟
C1     193       0621      ء
C2     194       0622      آ
C3     195       0623      أ
C4     196       0624      ؤ
C5     197       0625      إ
C6     198       0626      ئ
C7     199       0627      ا
C8     200       0628      ب
C9     201       0629      ة
CA     202       062A      ت
CB     203       062B      ث
CC     204       062C      ج
CD     205       062D      ح
CE     206       062E      خ
CF     207       062F      د
D0     208       0630      ذ
D1     209       0631      ر
D2     210       0632      ز
D3     211       0633      س
D4     212       0634      ش
D5     213       0635      ص
D6     214       0636      ض
D7     215       0637      ط
D8     216       0638      ظ
D9     217       0639      ع
DA     218       063A      غ
E0     224       0640      ـ
E1     225       0641      ف
E2     226       0642      ق
E3     227       0643      ك
E4     228       0644      ل
E5     229       0645      م
E6     230       0646      ن
E7     231       0647      ه
E8     232       0648      و
E9     233       0649      ى
EA     234       064A      ي
EB     235       064B      ◌ً
EC     236       064C      ◌ٌ
ED     237       064D      ◌ٍ
EE     238       064E      ◌َ
EF     239       064F      ◌ُ
F0     240       0650      ◌ِ
F1     241       0651      ◌ّ
F2     242       0652      ◌ْ
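The upper half of the chart follows a regular pattern: apart from NBSP, the currency sign and SHY, which keep their byte values as code points, each defined byte maps to U+0600 plus its offset from 0xA0. The sketch below expresses that observation and compares it with Python's bundled `iso8859_6` codec, which is assumed here to match the chart; the names `to_code_point` and `DEFINED_UPPER` are illustrative.

```python
def to_code_point(byte: int) -> int:
    """Map a defined byte in rows A_ to F_ to its Unicode code point."""
    if byte in (0xA0, 0xA4, 0xAD):   # NBSP, currency sign, SHY
        return byte
    return 0x0600 + (byte - 0xA0)    # Arabic punctuation, letters and marks


# Every byte that the chart defines in the upper half.
DEFINED_UPPER = [0xA0, 0xA4, 0xAC, 0xAD, 0xBB, 0xBF,
                 *range(0xC1, 0xDB), *range(0xE0, 0xF3)]

# Collect any byte where the formula and the codec disagree.
mismatches = [b for b in DEFINED_UPPER
              if bytes([b]).decode("iso8859_6") != chr(to_code_point(b))]
print(len(DEFINED_UPPER), "defined bytes,", len(mismatches), "mismatches")
```

The formula says nothing about undefined positions, so it should only be applied to bytes that the chart actually assigns.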

Code values 0xEB to 0xF2 are assigned to combining characters.
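This can be confirmed against the Unicode character database with a short sketch. It assumes Python's bundled `iso8859_6` codec and `unicodedata` module; the variable names are illustrative.

```python
import unicodedata

# Decode the eight byte values 0xEB through 0xF2.
marks = bytes(range(0xEB, 0xF3)).decode("iso8859_6")

# A non-zero canonical combining class identifies a combining mark.
all_combining = all(unicodedata.combining(ch) != 0 for ch in marks)

for ch in marks:
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} class={unicodedata.combining(ch)}")

# A mark is stored after its base letter: beh (0xC8) followed by fatha (0xEE).
syllable = bytes([0xC8, 0xEE]).decode("iso8859_6")
```

Because each mark occupies its own byte after the base letter, a vocalised syllable takes two bytes but is drawn as a single unit.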
