ISO/IEC 10367

Source: Wikipedia, the free encyclopedia.

ISO/IEC 10367:1991 is a standard developed by

ISO/IEC 4873[2] (as opposed to ISO/IEC 8859
, which defines character encodings at level 1 of ISO/IEC 4873).

Relationship to ISO/IEC 8859

The parts of

C1 area), and do not allow for use of multiple parts together. For use at levels 2 and 3 of ISO/IEC 4873 (i.e., with shift codes for additional graphical character sets), ISO/IEC 8859 stipulates that equivalent sets from ISO/IEC 10367 should be used instead.[3]

ISO/IEC 10367:1991 includes

ITU T.51) and of ISO/IEC 8859 parts 1 through 9 (i.e., those parts that existed as of 1991, when it was published), a set of additional Roman characters supplementing some of those parts, and a set of box drawing characters (shown below).[2][4]

Supplementary G3 Latin set

ISO/IEC 10367 includes the ISO-IR-154 graphical set, which is intended to supplement Latin alphabets number 1, 2 and 5 (i.e.,

ISO/IEC 6937 repertoire (ITU T.51 Annex A) without non-spacing codes.[6]

For instance, the letter Ĉ would be encoded under ISO/IEC 4873 level 2 as 0x8F 0x23 if this set is included.

Highlighted characters also appear in ISO-8859-1 or ISO-8859-9. Under the current edition of ISO/IEC 4873 / ECMA-43 (though not earlier editions),[7] characters must be used from the lowest-numbered working set they appear in, hence those characters are not used from this G3 set when the respective ISO-8859 right-hand side set is used as the G1 or G2 set.[8]

ISO/IEC 10367 supplementary G3 Latin set[5]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x/Ax Ā Ĉ Ċ Ė
Ē
Ĝ
3x/Bx ā ĉ ċ
ð
ė
ē
ĝ
4x/Cx Ğ Ġ Ģ Ĥ
Ħ
Ĩ
İ
Ī
Į
IJ
Ĵ Ķ
Ļ
Ŀ
Ņ
5x/Dx
Ŋ
Ō
Œ
Ŗ
Ŝ
Ŧ
Þ
Ũ
Ŭ
Ū
Ų
Ŵ
Ý
Ŷ
Ÿ
6x/Ex
ğ ġ ģ ĥ
ħ
ĩ
ı
ī
į
ij
ĵ ķ
ļ
ŀ
ņ
7x/Fx
ĸ
ŋ
ō
œ
ŗ
ŝ
ŧ
þ
ũ
ŭ
ū
ų
ŵ
ý
ŷ
ʼn
  Also in ISO-8859-1
  Also in ISO-8859-9

Box drawing set

The following shows the box drawing set from ISO/IEC 10367, which is registered for ISO/IEC 2022 use as ISO-IR-155. It does not use the 0x20/A0 or 0x7F/FF positions, but is nonetheless registered as a 96-character set.[9]

box-drawing characters in place of heavy-lined characters, and it replaces the upper half block (▀) at 0xCB with a private use character U+E019, documented as "Unit space B".[10]

ISO/IEC 10367 box drawing set[9]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x/Ax
3x/Bx
4x/Cx
5x/Dx
6x/Ex
7x/Fx

References

  1. ISO. ISO/IEC 10367:1991.{{cite web}}: CS1 maint: numeric names: authors list (link
    )
  2. ^ a b van Wingen, Johan W (1999). "8. Code Extension, ISO 2022 and 2375, ISO 4873 and 10367". Character sets. Letters, tokens and codes. Terena. Archived from the original on 2020-08-01.
  3. ^ ISO/IEC JTC 1/SC 2 (1998-02-12). Final Text of DIS 8859-10, Information Technology — 8-bit single-byte coded graphic character sets — Part 10: Latin alphabet No. 6 (PDF). ISO/IEC FDIS 8859-10:1998, JTC1/SC2 N2992, WG3 N415.{{citation}}: CS1 maint: numeric names: authors list (link)
  4. ^
    DKUUG
    .
  5. ^
    ISO-IR
    -154.
  6. ^ ISO/IEC JTC 1/SC 2/WG 3 (1998-04-15). "Annex E: Alternative coded representation of the repertoire with no non-spacing diacritical marks". WD 6937, Coded graphic character set for text communication - Latin alphabet (PDF). p. 37. JTC1/SC2/N454.{{citation}}: CS1 maint: numeric names: authors list (link)
  7. ^ ECMA (1991). "Main differences between the second edition (1985) and the present (third) edition of this ECMA Standard". ECMA-43: 8-Bit Coded Character Set Structure and Rules (PDF) (ECMA Standard) (3rd ed.). p. 23.
  8. ^ ECMA (1991). "Unique coding of characters". ECMA-43: 8-Bit Coded Character Set Structure and Rules (PDF) (ECMA Standard) (3rd ed.). p. 10.
  9. ^
    ISO-IR-155.{{citation}}: CS1 maint: numeric names: authors list (link
    )
  10. ^ Flohr, Guido. "Conversion routines for ISO_10367_BOX". libintl-perl. Locale::RecodeData::ISO_10367_BOX.