CNS 11643
Alias(es) | CSIC (Chinese Standard Interchange Code) |
---|---|
Language(s) | ISO 2022, DBCS, CJK encoding |
Encoding formats |
Other related encoding(s) | CCCII |
The CNS 11643 character set (Chinese National Standard 11643), also officially known as the Chinese Standard Interchange Code or CSIC[1] (Chinese: 中文標準交換碼), is the official standard character set of Taiwan (Republic of China). In practice, however, variants of the related Big5 character set are the de facto standard.
CNS 11643 is designed to conform to
History
The first edition of the standard was published in 1986, and included planes 1 and 2, deriving from levels 1 and 2 of
Unicode 1.0.0, although it did not yet include
In the second edition of the standard, published in 1992, a much larger collection of
The third edition of the standard, published in 2007, added the
As of 2017, there are several thousand CNS 11643 characters with no corresponding Unicode character, mostly in planes 10 through 14; these are mapped to the Unicode Supplementary Private Use Area.[5]
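
Text converted from CNS 11643 may therefore contain private-use code points that carry no meaning outside that convention. The following is a minimal sketch of how such characters could be located in a string. It assumes only the Unicode ranges of the two Supplementary Private Use Areas (U+F0000 to U+FFFFD and U+100000 to U+10FFFD); the function names are hypothetical, and it does not attempt to recover the CNS 11643 plane or position of a character.

```python
# Supplementary Private Use Area-A and Area-B, as defined by Unicode.
SUPPLEMENTARY_PUA_RANGES = ((0xF0000, 0xFFFFD), (0x100000, 0x10FFFD))


def is_supplementary_pua(ch: str) -> bool:
    """Return True if the single character lies in a Supplementary PUA."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in SUPPLEMENTARY_PUA_RANGES)


def find_supplementary_pua(text: str) -> list[tuple[int, str]]:
    """List (index, 'U+XXXX') for every Supplementary PUA character in text."""
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(text)
        if is_supplementary_pua(ch)
    ]
```

A caller might use `find_supplementary_pua` to flag characters that are likely to render incorrectly in fonts or systems that do not share the same private-use mapping.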
Relationship to Big5
Levels 1 and 2 of the
Within the Big5 hanzi repertoire, only one plane 1 character is conventionally mapped to Unicode differently from the corresponding character from the first two CNS 11643 planes: to U+5F5D (
References
- This page is based on the information on the CNS official web site.
- ISO-IR-171.
- ISBN 9780596514471.
- UTCL2/22-288.
- "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.
- "CNS 11643 in Unicode's Supplementary Private Use Area". [chinese mac]. Council on East Asian Studies at Yale University.
- Lunde, Ken (1995-12-18). "4.3: CJK Character Set Compatibility Issues - Chinese (Taiwan)". CJK.INF Version 1.9.
- IETF.
- Adobe Inc.
- "ibm-950_P110-1999 (lead byte 0xC2)". International Components for Unicode Converter Explorer. Unicode Consortium. Archived from the original on 2021-07-12.
- "ibm-950_P110-1999.ucm". ICU Data Repository. IBM/Unicode Consortium. 2007.
<U5284> \xE3\x5A |0
External links
- CNS 11643 official web site
- Current CNS 11643 open data, including mapping data
- Unicode Consortium mappings for CNS 11643-1986: planes 1 and 2, plus the 1988 plane 14 (not the 2007 plane 14) with extensions. Uses a single prefixed hex digit to indicate plane.
- CNS 11643 mappings from International Components for Unicode (ICU):
  - "CNS-11643-1992": original version, current version. The original version of the mapping includes standard planes 1–7 but includes the plane 15 layout as plane 9; the current version includes only planes 1 and 2. Uses prefixed 0x81 through 0x89 to indicate plane.
  - "EUC-TW-2014": standard assignments for planes 1 through 7 and 15, and IBM corporate assignments in planes 12 and 13. CNS codes in EUC format with two-byte plane 1.
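
As an illustration of the "EUC format with two-byte plane 1" mentioned above, the sketch below splits an EUC-TW byte string into CNS 11643 plane, row, and cell numbers. It assumes the usual EUC-TW layout, which is not spelled out in this article: a plane 1 character is two bytes in the range 0xA1 to 0xFE, and a character in any plane is four bytes, namely 0x8E, a plane byte from 0xA1 (plane 1) to 0xB0 (plane 16), and then two bytes in 0xA1 to 0xFE. The function name and the tuple shapes are hypothetical, and no Unicode mapping is attempted.

```python
SS2 = 0x8E  # single-shift byte that introduces the four-byte form


def _check_pair(hi: int, lo: int) -> None:
    """Raise if either byte of a row/cell pair is outside 0xA1..0xFE."""
    if not (0xA1 <= hi <= 0xFE and 0xA1 <= lo <= 0xFE):
        raise ValueError(f"invalid row/cell bytes: {hi:#04x} {lo:#04x}")


def decode_euc_tw(data: bytes) -> list[tuple]:
    """Split EUC-TW bytes into ('ascii', byte) and ('cns', plane, row, cell)."""
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        if b < 0x80:
            # Single-byte ASCII.
            out.append(("ascii", b))
            i += 1
        elif b == SS2:
            # Four-byte form: SS2, plane byte, row byte, cell byte.
            if i + 4 > len(data):
                raise ValueError("truncated four-byte sequence")
            plane_byte, hi, lo = data[i + 1], data[i + 2], data[i + 3]
            if not 0xA1 <= plane_byte <= 0xB0:
                raise ValueError(f"invalid plane byte: {plane_byte:#04x}")
            _check_pair(hi, lo)
            out.append(("cns", plane_byte - 0xA0, hi - 0xA0, lo - 0xA0))
            i += 4
        elif 0xA1 <= b <= 0xFE:
            # Two-byte form: always plane 1.
            if i + 2 > len(data):
                raise ValueError("truncated two-byte sequence")
            hi, lo = b, data[i + 1]
            _check_pair(hi, lo)
            out.append(("cns", 1, hi - 0xA0, lo - 0xA0))
            i += 2
        else:
            raise ValueError(f"unexpected byte: {b:#04x}")
    return out
```

Rows and cells come out in the range 1 to 94, so the result can be compared against plane/row/cell listings in the mapping tables linked above after converting between their respective notations.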