Z-variant
In
Differences on the Z-axis
The Unicode philosophy of code point allocation for CJK languages is organized along three "
The Z-axis represents minor typographical differences. For example, the Chinese characters (U+838A 莊) and (U+8358 荘) are Z-variants, as are (U+8AAA 說) and (U+8AAC 説). The glossary at Unicode.org defines "Z-variant" as "Two CJK unified ideographs with identical semantics and unifiable shapes,"[1] where "unifiable" is taken in the sense of Han unification.
Thus, were Han unification perfectly successful, Z-variants would not exist. They exist in Unicode because it was deemed useful to be able to "round-trip" documents between Unicode and other CJK encodings such as
Confusion
There is some confusion over the exact definition of "Z-variant." For example, in an
] treats both pairs as Z-variants.See also
References
- ^ a b "Glossary". www.unicode.org.
- ^ Huang, K.; Ko, Y.; Konishi, K.; Qian, H. (April 2004). "Joint Engineering Team (JET) Guidelines for Internationalized Domain Names (IDN) Registration and Administration for Chinese, Japanese, and Korean". tools.ietf.org.
- ^ "Unihan Database Lookup". www.unicode.org.