Double-byte character set
This article includes a improve this article by introducing more precise citations. (September 2021) ) |
A double-byte character set (DBCS) is a
In CJK computing
The term DBCS traditionally refers to a character encoding where each graphic character is encoded in two bytes.
In an 8-bit code, such as
Sometimes, the use of the term "DBCS" can imply an underlying structure that does not comply with
This original meaning of DBCS is different from what some consider correct usage today. Some insist that these character encodings be properly called
use more than two bytes for some characters, and they support one byte for other characters.Ambiguity
Some people use DBCS to mean the UTF-16 and UTF-8 encodings, while other people use the term DBCS to mean older (pre-Unicode) character encodings that use more than one byte per character. Shift JIS, GB 2312 and Big5 are a few character encodings that can contain more than one byte per character, but even using the term DBCS for these character encodings is incorrect terminology because these character encodings are really variable-width encodings (as are both UTF-16 and UTF-8). Some IBM mainframes do have true DBCS code pages, which contain only the double byte portion of a multi-byte code page.
If a person uses the term "DBCS enablement" for software
TBCS
A triple-byte character set (TBCS) is a character encoding in which characters (including control characters) are encoded in three bytes.
See also
External links
- Microsoft's definition of "double-byte character set"
- IBM's definition of "double-byte character set" at the Wayback Machine (archived October 18, 2018)