VISCII
MIME / IANA | VISCII |
---|---|
Language(s) | RFC 1456 |
Classification | 8-bit SBCS |
Based on | ASCII |
VISCII is an unofficially-defined modified ASCII character encoding for using the Vietnamese language with computers. It should not be confused with the similarly-named officially registered VSCII encoding. VISCII keeps the 95 printable characters of ASCII unmodified, but it replaces 6 of the 33 control characters with printable characters. It adds 128 precomposed characters. Unicode and the Windows-1258 code page are now used for virtually all Vietnamese computer data,[citation needed] but legacy VSCII and VISCII files may need conversion.
History and naming
VISCII was designed by the Vietnamese Standardization Working Group (Viet-Std Group)[1] led by Christopher Cuong T. Nguyen, Cuong M. Bui, and Hoc D. Ngo based in Silicon Valley, California in 1992 while they were working with the Unicode consortium to include pre-composed Vietnamese characters in the Unicode standard. VISCII, along with VIQR, was first published in a bilingual report in September 1992, in which it was dubbed the "Vietnamese Standard Code for Information Interchange".[2] The report noted a proliferation in computer usage in Vietnam and the increasing volume of computer-based communications among Vietnamese abroad, that existing applications used vendor-specific encodings which were unable to interoperate with one another, and that standardisation between vendors was therefore necessary. The successful inclusion of composed and precomposed Vietnamese in Unicode 1.0 was the result of the lessons learned from the development of 8-bit VISCII and 7-bit VIQR.[2]
The next year, in 1993, Vietnam adopted
VISCII and
Design
A traditional extended ASCII character set consists of the ASCII set plus up to 128 characters. Vietnamese requires 134 additional letter-diacritic combinations, which is six too many. There are (short of dropping tone mark support for capital letters, as in VSCII-3) essentially four different ways to handle this problem:
- Use variable-width encoding (as does UTF-8)
- Include combining diacritical marks for tone marks (as do VSCII-2 and Windows-1258) or for diacritics in general (as do ANSEL and VNI)
- Replace some ASCII punctuation, preferably punctuation which is not invariant in ISO 646 (as does VNI for DOS)
- Replace at least six of the basic ASCII VPS and VSCII-1)
VISCII went for the last option, replacing six of the least problematic (e.g., least likely to be recognised by an application and acted on specially)
However, using up all the extended code points for accented letters left no room to add useful symbols, superscripted numbers, curved quotes, proper dashes, etc., like most other extended ASCII character sets.
Location of characters deliberately mostly follows
Support
VISCII is partially supported by the TriChlor Software Group in California, which has released various VISCII-compliant software packages, libraries, and fonts for MS-DOS and Windows, Unix, and Macintosh. VISCII-compliant software is available at many FTP sites.
VISCII was historically offered as an encoding for outgoing email by Mozilla Thunderbird.[8] It was also supported by the Windows Vietnamese keyboard software, WinVNKey, created by Christopher Cuong T. Nguyen and later upgraded through various Windows versions by Hoc D. Ngo and others.
VISCII was mostly used by overseas Vietnamese speakers, with VSCII (TCVN) being more popular in northern Vietnam and VNI being more popular in southern Vietnam.[9]
Character set
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | SOH
|
Ẳ 1EB2 |
ETX
|
EOT
|
Ẵ 1EB4 |
Ẫ 1EAA |
BEL | BS | HT
|
LF
|
VT
|
FF
|
CR | SO
|
SI
|
1x | DLE
|
DC1
|
DC2
|
DC3
|
Ỷ 1EF6 |
NAK
|
SYN
|
ETB
|
CAN | Ỹ 1EF8 |
SUB | ESC | FS
|
GS
|
Ỵ 1EF4 |
US
|
2x | SP
|
! | " | # | $ | % | & | ' | ( | ) | * | + | ,
|
- | . | / |
3x | 0
|
1
|
2
|
3
|
4
|
5
|
6
|
7
|
8
|
9
|
: | ; | <
|
=
|
>
|
? |
4x | @
|
A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [
|
\ | ]
|
^ | _ |
6x | `
|
a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | {
|
| | }
|
~ | DEL |
8x | Ạ 1EA0 |
Ắ 1EAE |
Ằ 1EB0 |
Ặ 1EB6 |
Ấ 1EA4 |
Ầ 1EA6 |
Ẩ 1EA8 |
Ậ 1EAC |
Ẽ 1EBC |
Ẹ 1EB8 |
Ế 1EBE |
Ề 1EC0 |
Ể 1EC2 |
Ễ 1EC4 |
Ệ 1EC6 |
Ố 1ED0 |
9x | Ồ 1ED2 |
Ổ 1ED4 |
Ỗ 1ED6 |
Ộ 1ED8 |
Ợ 1EE2 |
Ớ 1EDA |
Ờ 1EDC |
Ở 1EDE |
Ị 1ECA |
Ỏ 1ECE |
Ọ 1ECC |
Ỉ 1EC8 |
Ủ 1EE6 |
Ũ 0168 |
Ụ 1EE4 |
Ỳ 1EF2 |
Ax | Õ 00D5 |
ắ 1EAF |
ằ 1EB1 |
ặ 1EB7 |
ấ 1EA5 |
ầ 1EA7 |
ẩ 1EA9 |
ậ 1EAD |
ẽ 1EBD |
ẹ 1EB9 |
ế 1EBF |
ề 1EC1 |
ể 1EC3 |
ễ 1EC5 |
ệ 1EC7 |
ố 1ED1 |
Bx | ồ 1ED3 |
ổ 1ED5 |
ỗ 1ED7 |
Ỡ 1EE0 |
Ơ 01A0 |
ộ 1ED9 |
ờ 1EDD |
ở 1EDF |
ị 1ECB |
Ự 1EF0 |
Ứ 1EE8 |
Ừ 1EEA |
Ử 1EEC |
ơ 01A1 |
ớ 1EDB |
Ư 01AF |
Cx | À | Á | Â | Ã | Ả 1EA2 |
Ă 0102 |
ẳ 1EB3 |
ẵ 1EB5 |
È | É | Ê | Ẻ 1EBA |
Ì | Í | Ĩ 0128 |
ỳ 1EF3 |
Dx | Đ 0110 |
ứ 1EE9 |
Ò | Ó | Ô
|
ạ 1EA1 |
ỷ 1EF7 |
ừ 1EEB |
ử 1EED |
Ù
|
Ú | ỹ 1EF9 |
ỵ 1EF5 |
Ý | ỡ 1EE1 |
ư 01B0 |
Ex | à | á | â | ã | ả 1EA3 |
ă 0103 |
ữ 1EEF |
ẫ 1EAB |
è | é | ê | ẻ 1EBB |
ì | í | ĩ 0129 |
ỉ 1EC9 |
Fx | đ 0111 |
ự 1EF1 |
ò | ó | ô
|
õ | ỏ 1ECF |
ọ 1ECD |
ụ 1EE5 |
ù
|
ú | ũ 0169 |
ủ 1EE7 |
ý | ợ 1EE3 |
Ữ 1EEE |
See also
- ASCII
- Vietnamese Quoted-Readable (VIQR)
- Vietnamese Standard Code for Information Interchange(VSCII)
- Windows-1258
References
- ^ Phung, Quang; Ngo, Hoc D.; Bui, Cuong. "Vietnamese-Standard Working Group Home Page". Viet-Std Group. Retrieved 2019-08-23.
- ^ a b c d e f Vietnamese Character Encoding Standardization Report - VISCII And VIQR 1.1 Character Encoding Specifications (Technical report). Viet-Std Group. 1992.
- ^ a b "[news] TCVN 5712:1993 (VSCII) -- Vietnamese national standard". 1993-06-02. Archived from the original on 2017-01-11.
- ISBN 978-0-596-51447-1.
- .
- IANA.
- ^ Sivonen, Henri (2014-09-26). "Character encoding changes in m-c require c-c action". mozilla.dev.apps.thunderbird.
- ^ Sivonen, Henri (2014-09-26). "Character encoding changes in m-c require c-c action". mozilla.dev.apps.thunderbird.
VISCII and armscii-8 are special in the sense that, for long time, Thunderbird itself (misguidedly) provided these encodings in the user interface for the choice of outgoing character encoding when composing a message. Therefore, it is possible that there exists a Thunderbird-created legacy of VISCII and armscii-8 email and Usenet posts.
- ^ Ngo, Hoc Dinh; Tran, TuBinh. "5. Why Having Vietnamese Charset (Character Set – Encoding) Conversion?". Some special functions of WinVNKey.
Further reading
- Flohr, Guido (2016) [2006]. "Locale::RecodeData::VISCII - Conversion routines for VISCII". CPAN libintl-perl. Archived from the original on 2017-01-14. Retrieved 2017-01-14.
- https://www.math.nmsu.edu/~mleisher/Software/csets/VISCII.TXT
External links
- RFC 1456 - Conventions for Encoding the Vietnamese Language
- Vietnamese-Standardization Working Group based in California
- Viet-Std Report 1992
- AnGiang Software
- VISCII-compliant software and fonts for MS-DOS and Windows
- VISCII-compliant software, libraries, and fonts for Unix
- WinVNKey, Vietnamese keyboard driver for Windows supporting multinational character sets, including VISCII
- MacVNKey, VISCII-compliant keyboard driver for Macintosh classic