ISO basic Latin alphabet
The ISO basic Latin alphabet is an international standard (beginning with
The two sets contain the following 26 letters each:[1]
Uppercase letter set | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Lowercase letter set | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o | p | q | r | s | t | u | v | w | x | y | z |
History
By the 1960s it became apparent to the
Terminology
The Unicode block that contains the alphabet is called "C0 Controls and Basic Latin". Two subheadings exist:[2]
- "Uppercase Latin alphabet": the letters start at U+0041 and contain the string LATIN CAPITAL LETTER in their descriptions
- "Lowercase Latin alphabet": the letters start at U+0061 and contain the string LATIN SMALL LETTER in their descriptions
There are also another two sets in the
- Uppercase: the letters start at U+FF21 and contain the string FULLWIDTH LATIN CAPITAL LETTER in their descriptions
- Lowercase: the letters start at U+FF41 and contain the string FULLWIDTH LATIN SMALL LETTER in their descriptions
Timeline for encoding standards
- 1865 International Morse Code was standardized at the International Telegraphy Congress in Paris, and was later made the standard by the International Telecommunication Union (ITU)
- 1950s Radiotelephony Spelling Alphabet by ICAO[4]
Timeline for widely used computer codes supporting the alphabet
- 1963: American Standards Association, which became the American National Standards Institutein 1969)
- 1963/1964: EBCDIC (developed by IBM and supporting the same alphabetic characters as ASCII, but with different code values)
- 1965-04-30: Ratified by ECMA as ECMA-6[5] based on work the ECMA's Technical Committee TC1 had carried out since December 1960.[5]
- 1972: ISO/IECstandard)
- 1983: ITU-T Rec. T.51 | ISO/IEC 6937(a multi-byte extension of ASCII)
- 1987: ISO/IEC 8859-1:1987 (8-bit character encoding)
- Subsequently, other versions and parts of ISO/IEC 8859 have been published.
- Mid-to-late 1980s: Windows-1250, Windows-1252, and other encodings used in Microsoft Windows (some roughly similar to ISO/IEC 8859-1)
- 1990: C0 Controls and Basic Latin" using the same alphabetic code values as ASCII and ISO/IEC 646
- Subsequently, other versions of Unicode have been published and it later became a joint ISO/IECstandard as well, as identified below.
- Subsequently, other versions of Unicode have been published and it later became a joint
- 1993: ISO/IEC 10646-1:1993, ISO/IEC standard for characters in Unicode 1.1
- Subsequently, other versions of ISO/IEC 10646-1 and one of ISO/IEC 10646-2 have been published. Since 2003, the standards have been published under the name "ISO/IEC 10646" without the separation into two parts.
- 1997: Windows Glyph List 4
Representation
In ASCII the letters belong to the
Not case sensitive, all letters have code words in the
Usage
All of the lowercase letters are used in the
Alphabets containing the same set of letters
The list below only includes alphabets that lack:
- letters whose diacritical marksmake them distinct letters.
- multigraphs that constitute distinct letters.
- ligatures that are distinct letters
Notable omissions due to these rules include
. The German alphabet is sometimes considered by tradition to contain only 26 letters (with ä, ö, ü considered variants and ß considered a ligature), but the current German orthographic rules include ä, ö, ü, ß in the alphabet placed after Z; however, this order is normally not used in collation: usually ä, ö, ü are collated as a, o, u (or sometimes as ae, oe, ue), ß as ss.Alphabet | Diacritic | Multigraphs (not constituting distinct letters) | Ligatures |
---|---|---|---|
Afrikaans alphabet
|
uu ⟩
Trigraphs: ⟨aai⟩, ⟨eeu⟩, ⟨oei⟩, ⟨ooi⟩ |
ʼn (N‑apostrophe)
| |
Aragonese alphabet (Academia de l'Aragonés orthography) | á, é, í, ó, ú, ü, lꞏl | ⟨ tz ⟩
|
|
Catalan alphabet
|
à, é, è, í, ï, ó, ò, ú, ü, ç, lꞏl | ⟨ ss ⟩
|
|
Dutch alphabet
|
ä, é, è, ë, ï, ö, ü | The uu ⟩
|
|
English alphabet | only in loanwords (see below)1 | ⟨ ng ⟩
|
æ, œ (both archaic) |
French alphabet
|
⟨ eî ⟩
|
æ (rare), œ (mandatory) | |
Italian alphabet (extended)[a]
|
ù
|
⟨ sci ⟩
|
|
Ido alphabet *
|
none | ⟨⟩ | |
Indonesian alphabet
|
only in learning materials (see below)4 | ⟨ ai ⟩, ⟨au⟩, ⟨ei⟩, ⟨oi⟩
|
|
Interlingua alphabet *
|
only in unassimilated loanwords (see below)2 | ⟨ rh⟩, ⟨sh ⟩
| |
Javanese Latin alphabet
|
é, è | ⟨ sy⟩, ⟨th ⟩
|
|
Latino sine flexione alphabet* | only an optional accent for unusual stress (see below)3 | ⟨ | |
Luxembourgish alphabet
|
ä, é, ë | ⟨ sch ⟩
|
|
Malay alphabet
|
only in learning materials (see below)4 | ⟨ sy ⟩
|
|
Portuguese alphabet[b]
|
⟨ õe ⟩
|
||
Sundanese Latin alphabet | é | ⟨ ng⟩, ⟨ny ⟩
|
* Constructed languages
- English is one of the few modern European languages requiring no diacritics for native words (although a
- Latino sine flexione, a.k.a. "Peano's Interlingua", allows but does not require the placement of an accent for unusual stress. (It antedates the other "Interlingua" by roughly four decades.)
- Malay and Indonesian (based on Malay) are the only languages outside Europe that use all the Latin alphabet and require no diacritics and ligatures.[d] Many of the 700+ languages of Indonesia also use the Indonesian alphabet to write their languages, some—such as Javanese—adding diacritics é and è, and some omitting q, x, and z.
Column numbering
This section needs additional citations for verification. (October 2023) |
The Roman (Latin) alphabet is commonly used for column numbering in a table or chart. This avoids confusion with row numbers using Arabic numerals. For example, a 3-by-3 table would contain columns A, B, and C, set against rows 1, 2, and 3. If more columns are needed beyond Z (normally the final letter of the alphabet), the column immediately after Z is AA, followed by AB, and so on (see bijective base-26 system). This can be seen by scrolling far to the right in a spreadsheet program such as Microsoft Excel or LibreOffice Calc.
These are double-digit "letters" for table columns, in the same way that 10 through 99 are double-digit numbers. The Greek alphabet has a similar extended form that uses such double-digit letters if necessary, but it is used for chapters of a fraternity as opposed to columns of a table.
Such double-digit letters for bullet points are AA, BB, CC, etc., as opposed to the number-like place value system explained above for table columns.
See also
- Hebrew alphabet
- Greek alphabet
- Latin alphabet
- Latin-script alphabet for the sound correspondence
- List of Latin-script alphabets
- Early Cyrillic alphabet, Cyrillic alphabets
- Windows code pages
Notes
- ^ The Italian alphabet is traditionally considered to have only 21 letters, excluding j, k, w, x, y. However, in practice these letters occur in a number of loanwords. J also occurs in some native Italian proper names as a variant of writing semivocalic i.
- ^ Note for Portuguese:
k and y (but not w) were part of the alphabet until several spelling reforms during the 20th century, the aim of which was to change the etymological Portuguese spelling into an easier phonetic spelling. These letters were replaced by other letters having the same sound: thus psychologia became psicologia, kioske became quiosque, martyr became mártir, etc. Nowadays k, w, and y are only found in foreign words and their derived terms and in scientific abbreviations (e.g. km, byronismo). These letters are considered part of the alphabet again following the 1990 Portuguese Language Orthographic Agreement, which came into effect on January 1, 2009, in Brazil. See Reforms of Portuguese orthography.
- ^ As an example of an article containing a diaeresis in "coöperate", as well as accents on loan words in English, such as a cedilla in "façades" and a circumflex in the word "crêpe", see Grafton, Anthony (October 23, 2006). "Books: The Nutty Professors, The history of academic charisma". The New Yorker.
- ^ However, Malay and Indonesian learning materials may use ⟨é⟩ (E with acute) to clarify the pronunciation of the letter E; in that case, ⟨e⟩ is pronounced /ə/ while ⟨é⟩ is pronounced /e/ and (è) is pronounced /ɛ/.
References
- ^ a b c "Internationalisation standardization of 7-bit codes, ISO 646". Trans-European Research and Education Networking Association (TERENA). Retrieved October 3, 2010.
- ^ "C0 Controls and Basic Latin" (PDF). Unicode.org. Retrieved August 8, 2016.
- ^ "Halfwidth and Fullwidth Forms" (PDF). Unicode.org. Retrieved August 8, 2016.
- ^ "The Postal History of ICAO". www.icao.int. Archived from the original on February 12, 2019. Retrieved February 17, 2019.
- ^ European Computer Manufacturers Association (Ecma). March 1985. Archived from the original (PDF) on May 29, 2016. Retrieved May 29, 2016.met for the first time in December 1960 to prepare standard codes for Input/Output purposes. On April 30, 1965, Standard ECMA-6 was adopted by the General Assembly of ECMA.
The Technical Committee TC1 of ECMA
- ^ "Unicode character database". The Unicode Standard. Retrieved March 22, 2013.
- ISBN 0-201-56788-1.
- Ager, Simon. "Latino sine Flexione". Omniglot. Latino sine Flexione alphabet. Retrieved April 14, 2023.
- ^ "The New Yorker's odd mark — the diaeresis". December 16, 2010. Archived from the original on December 16, 2010.
- ^ "Introduction al IED (in anglese)". www.interlingua.com. Retrieved September 21, 2020.