Audio coding format
An audio coding format[1] (or sometimes audio compression format) is a content representation format for storage or transmission of digital audio (such as in digital television, digital radio and in audio and video files). Examples of audio coding formats include MP3, AAC, Vorbis, FLAC, and Opus. A specific software or hardware implementation capable of audio compression and decompression to/from a specific audio coding format is called an audio codec; an example of an audio codec is LAME, which is one of several different codecs which implements encoding and decoding audio in the MP3 audio coding format in software.
Some audio coding formats are documented by a detailed
Audio content encoded in a particular audio coding format is normally encapsulated within a
An audio coding format does not dictate all
Lossless, lossy, and uncompressed audio coding formats
A lossless audio coding format reduces the total data needed to represent a sound but can be de-coded to its original, uncompressed form. A lossy audio coding format additionally reduces the bit resolution of the sound on top of compression, which results in far less data at the cost of irretrievably lost information.
Transmitted (streamed) audio is most often compressed using lossy audio codecs as the smaller size is far more convenient for distribution. The most widely used audio coding formats are
Lossless audio coding formats such as
History
In 1950,
Discrete cosine transform (DCT), developed by Nasir Ahmed, T. Natarajan and K. R. Rao in 1974,[8] provided the basis for the modified discrete cosine transform (MDCT) used by modern audio compression formats such as MP3[9] and AAC. MDCT was proposed by J. P. Princen, A. W. Johnson and A. B. Bradley in 1987,[10] following earlier work by Princen and Bradley in 1986.[11] The MDCT is used by modern audio compression formats such as Dolby Digital,[12][13] MP3,[9] and Advanced Audio Coding (AAC).[14]
List of lossy formats
General
Basic compression algorithm | Audio coding standard | Abbreviation | Introduction | Market share (2019)[15] | Ref |
---|---|---|---|---|---|
Modified discrete cosine transform (MDCT) | Dolby Digital (AC-3) | AC3 | 1991 | 58% | [12][16] |
Adaptive Transform Acoustic Coding
|
ATRAC | 1992 | Unknown | [12] | |
MPEG Layer III
|
MP3 | 1993 | 49% | [9][17] | |
Advanced Audio Coding (MPEG-2 / MPEG-4) | AAC | 1997 | 88% | [14][12] | |
Windows Media Audio | WMA | 1999 | Unknown | [12] | |
Ogg Vorbis | Ogg | 2000 | 7% | [18][12] | |
Constrained Energy Lapped Transform
|
CELT | 2011 | — | [19] | |
Opus
|
Opus | 2012 | 8% | [20] | |
LDAC | LDAC | 2015 | Unknown | [21][22] | |
Adaptive differential pulse-code modulation (ADPCM) | aptX / aptX-HD | aptX | 1989 | Unknown | [23] |
Digital Theater Systems
|
DTS | 1990 | 14% | [24][25] | |
Master Quality Authenticated | MQA | 2014 | Unknown | ||
Sub-band coding (SBC) | MPEG-1 Audio Layer II | MP2 | 1993 | Unknown | |
Musepack | MPC | 1997 |
Speech
- Linear predictive coding (LPC)
- Adaptive predictive coding (APC)
- Code-excited linear prediction (CELP)
- Algebraic code-excited linear prediction (ACELP)
- Relaxed code-excited linear prediction (RCELP)
- Low-delay CELP(LD-CELP)
- Adaptive Multi-Rate (used in GSM and 3GPP)
- Codec2(noted for its lack of patent restrictions)
- Speex (noted for its lack of patent restrictions)
- Modified discrete cosine transform (MDCT)
- AAC-LD
- Constrained Energy Lapped Transform(CELT)
- Opus(mostly for real-time applications)
List of lossless formats
- Apple Lossless(ALAC – Apple Lossless Audio Codec)
- Adaptive Transform Acoustic Coding(ATRAC)
- Audio Lossless Coding (also known as MPEG-4 ALS)
- Direct Stream Transfer (DST)
- Dolby TrueHD
- DTS-HD Master Audio
- Free Lossless Audio Codec(FLAC)
- Lossless discrete cosine transform (LDCT)
- Meridian Lossless Packing (MLP)
- Monkey's Audio (Monkey's Audio APE)
- MPEG-4 SLS (also known as HD-AAC)
- OptimFROG
- Original Sound Quality (OSQ)
- RealPlayer (RealAudio Lossless)
- Shorten(SHN)
- TTA (True Audio Lossless)
- WavPack (WavPack lossless)
- WMA Lossless(Windows Media Lossless)
See also
- Comparison of audio coding formats
- Data compression#Audio
- Audio file format
- List of audio compression formats
References
- ^ The term "audio coding" can be seen in e.g. the name Advanced Audio Coding, and is analogous to the term video coding
- ^ "Video – Where is synchronization information stored in container formats?".
- ^ US patent 2605361, C. Chapin Cutler, "Differential Quantization of Communication Signals", issued 1952-07-29
- .
- ISSN 0005-8580.
- ^ ISBN 9783319056609.
- ISSN 1932-8346.
- S2CID 149806273. Archived from the original(PDF) on 2016-12-08. Retrieved 2019-10-20.
- ^ a b c Guckert, John (Spring 2012). "The Use of FFT and MDCT in MP3 Audio Compression" (PDF). University of Utah. Retrieved 14 July 2019.
- S2CID 58446992.
- .
- ^ ISBN 9780387782638.
- S2CID 897622.
- ^ a b Brandenburg, Karlheinz (1999). "MP3 and AAC Explained" (PDF). Archived (PDF) from the original on 2017-02-13.
- ^ "Video Developer Report 2019" (PDF). Bitmovin. 2019. Retrieved 5 November 2019.
- S2CID 897622.
- ^ Stanković, Radomir S.; Astola, Jaakko T. (2012). "Reminiscences of the Early Work in DCT: Interview with K.R. Rao" (PDF). Reprints from the Early Days of Information Sciences. 60. Retrieved 13 October 2019.
- ^ Xiph.Org Foundation (2009-06-02). "Vorbis I specification - 1.1.2 Classification". Xiph.Org Foundation. Retrieved 2009-09-22.
- ^ Terriberry, Timothy B. Presentation of the CELT codec. Presentation (PDF).
- arXiv:1602.04845.
- ^ Darko, John H. (2017-03-29). "The inconvenient truth about Bluetooth audio". DAR__KO. Archived from the original on 2018-01-14. Retrieved 2018-01-13.
- ^ Ford, Jez (2015-08-24). "What is Sony LDAC, and how does it do it?". AVHub. Retrieved 2018-01-13.
- ^ Ford, Jez (2016-11-22). "aptX HD - lossless or lossy?". AVHub. Retrieved 2018-01-13.
- ^ "Digital Theater Systems Audio Formats". Library of Congress. 27 December 2011. Retrieved 10 November 2019.
- ISBN 9780470041963.