Audio signal processing

Source: Wikipedia, the free encyclopedia.

Audio signal processing is a subfield of signal processing that is concerned with the electronic manipulation of audio signals. Audio signals are electronic representations of sound waves, longitudinal waves travelling through the air that consist of compressions and rarefactions. The energy contained in audio signals, or sound power level, is typically measured in decibels. As audio signals may be represented in either digital or analog
format, processing may occur in either domain. Analog processors operate directly on the electrical signal, while digital processors operate mathematically on its digital representation.

History

The motivation for audio signal processing began at the beginning of the 20th century with inventions like the telephone, phonograph, and radio that allowed for the transmission and storage of audio signals. Audio processing was necessary for early radio broadcasting, as there were many problems with studio-to-transmitter links.[1] The theory of signal processing and its application to audio was largely developed at Bell Labs in the mid 20th century. Claude Shannon and Harry Nyquist's early work on communication theory, sampling theory and pulse-code modulation (PCM) laid the foundations for the field. In 1957, Max Mathews became the first person to synthesize audio from a computer, giving birth to computer music.

Major developments in digital audio coding and audio data compression include differential pulse-code modulation (DPCM) by C. Chapin Cutler at Bell Labs in 1950,[2] linear predictive coding (LPC) in the 1960s,[3] adaptive DPCM (ADPCM) by P. Cummiskey, Nikil S. Jayant and James L. Flanagan at Bell Labs in 1973,[4][5] discrete cosine transform (DCT) coding,[6] and modified discrete cosine transform (MDCT) coding by J. P. Princen, A. W. Johnson and A. B. Bradley in 1987.[7] The MDCT later became the basis for widely used audio coding formats such as MP3[8][9] and Advanced Audio Coding (AAC).[10]

Types

Analog

An analog audio signal is a continuous signal represented by an electrical voltage or current that is analogous to the sound waves in the air. Analog signal processing then involves physically altering the continuous signal by changing the voltage or current or charge via electrical circuits.
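
For illustration, a standard textbook example of such a circuit is the first-order RC low-pass filter; the component values below are arbitrary and are not taken from this article. Its transfer function and cutoff frequency are

    H(s) = \frac{1}{1 + sRC}, \qquad f_c = \frac{1}{2\pi RC}

so that, for example, R = 1.6 kΩ and C = 10 nF give a cutoff of roughly 10 kHz, attenuating signal content above that frequency.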

Historically, before the advent of widespread digital technology, analog was the only method by which to manipulate a signal. Since that time, as computers and software have become more capable and affordable, digital signal processing has become the method of choice. However, in music applications, analog technology is often still desirable as it often produces nonlinear responses that are difficult to replicate with digital filters.

Digital

A digital representation expresses the audio waveform as a sequence of symbols, usually binary numbers. This permits signal processing using digital circuits such as digital signal processors, microprocessors and general-purpose computers. Most modern audio systems use a digital approach as the techniques of digital signal processing are much more powerful and efficient than analog domain signal processing.[11]
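
As a minimal sketch of what operating mathematically on a digital representation can look like (the gain, cutoff frequency and sample rate below are arbitrary illustrative values, not drawn from this article), the following Python fragment applies a gain and a simple one-pole low-pass filter to an array of samples:

    import numpy as np

    def apply_gain(samples: np.ndarray, gain_db: float) -> np.ndarray:
        """Scale the samples by a gain expressed in decibels."""
        return samples * (10.0 ** (gain_db / 20.0))

    def one_pole_lowpass(samples: np.ndarray, cutoff_hz: float, sample_rate: float) -> np.ndarray:
        """Simple recursive low-pass filter: y[n] = y[n-1] + a * (x[n] - y[n-1])."""
        a = 1.0 - np.exp(-2.0 * np.pi * cutoff_hz / sample_rate)
        out = np.zeros(len(samples))
        y = 0.0
        for n, x in enumerate(samples):
            y = y + a * (x - y)
            out[n] = y
        return out

    # Illustrative use: one second of quiet noise, boosted by 6 dB, then low-passed at 1 kHz.
    rate = 48000
    noise = 0.1 * np.random.randn(rate)
    processed = one_pole_lowpass(apply_gain(noise, 6.0), 1000.0, rate)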

Applications

Processing methods and application areas include storage, data compression, music information retrieval, speech processing, localization, acoustic detection, transmission, noise cancellation, acoustic fingerprinting, sound recognition, synthesis, and enhancement (e.g. equalization, filtering, level compression, echo and reverb removal or addition, etc.).

Audio broadcasting

Audio signal processing is used when broadcasting audio signals in order to enhance their fidelity or optimize for bandwidth or latency. In this domain, the most important audio processing takes place just before the transmitter. The audio processor here must prevent or minimize overmodulation, compensate for non-linear transmitters (a potential issue with medium wave and shortwave broadcasting), and adjust overall loudness to the desired level.
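
Real broadcast processors rely on multiband compression and limiting, but the basic loudness-adjustment idea can be sketched with a hypothetical peak normalizer; the function name and target level here are assumptions made only for illustration:

    import numpy as np

    def normalize_peak(samples: np.ndarray, target_dbfs: float = -1.0) -> np.ndarray:
        """Scale the signal so its largest peak sits at target_dbfs (decibels relative to full scale)."""
        peak = np.max(np.abs(samples))
        if peak == 0.0:
            return samples
        return samples * (10.0 ** (target_dbfs / 20.0) / peak)

    # Illustrative use: bring a quiet programme signal up to -1 dBFS before the transmitter.
    # loud = normalize_peak(programme, target_dbfs=-1.0)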

Active noise control

Active noise control is a technique designed to reduce unwanted sound. By creating a signal that is identical to the unwanted noise but with the opposite polarity, the two signals cancel out due to destructive interference.
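
In the idealized case this amounts to no more than a polarity flip and a sum, as in the sketch below; a real controller must estimate the noise and account for acoustic delay, and the 100 Hz hum is only an arbitrary example:

    import numpy as np

    rate = 48000
    t = np.arange(rate) / rate
    noise = 0.5 * np.sin(2.0 * np.pi * 100.0 * t)   # unwanted 100 Hz hum
    anti_noise = -noise                              # identical signal with opposite polarity
    residual = noise + anti_noise                    # destructive interference

    print(np.max(np.abs(residual)))                  # 0.0 in this idealized example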

Audio synthesis

Audio synthesis is the electronic generation of audio signals. A musical instrument that accomplishes this is called a synthesizer. Synthesizers can either imitate sounds or generate new ones. Audio synthesis is also used to generate human speech using speech synthesis.
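
A minimal sketch of synthesis is generating a pure tone directly from a formula; the 440 Hz frequency, amplitude and sample rate are arbitrary example values:

    import numpy as np

    def synthesize_sine(freq_hz: float, duration_s: float, sample_rate: int = 44100, amplitude: float = 0.5) -> np.ndarray:
        """Generate a sine tone as an array of audio samples."""
        t = np.arange(int(duration_s * sample_rate)) / sample_rate
        return amplitude * np.sin(2.0 * np.pi * freq_hz * t)

    tone = synthesize_sine(440.0, 1.0)   # one second of concert A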

Audio effects

Audio effects alter the sound of a musical instrument or other audio source. Common effects include distortion, often used with electric guitar in electric blues and rock music; dynamic effects such as volume pedals and compressors, which affect loudness; filters such as wah-wah pedals and graphic equalizers, which modify frequency ranges; modulation effects, such as chorus, flangers and phasers; pitch effects such as pitch shifters; and time effects, such as reverb and delay, which create echoing sounds and emulate the sound of different spaces.
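
A basic feedback delay, the building block behind echo and many artificial reverb designs, can be sketched as follows; the delay time and feedback amount are illustrative assumptions, and the feedback factor must stay below 1.0 for the repeats to die away:

    import numpy as np

    def feedback_delay(samples: np.ndarray, delay_s: float, feedback: float, sample_rate: int = 44100) -> np.ndarray:
        """Add delayed, progressively quieter repeats of the signal to itself (a simple echo)."""
        d = int(delay_s * sample_rate)
        out = samples.astype(float).copy()
        for n in range(d, len(out)):
            out[n] += feedback * out[n - d]
        return out

    # Illustrative use: a 300 ms echo with 40% feedback on a recorded signal.
    # wet = feedback_delay(dry, delay_s=0.3, feedback=0.4)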

Musicians, audio engineers and record producers use effects units during live performances or in the studio, typically with electric guitar, bass guitar, electronic keyboard or electric piano. While effects are most frequently used with electric or electronic instruments, they can be used with any audio source, such as acoustic instruments, drums, and vocals.[12][13]

Computer audition

Computer audition (CA) or machine listening is the general field of study of algorithms and systems for audio interpretation by machines.[14][15] Since the notion of what it means for a machine to "hear" is very broad and somewhat vague, computer audition attempts to bring together several disciplines that originally dealt with specific problems or had a concrete application in mind. The engineer Paris Smaragdis, interviewed in Technology Review, talks about these systems — "software that uses sound to locate people moving through rooms, monitor machinery for impending breakdowns, or activate traffic cameras to record accidents."[16]

Inspired by models of human audition, CA deals with questions of representation, transduction, grouping, use of musical knowledge and general sound semantics for the purpose of performing intelligent operations on audio and music signals by the computer. Technically this requires a combination of methods from the fields of signal processing, auditory modelling, music perception and cognition, pattern recognition, and machine learning, as well as more traditional methods of artificial intelligence for musical knowledge representation.[17][18]
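
As a small example of the signal-processing building blocks such systems rest on, the sketch below computes the spectral centroid of one frame of audio, a rough measure of perceived brightness; the frame length and sample rate in the usage note are assumptions made for illustration:

    import numpy as np

    def spectral_centroid(frame: np.ndarray, sample_rate: float) -> float:
        """Return the magnitude-weighted mean frequency of one frame of audio, in hertz."""
        spectrum = np.abs(np.fft.rfft(frame))
        freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
        total = np.sum(spectrum)
        if total == 0.0:
            return 0.0
        return float(np.sum(freqs * spectrum) / total)

    # Illustrative use on a 1024-sample frame taken from a 44.1 kHz recording:
    # brightness = spectral_centroid(frame, 44100.0)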

See also

References

  1. ISBN 0-471-79147-4.
  2. ^ US patent 2605361, C. Chapin Cutler, "Differential Quantization of Communication Signals", issued 1952-07-29 
  3. (PDF) from the original on 2022-10-09.
  4. ^ P. Cummiskey, Nikil S. Jayant, and J. L. Flanagan, "Adaptive quantization in differential PCM coding of speech", Bell Syst. Tech. J., vol. 52, pp. 1105—1118, Sept. 1973
  5. ISSN 0005-8580.
  6. (PDF) from the original on 2022-10-09.
  7. ^ J. P. Princen, A. W. Johnson und A. B. Bradley: Subband/transform coding using filter bank designs based on time domain aliasing cancellation, IEEE Proc. Intl. Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2161–2164, 1987.
  8. .
  9. ^ Guckert, John (Spring 2012). "The Use of FFT and MDCT in MP3 Audio Compression" (PDF). University of Utah. Archived (PDF) from the original on 2022-10-09. Retrieved 14 July 2019.
  10. ^ Brandenburg, Karlheinz (1999). "MP3 and AAC Explained" (PDF). Archived (PDF) from the original on 2017-02-13.
  11. .
  12. .
  13. .
  14. .
  15. ^ "Machine Audition: Principles, Algorithms and Systems" (PDF).
  16. ^ Paris Smaragdis taught computers how to play more life-like music
  17. .
  18. .

Further reading