Image compression

Image compression is a type of

statistical properties of image data to provide superior results compared with generic data compression methods which are used for other digital data.^[1]

Lossy and lossless image compression

Image compression may be lossy or lossless. Lossless compression is preferred for archival purposes and often for medical imaging, technical drawings, clip art, or comics. Lossy compression methods, especially when used at low bit rates, introduce compression artifacts. Lossy methods are especially suitable for natural images such as photographs in applications where minor (sometimes imperceptible) loss of fidelity is acceptable to achieve a substantial reduction in bit rate. Lossy compression that produces negligible differences may be called visually lossless.

Methods for lossy compression:

Transform coding – This is the most commonly used method.
- Nasir Ahmed, T. Natarajan and K. R. Rao in 1974.^[2] The DCT is sometimes referred to as "DCT-II" in the context of a family of discrete cosine transforms (see discrete cosine transform
  ). It is generally the most efficient form of image compression.
  - DCT is used in
    HEIF
    .
- The more recently developed wavelet transform is also used extensively, followed by quantization and entropy coding.
dithering to avoid posterization
.
- block palette, typically 2 or 4 colors for each block of 4x4 pixels, used in BTC, CCC, S2TC, and S3TC.
Chroma subsampling. This takes advantage of the fact that the human eye perceives spatial changes of brightness more sharply than those of color, by averaging or dropping some of the chrominance information in the image.
Fractal compression.
More recently, methods based on
Machine Learning were applied, using Multilayer perceptrons, Convolutional neural networks and Generative adversarial networks.^[3] Implementations are available in OpenCV, TensorFlow, MATLAB's Image Processing Toolbox (IPT), and the High-Fidelity Generative Image Compression (HiFiC) open source project.^[4]

Methods for lossless compression:

TGA, TIFF

Area image compression

Predictive coding – used in

DPCM

Entropy encoding – the two most common entropy encoding techniques are arithmetic coding and Huffman coding

Adaptive dictionary algorithms such as
GIF and TIFF

PNG, MNG, and TIFF

Chain codes
Diffusion models^[5]

Other properties

The best image quality at a given compression rate (or bit rate) is the main goal of image compression, however, there are other important properties of image compression schemes:

Scalability generally refers to a quality reduction achieved by manipulation of the bitstream or file (without decompression and re-compression). Other names for scalability are progressive coding or embedded bitstreams. Despite its contrary nature, scalability also may be found in lossless codecs, usually in form of coarse-to-fine pixel scans. Scalability is especially useful for previewing images while downloading them (e.g., in a web browser) or for providing variable quality access to e.g., databases. There are several types of scalability:

Quality progressive or layer progressive: The bitstream successively refines the reconstructed image.
Resolution progressive: First encode a lower image resolution; then encode the difference to higher resolutions.^[6]^[7]
Component progressive: First encode grey-scale version; then adding full color.

Region of interest coding. Certain parts of the image are encoded with higher quality than others. This may be combined with scalability (encode these parts first, others later).

Meta information. Compressed data may contain information about the image which may be used to categorize, search, or browse images. Such information may include color and texture statistics, small preview images, and author or copyright information.

Processing power. Compression algorithms require different amounts of

processing power

to encode and decode. Some high compression algorithms require high processing power.

The quality of a compression method often is measured by the peak signal-to-noise ratio. It measures the amount of noise introduced through a lossy compression of the image, however, the subjective judgment of the viewer also is regarded as an important measure, perhaps, being the most important measure.

History

Entropy coding started in the late 1940s with the introduction of Shannon–Fano coding,^[8] the basis for Huffman coding which was published in 1952.^[9] Transform coding dates back to the late 1960s, with the introduction of fast Fourier transform (FFT) coding in 1968 and the Hadamard transform in 1969.^[10]

An important development in image

digital photos,^[14] with several billion JPEG images produced every day as of 2015.^[15]

Portable Network Graphics (PNG) format.^[17]

The

video coding standard for digital cinema in 2004.^[23]

Notes and references

^ "Image Data Compression".
S2CID 149806273. Archived from the original
(PDF) on 2011-11-25.

^ Gilad David Maayan (Nov 24, 2021). "AI-Based Image Compression: The State of the Art". Towards Data Science. Retrieved 6 April 2023.

^ "High-Fidelity Generative Image Compression". Retrieved 6 April 2023.

^ Bühlmann, Matthias (2022-09-28). "Stable Diffusion Based Image Compression". Medium. Retrieved 2022-11-02.

S2CID 8018433
.

^ Shao, Dan; Kropatsch, Walter G. (February 3–5, 2010). Špaček, Libor; Franc, Vojtěch (eds.). "Irregular Laplacian Graph Pyramid" (PDF). Computer Vision Winter Workshop 2010. Nové Hrady, Czech Republic: Czech Pattern Recognition Society. Archived (PDF) from the original on 2013-05-27.

hdl:11858/00-001M-0000-002C-4314-2. Archived
(PDF) from the original on 2011-05-24. Retrieved 2019-04-21.

doi:10.1109/JRPROC.1952.273898, archived
(PDF) from the original on 2005-10-08

doi:10.1109/PROC.1969.6869
.

doi:10.1016/1051-2004(91)90086-Z
.

CCITT. September 1992. Archived
(PDF) from the original on 2000-08-18. Retrieved 12 July 2019.

BT.com. BT Group
. 31 May 2018. Retrieved 5 August 2019.

^ "What Is a JPEG? The Invisible Object You See Every Day". The Atlantic. 24 September 2013. Retrieved 13 September 2019.

^ Baraniuk, Chris (15 October 2015). "Copy protections could come to JPEGs". BBC News. BBC. Retrieved 13 September 2019.

^ "The GIF Controversy: A Software Developer's Perspective". 27 January 1995. Retrieved 26 May 2015.

doi:10.17487/RFC1951. RFC 1951
. Retrieved 2014-04-23.

ISBN 9781461507994
.

^
S2CID 2765169. Archived from the original
(PDF) on 2019-10-13.

^ Sullivan, Gary (8–12 December 2003). "General characteristics and design considerations for temporal subband video coding". ITU-T. Video Coding Experts Group. Retrieved 13 September 2019.

ISBN 9780080922508
.

S2CID 109186495
.

ISBN 9780240806174
.

container formats
Video
compression
ISO, IEC,
MPEG

DV

MJPEG

Motion JPEG 2000

MPEG-1

MPEG-2
Part 2

MPEG-4
Part 2 / ASP

Part 10 / AVC

Part 33 / IVC

MPEG-H
Part 2 / HEVC

MPEG-I
Part 3 / VVC

MPEG-5

Part 1 / EVC

Part 2 / LCEVC

ITU-T, VCEG

H.120

H.261

H.262

H.263

H.264 / AVC

H.265 / HEVC

H.266 / VVC

SMPTE

VC-1

VC-2

VC-3

VC-5

VC-6

TrueMotion

TrueMotion S

VP3

VP6

VP7

VP8

VP9

AV1

Others

Apple Video

AVS

Bink

Cinepak

Daala

DVI

FFV1

Huffyuv

Indeo

Lagarith

Microsoft Video 1

MSU Lossless

OMS Video

Pixlet

ProRes
422

4444

QuickTime
Animation

Graphics

RealVideo

RTVideo

SheerVideo

Smacker

Sorenson Video/Spark

Theora

Thor

Ut

WMV

XEB

YULS

Audio
compression
ISO, IEC,
MPEG

MPEG-1 Layer II
Multichannel

MPEG-1 Layer I

MPEG-1 Layer III (MP3)

AAC
HE-AAC

AAC-LD

MPEG Surround

MPEG-4 ALS

MPEG-4 SLS

MPEG-4 DST

MPEG-4 HVXC

MPEG-4 CELP

MPEG-D USAC

MPEG-H 3D Audio

ITU-T

G.711
A-law

µ-law

G.718

G.719

G.722

G.722.1

G.722.2

G.723

G.723.1

G.726

G.728

G.729

G.729.1

IETF

Opus

iLBC

Speex

Vorbis

3GPP

AMR

AMR-WB

AMR-WB+

EVRC

EVRC-B

EVS

GSM-HR

GSM-FR

GSM-EFR

ETSI

AC-3

AC-4

DTS

Bluetooth SIG

SBC

LC3

Others

ACELP

ALAC

Asao

ATRAC

AVS

CELT

Codec 2

DRA

FLAC

iSAC

MELP

Monkey's Audio

MT9

Musepack

OptimFROG

OSQ

QCELP

RCELP

RealAudio

RTAudio

SD2

SHN

SILK

Siren

SMV

SVOPC

TTA
True Audio

TwinVQ

VMR-WB

VSELP

WavPack

WMA

MQA

aptX

aptX HD

aptX Low Latency

aptX Adaptive

LDAC

LHDC

LLAC

Image
compression
IEC, ISO, IETF,
W3C, ITU-T, JPEG

CCITT Group 4

GIF

HEIC / HEIF

HEVC

JBIG

JBIG2

JPEG

JPEG 2000

JPEG-LS

JPEG XL

JPEG XR

JPEG XS

JPEG XT

PNG

TIFF

TIFF/EP

TIFF/IT

Others

APNG

AV1

AVIF

BPG

DjVu

EXR

FLIF

ICER

MNG

PGF

QOI

QTVR

WBMP

WebP

Containers
ISO, IEC

MPEG-ES
MPEG-PES

MPEG-PS

MPEG-TS

ISO/IEC base media file format

MPEG-4 Part 14
(MP4)

Motion JPEG 2000

MPEG-21 Part 9

MPEG media transport

ITU-T

H.222.0

T.802

IETF

RTP

Ogg

SMPTE

GXF

MXF

Others

3GP and 3G2

AMV

ASF

AIFF

AVI

AU

BPG

Bink
Smacker

BMP

DivX Media Format

EVO

Flash Video

HEIF

IFF

M2TS

Matroska
WebM

QuickTime File Format

RatDVD

RealMedia

RIFF
WAV

MOD and TOD

VOB, IFO and BUP

Collaborations

NETVC

MPEG LA

Alliance for Open Media

Methods

Entropy

Arithmetic

Huffman

Modified

LPC
ACELP

CELP

LSP

WLPC

Lossless

Lossy

LZ
DEFLATE

LZW

PCM
A-law

µ-law

ADPCM

DPCM

Transforms
DCT

FFT

MDCT

Wavelet
Daubechies

DWT

Lists

Comparison of audio coding formats

Comparison of video codecs

List of codecs

See Compression methods for techniques and Compression software for codecs

Retrieved from "https://en.wikipedia.org/w/index.php?title=Image_compression&oldid=1215074125"

[1] "Image Data Compression".

[2] S2CID 149806273. Archived from the original
(PDF) on 2011-11-25.

[3] Gilad David Maayan (Nov 24, 2021). "AI-Based Image Compression: The State of the Art". Towards Data Science. Retrieved 6 April 2023.

[4] "High-Fidelity Generative Image Compression". Retrieved 6 April 2023.

[5] Bühlmann, Matthias (2022-09-28). "Stable Diffusion Based Image Compression". Medium. Retrieved 2022-11-02.

[6] S2CID 8018433
.

[7] Shao, Dan; Kropatsch, Walter G. (February 3–5, 2010). Špaček, Libor; Franc, Vojtěch (eds.). "Irregular Laplacian Graph Pyramid" (PDF). Computer Vision Winter Workshop 2010. Nové Hrady, Czech Republic: Czech Pattern Recognition Society. Archived (PDF) from the original on 2013-05-27.

[Shannon-8] :11858/00-001M-0000-002C-4314-2. Archived
(PDF) from the original on 2011-05-24. Retrieved 2019-04-21.

[Huffman-9] :10.1109/JRPROC.1952.273898, archived
(PDF) from the original on 2005-10-08

[Hadamard-10] :10.1109/PROC.1969.6869
.

[Ahmed-11] :10.1016/1051-2004(91)90086-Z
.

[t81-12] CCITT. September 1992. Archived
(PDF) from the original on 2000-08-18. Retrieved 12 July 2019.

[13] BT.com. BT Group
. 31 May 2018. Retrieved 5 August 2019.

[Atlantic-14] "What Is a JPEG? The Invisible Object You See Every Day". The Atlantic. 24 September 2013. Retrieved 13 September 2019.

[15] Baraniuk, Chris (15 October 2015). "Copy protections could come to JPEGs". BBC News. BBC. Retrieved 13 September 2019.

[cloanto-16] "The GIF Controversy: A Software Developer's Perspective". 27 January 1995. Retrieved 26 May 2015.

[IETF-17] :10.17487/RFC1951. RFC 1951
. Retrieved 2014-04-23.

[18] ISBN 9781461507994
.

[Unser-19] 
S2CID 2765169. Archived from the original
(PDF) on 2019-10-13.

[20] Sullivan, Gary (8–12 December 2003). "General characteristics and design considerations for temporal subband video coding". ITU-T. Video Coding Experts Group. Retrieved 13 September 2019.

[21] ISBN 9780080922508
.

[22] S2CID 109186495
.

[23] ISBN 9780240806174
.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[14]

[15]

[17]

[23]