Level of measurement

Level of measurement or scale of measure is a classification that describes the nature of information within the values assigned to variables.^[1] Psychologist Stanley Smith Stevens developed the best-known classification with four levels, or scales, of measurement: nominal, ordinal, interval, and ratio.^[1]^[2] This framework of distinguishing levels of measurement originated in psychology and has since had a complex history, being adopted and extended in some disciplines and by some scholars, and criticized or rejected by others.^[3] Other classifications include those by Mosteller and Tukey,^[4] and by Chrisman.^[5]

Stevens's typology

Overview

Stevens proposed his typology in a 1946

quantitative" (to a different degree, all the rest of his scales). The concept of scale types later received the mathematical rigour that it lacked at its inception with the work of mathematical psychologists Theodore Alper (1985, 1987), Louis Narens (1981a, b), and R. Duncan Luce

(1986, 1987, 2001). As Luce (1997, p. 395) wrote:

S. S. Stevens (1946, 1951, 1975) claimed that what counted was having an interval or ratio scale. Subsequent research has given meaning to this assertion, but given his attempts to invoke scale type ideas it is doubtful if he understood it himself ... no measurement theorist I know accepts Stevens's broad definition of measurement ... in our view, the only sensible meaning for 'rule' is empirically testable laws about the attribute.

Comparison

Incremental
progress Measure property Mathematical
operators Advanced
operations Central
tendency Variability

Nominal Classification, membership =, ≠ Grouping Mode Qualitative variation

Ordinal Comparison, level >, < Sorting Median Range,
interquartile range

Interval Difference, affinity +, − Comparison to a standard Arithmetic mean Deviation

Ratio Magnitude, amount ×, / Ratio Geometric mean,
harmonic mean Coefficient of variation,
studentized range

Nominal level

The nominal type differentiates between items or subjects based only on their names or (meta-)categories and other qualitative classifications they belong to; thus
globally unique identifier
.
Examples of these classifications include gender, nationality, ethnicity, language, genre, style, biological species, and form.[6]^[7] In a university one could also use residence hall or department affiliation as examples. Other concrete examples are

in
parts of speech
: noun, verb, preposition, article, pronoun, etc.

in politics, power projection: hard power, soft power, etc.

in biology, the taxonomic ranks below domains: Archaea, Bacteria, and Eukarya

in
faults
: specification faults, design faults, and code faults

Nominal scales were often called qualitative scales, and measurements made on qualitative scales were called qualitative data. However, the rise of qualitative research has made this usage confusing. If numbers are assigned as labels in nominal measurement, they have no specific numerical value or meaning. No form of arithmetic computation (+, −, ×, etc.) may be performed on nominal measures. The nominal level is the lowest measurement level used from a statistical point of view.

Mathematical operations

non-trivial operations
that generically apply to objects of the nominal type.

Central tendency

The mode, i.e. the most common item, is allowed as the measure of central tendency for the nominal type. On the other hand, the median, i.e. the middle-ranked item, makes no sense for the nominal type of data since ranking is meaningless for the nominal type.^[8]

Ordinal scale

Further information: Ordinal data

The ordinal type allows for
rank order (1st, 2nd, 3rd, etc.) by which data can be sorted but still does not allow for a relative degree of difference between them. Examples include, on one hand, dichotomous data with dichotomous (or dichotomized) values such as 'sick' vs. 'healthy' when measuring health, 'guilty' vs. 'not-guilty' when making judgments in courts, 'wrong/false' vs. 'right/true' when measuring truth value, and, on the other hand, non-dichotomous data consisting of a spectrum of values, such as 'completely agree', 'mostly agree', 'mostly disagree', 'completely disagree' when measuring opinion
.
The ordinal scale places events in order, but there is no attempt to make the intervals of the scale equal in terms of some rule. Rank orders represent ordinal scales and are frequently used in research relating to qualitative phenomena. A student's rank in his graduation class involves the use of an ordinal scale. One has to be very careful in making a statement about scores based on ordinal scales. For instance, if Devi's position in his class is 10 and Ganga's position is 40, it cannot be said that Devi's position is four times as good as that of Ganga. Ordinal scales only permit the ranking of items from highest to lowest. Ordinal measures have no absolute values, and the real differences between adjacent ranks may not be equal. All that can be said is that one person is higher or lower on the scale than another, but more precise comparisons cannot be made. Thus, the use of an ordinal scale implies a statement of 'greater than' or 'less than' (an equality statement is also acceptable) without our being able to state how much greater or less. The real difference between ranks 1 and 2, for instance, may be more or less than the difference between ranks 5 and 6. Since the numbers of this scale have only a rank meaning, the appropriate measure of central tendency is the median. A percentile or quartile measure is used for measuring dispersion. Correlations are restricted to various rank order methods. Measures of statistical significance are restricted to the non-parametric methods (R. M. Kothari, 2004).

Central tendency

The median, i.e. middle-ranked, item is allowed as the measure of central tendency; however, the mean (or average) as the measure of central tendency is not allowed. The mode is allowed.
In 1946, Stevens observed that psychological measurement, such as measurement of opinions, usually operates on ordinal scales; thus means and standard deviations have no
cognitive and other abilities, are ordinal, although some theoreticians have argued they can be treated as interval or ratio scales. However, there is little prima facie evidence to suggest that such attributes are anything more than ordinal (Cliff, 1996; Cliff & Keats, 2003; Michell, 2008).^[9] In particular,^[10] IQ scores reflect an ordinal scale, in which all scores are meaningful for comparison only.^[11]^[12]^[13] There is no absolute zero, and a 10-point difference may carry different meanings at different points of the scale.^[14]^[15]

Interval scale

The interval type allows for defining the degree of difference between measurements, but not the ratio between measurements. Examples include
affine line
).

Central tendency and statistical dispersion
The mode, median, and arithmetic mean are allowed to measure central tendency of interval variables, while measures of statistical dispersion include range and standard deviation. Since one can only divide by differences, one cannot define measures that require some ratios, such as the coefficient of variation. More subtly, while one can define moments about the origin, only central moments are meaningful, since the choice of origin is arbitrary. One can define standardized moments, since ratios of differences are meaningful, but one cannot define the coefficient of variation, since the mean is a moment about the origin, unlike the standard deviation, which is (the square root of) a central moment.

Ratio scale

See also: Positive real numbers § Ratio scale

The ratio type takes its name from the fact that measurement is the estimation of the ratio between a magnitude of a continuous quantity and a
plane angle, energy and electric charge. In contrast to interval scales, ratios can be compared using division. Very informally, many ratio scales can be described as specifying "how much" of something (i.e. an amount or magnitude). Ratio scale is often used to express an order of magnitude such as for temperature in Orders of magnitude (temperature)
.

Central tendency and statistical dispersion

The geometric mean and the harmonic mean are allowed to measure the central tendency, in addition to the mode, median, and arithmetic mean. The studentized range and the coefficient of variation are allowed to measure statistical dispersion. All statistical measures are allowed because all necessary mathematical operations are defined for the ratio scale.

Debate on Stevens's typology

While Stevens's typology is widely adopted, it is still being challenged by other theoreticians, particularly in the cases of the nominal and ordinal types (Michell, 1986).^[16] Duncan (1986), for example, objected to the use of the word measurement in relation to the nominal type and Luce (1997) disagreed with Steven's definition of measurement.
On the other hand, Stevens (1975) said of his own definition of measurement that "the assignment can be any consistent rule. The only rule not allowed would be random assignment, for randomness amounts in effect to a nonrule". Hand says, "Basic psychology texts often begin with Stevens's framework and the ideas are ubiquitous. Indeed, the essential soundness of his hierarchy has been established for representational measurement by mathematicians, determining the invariance properties of mappings from empirical systems to real number continua. Certainly the ideas have been revised, extended, and elaborated, but the remarkable thing is his insight given the relatively limited formal apparatus available to him and how many decades have passed since he coined them."^[17]
Although Stevens suggested that the level of measurement of a set of observations dictates which mathematical or statistical operations are permissible, statistical analyses themselves do not typically make assumptions about levels of measurement.^[18]
The use of the mean as a measure of the central tendency for the ordinal type is still debatable among those who accept Stevens's typology. Many behavioural scientists use the mean for ordinal data, anyway. This is often justified on the basis that the ordinal type in behavioural science is in fact somewhere between the true ordinal and interval types; although the interval difference between two ordinal ranks is not constant, it is often of the same order of magnitude.
For example, applications of measurement models in educational contexts often indicate that total scores have a fairly linear relationship with measurements across the range of an assessment. Thus, some argue that so long as the unknown interval difference between ordinal scale ranks is not too variable, interval scale statistics such as means can meaningfully be used on ordinal scale variables. Statistical analysis software such as SPSS requires the user to select the appropriate measurement class for each variable. This ensures that subsequent user errors cannot inadvertently perform meaningless analyses (for example correlation analysis with a variable on a nominal level).
L. L. Thurstone made progress toward developing a justification for obtaining the interval type, based on the law of comparative judgment. A common application of the law is the analytic hierarchy process. Further progress was made by Georg Rasch (1960), who developed the probabilistic Rasch model
that provides a theoretical basis and justification for obtaining interval-level measurements from counts of observations such as total scores on assessments.

Other proposed typologies

Typologies aside from Stevens's typology have been proposed. For instance, Mosteller and Tukey (1977), Nelder (1990)^[19] described continuous counts, continuous ratios, count ratios, and categorical modes of data. See also Chrisman (1998), van den Berg (1991).^[20]

Mosteller and Tukey's typology (1977)

Mosteller and Tukey^[4] noted that the four levels are not exhaustive and proposed:

Names

Grades (ordered labels like beginner, intermediate, advanced)

Ranks (orders with 1 being the smallest or largest, 2 the next smallest or largest, and so on)

Counted fractions (bound by 0 and 1)

Counts (non-negative integers)

Amounts (non-negative real numbers)

Balances (any real number)

For example, percentages (a variation on fractions in the Mosteller–Tukey framework) do not fit well into Stevens's framework: No transformation is fully admissible.^[16]

Chrisman's typology (1998)

Nicholas R. Chrisman^[5] introduced an expanded list of levels of measurement to account for various measurements that do not necessarily fit with the traditional notions of levels of measurement. Measurements bound to a range and repeating (like degrees in a circle, clock time, etc.), graded membership categories, and other types of measurement do not fit to Stevens's original work, leading to the introduction of six new levels of measurement, for a total of ten:

Nominal

Gradation of membership

Ordinal

Interval

Log-interval

Extensive ratio

Cyclical ratio

Derived ratio

Counts

Absolute

While some claim that the extended levels of measurement are rarely used outside of academic geography,
fuzzy set theory, while absolute measurements include probabilities and the plausibility and ignorance in Dempster–Shafer theory
. Cyclical ratio measurements include angles and times. Counts appear to be ratio measurements, but the scale is not arbitrary and fractional counts are commonly meaningless. Log-interval measurements are commonly displayed in stock market graphics. All these types of measurements are commonly used outside academic geography, and do not fit well to Stevens' original work.

Scale types and Stevens's "operational theory of measurement"

The theory of scale types is the intellectual handmaiden to Stevens's "operational theory of measurement", which was to become definitive within psychology and the
British Association for the Advancement of Science to investigate the possibility of genuine scientific measurement in the psychological and behavioral sciences. This committee, which became known as the Ferguson committee, published a Final Report (Ferguson, et al., 1940, p. 245) in which Stevens's sone
scale (Stevens & Davis, 1938) was an object of criticism:

…any law purporting to express a quantitative relation between sensation intensity and stimulus intensity is not merely false but is in fact meaningless unless and until a meaning can be given to the concept of addition as applied to sensation.

That is, if Stevens's
concatenation operations. This conclusion was later rendered false by the discovery of the theory of conjoint measurement
by Debreu (1960) and independently by Luce & Tukey (1964). However, Stevens's reaction was not to conduct experiments to test for the presence of additive structure in sensations, but instead to render the conclusions of the Ferguson committee null and void by proposing a new theory of measurement:

Paraphrasing N. R. Campbell (Final Report, p.340), we may say that measurement, in the broadest sense, is defined as the assignment of numerals to objects and events according to rules (Stevens, 1946, p.677).

Stevens was greatly influenced by the ideas of another Harvard academic,
operationalism Stevens used to define measurement. In Stevens's definition, for example, it is the use of a tape measure that defines length (the object of measurement) as being measurable (and so by implication quantitative). Critics of operationism object that it confuses the relations between two objects or events for properties of one of those of objects or events.^[23]^[24]
(Moyer, 1981a,b; Rogers, 1989).
The Canadian measurement theorist William Rozeboom was an early and trenchant critic of Stevens's theory of scale types.[25]

Same variable may be different scale type depending on context

Another issue is that the same variable may be a different scale type depending on how it is measured and on the goals of the analysis. For example, hair color is usually thought of as a nominal variable, since it has no apparent ordering.^[26] However, it is possible to order colors (including hair colors) in various ways, including by hue; this is known as colorimetry. Hue is an interval level variable.

See also

Cohen's kappa

Coherence (units of measurement)

Hume's principle

Inter-rater reliability

Logarithmic scale

Ramsey–Lewis method

Set theory

Statistical data type

Transition (linguistics)

References

^
ISBN 978-1-4020-5613-0
.

^
S2CID 4667599
.

doi:10.1037/0033-2909.100.3.398
.

^
ISBN 978-0201048544
.

^
ISSN 1523-0406. – via Taylor & Francis
(subscription required)

^ Nominal measures are based on sets and depend on categories, a la Aristotle: Chrisman, Nicholas (March 1995). "Beyond Stevens: A revised approach to measurement for geographic information". Retrieved 2014-08-25.

ISBN 0-262-63-032-X

PMID 21897729
.

LCCN 68011394
. Although, formally speaking, interval measurement can always be obtained by specification, such specification is theoretically meaningful only if it is implied by the theory and model relevant to the measurement procedure.
William W. Rozeboom (January 1969). "Reviewed Work: Statistical Theories of Mental Test Scores". American Educational Research Journal. 6 (1): 112–116.
JSTOR 1162101
.

ISBN 978-1-58488-814-7
. Although in practice IQ and most other human characteristics measured by psychological tests (such as anxiety, introversion, self esteem, etc.) are treated as interval scales, many researchers would argue that they are more appropriately categorized as ordinal scales. Such arguments would be based on the fact that such measures do not really meet the requirements of an interval scale, because it cannot be demonstrated that equal numerical differences at different points on the scale are comparable.

ISBN 978-0-669-61382-7
. The I.Q. is essentially a rank; there are no true "units" of intellectual ability.

ISBN 978-0-89079-585-9
. An IQ score is not an equal-interval score, as is evident in Table A.4 in the WISC-III manual.

ISBN 978-0-521-54478-8
. When we come to quantities like IQ or g, as we are presently able to measure them, we shall see later that we have an even lower level of measurement—an ordinal level. This means that the numbers we assign to individuals can only be used to rank them—the number tells us where the individual comes in the rank order and nothing else.

ISBN 978-1-56000-360-1
. Ideally, a scale of measurement should have a true zero-point and identical intervals. . . . Scales of hardness lack these advantages, and so does IQ. There is no absolute zero, and a 10-point difference may carry different meanings at different points of the scale.

ISBN 978-0-19-852367-3
. In the jargon of psychological measurement theory, IQ is an ordinal scale, where we are simply rank-ordering people. . . . It is not even appropriate to claim that the 10-point difference between IQ scores of 110 and 100 is the same as the 10-point difference between IQs of 160 and 150

^
JSTOR 2684788
.

S2CID 148934577
.

doi:10.15626/MP.2019.1916
.

^ Nelder, J. A. (1990). The knowledge needed to computerise the analysis and interpretation of statistical information. In Expert systems and artificial intelligence: the need for information about data. Library Association Report, London, March, 23–27.

^ van den Berg, G. (1991). Choosing an analysis method. Leiden: DSWO Press

S2CID 21372776
.

Percy Bridgman (1957) The Logic of Modern Physics

S2CID 170941474
.

^ Michell, J. (1999). Measurement in Psychology – A critical history of a methodological concept. Cambridge: Cambridge University Press.

S2CID 46970420
.

^ "What is the difference between categorical, ordinal and interval variables?". Institute for Digital Research and Education. University of California, Los Angeles. Archived from the original on 2016-01-25. Retrieved 7 February 2016.

Further reading

This 'further reading' section may need cleanup. Please read the editing guide and help improve the section. (June 2021) (Learn how and when to remove this template message)

Alper, T. M. (1985). "A note on real measurement structures of scale type (m, m + 1)". Journal of Mathematical Psychology. 29: 73–81.
doi:10.1016/0022-2496(85)90019-7
.

Alper, T. M. (1987). "A classification of all order-preserving homeomorphism groups of the reals that satisfy finite uniqueness". Journal of Mathematical Psychology. 31 (2): 135–154.
doi:10.1016/0022-2496(87)90012-5
.

Briand, L. & El Emam, K. & Morasca, S. (1995). On the Application of Measurement Theory in Software Engineering. Empirical Software Engineering, 1, 61–88. [On line] https://web.archive.org/web/20070926232755/http://www2.umassd.edu/swpi/ISERN/isern-95-04.pdf

ISBN 0-8058-1333-0

ISBN 0-8058-2093-0

Lord, Frederic M (December 1953). "On the Statistical Treatment of Football Numbers" (PDF). doi:10.1037/h0063675. Archived from the original
(PDF) on 20 July 2011. Retrieved 16 September 2010.

See also reprints in:
Readings in Statistics, Ch. 3, (Haber, A., Runyon, R. P., and Badia, P.) Reading, Mass: Addison–Wesley, 1970

Maranell, Gary Michael, ed. (2007). "Chapter 31". Scaling: A Sourcebook for Behavioral Scientists. New Brunswick, New Jersey & London, UK: Aldine Transaction. pp. 402–405.
ISBN 978-0-202-36175-8
. Retrieved 16 September 2010.

Hardcastle, G. L. (1995). "S. S. Stevens and the origins of operationism". Philosophy of Science. 62 (3): 404–424.
S2CID 170941474
.

Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison–Wesley.

Luce, R. D. (1986). "Uniqueness and homogeneity of ordered relational structures". Journal of Mathematical Psychology. 30 (4): 391–415.
S2CID 13567893
.

Luce, R. D. (1987). "Measurement structures with Archimedean ordered translation groups". Order. 4 (2): 165–189.
S2CID 16080432
.

Luce, R. D. (1997). "Quantification and symmetry: commentary on Michell 'Quantitative science and the definition of measurement in psychology'". British Journal of Psychology. 88 (3): 395–398.
doi:10.1111/j.2044-8295.1997.tb02645.x
.

Luce, R. D. (2000). Utility of uncertain gains and losses: measurement theoretic and experimental approaches. Mahwah, N.J.: Lawrence Erlbaum.

Luce, R. D. (2001). "Conditions equivalent to unit representations of ordered relational structures". Journal of Mathematical Psychology. 45 (1): 81–98.
S2CID 12231599
.

Luce, R. D.; Tukey, J. W. (1964). "Simultaneous conjoint measurement: a new scale type of fundamental measurement". Journal of Mathematical Psychology. 1: 1–27.
doi:10.1016/0022-2496(64)90015-x
.

Michell, J. (1986). "Measurement scales and statistics: a clash of paradigms". Psychological Bulletin. 100 (3): 398–407.
doi:10.1037/0033-2909.100.3.398
.

Michell, J. (1997). "Quantitative science and the definition of measurement in psychology". British Journal of Psychology. 88 (3): 355–383.
S2CID 143169737
.

Michell, J. (1999). Measurement in Psychology – A critical history of a methodological concept. Cambridge: Cambridge University Press.

Michell, J. (2008). "Is psychometrics pathological science?". Measurement – Interdisciplinary Research & Perspectives. 6 (1–2): 7–24.
S2CID 146702066
.

Narens, L. (1981a). "A general theory of ratio scalability with remarks about the measurement-theoretic concept of meaningfulness". Theory and Decision. 13: 1–70.
S2CID 119401596
.

Narens, L. (1981b). "On the scales of measurement". Journal of Mathematical Psychology. 24 (3): 249–275.
doi:10.1016/0022-2496(81)90045-6
.

Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: Danish Institute for Educational Research.

Rozeboom, W. W. (1966). "Scaling theory and the nature of measurement". Synthese. 16 (2): 170–233.
S2CID 46970420
.

PMID 17750512. Archived from the original
(PDF) on 25 November 2011. Retrieved 16 September 2010.

Stevens, S. S. (1951). Mathematics, measurement and psychophysics. In S. S. Stevens (Ed.), Handbook of experimental psychology (pp. 1–49). New York: Wiley.

Stevens, S. S. (1975). Psychophysics. New York: Wiley.

von Eye, A. (2005). "Review of Cliff and Keats, Ordinal measurement in the behavioral sciences". Applied Psychological Measurement. 29 (5): 401–403.
S2CID 220583753
.

v
t
e
Statistics

Outline

Index

Continuous data
Center

Mean
Arithmetic

Arithmetic-Geometric

Cubic

Generalized/power

Geometric

Harmonic

Heronian

Heinz

Lehmer

Median

Mode

Dispersion

Average absolute deviation

Coefficient of variation

Interquartile range

Percentile

Range

Standard deviation

Variance

Shape

Central limit theorem

Moments
Kurtosis

L-moments

Skewness

Count data

Index of dispersion

Summary tables

Contingency table

Frequency distribution

Grouped data

Dependence

Partial correlation

Pearson product-moment correlation

Rank correlation
Kendall's τ

Spearman's ρ

Scatter plot

Graphics

Bar chart

Biplot

Box plot

Control chart

Correlogram

Fan chart

Forest plot

Histogram

Pie chart

Q–Q plot

Radar chart

Run chart

Scatter plot

Stem-and-leaf display

Violin plot

Data collection
Study design

Effect size

Missing data

Optimal design

Population

Replication

Sample size determination

Statistic

Statistical power

Survey methodology

Sampling
Cluster

Stratified

Opinion poll

Questionnaire

Standard error

Controlled experiments

Blocking

Factorial experiment

Interaction

Random assignment

Randomized controlled trial

Randomized experiment

Scientific control

Adaptive designs

Adaptive clinical trial

Stochastic approximation

Up-and-down designs

Observational studies

Cohort study

Cross-sectional study

Natural experiment

Quasi-experiment

Statistical inference
Statistical theory

Population

Statistic

Probability distribution

Sampling distribution
Order statistic

Empirical distribution
Density estimation

Statistical model
Model specification

L^p space

Parameter
location

scale

shape

Parametric family
Likelihood (monotone)

Location–scale family

Exponential family

Completeness

Sufficiency

Statistical functional

Bootstrap

U

V

Optimal decision
loss function

Efficiency

Statistical distance
divergence

Asymptotics

Robustness

Frequentist inference
Point estimation

Estimating equations
Maximum likelihood

Method of moments

M-estimator

Minimum distance

Unbiased estimators
Mean-unbiased minimum-variance
Rao–Blackwellization

Lehmann–Scheffé theorem

Median unbiased

Plug-in

Interval estimation

Confidence interval

Pivot

Likelihood interval

Prediction interval

Tolerance interval

Resampling
Bootstrap

Jackknife

Testing hypotheses

1- & 2-tails

Power

Uniformly most powerful test

Permutation test
Randomization test

Multiple comparisons

Parametric tests

Likelihood-ratio

Score/Lagrange multiplier

Wald

Specific tests

Z-test (normal)

Student's t-test

F-test

Goodness of fit

Chi-squared

G-test

Kolmogorov–Smirnov

Anderson–Darling

Lilliefors

Jarque–Bera

Normality (Shapiro–Wilk)

Likelihood-ratio test

Model selection
Cross validation

AIC

BIC

Rank statistics

Sign
Sample median

Signed rank (Wilcoxon)
Hodges–Lehmann estimator

Rank sum (Mann–Whitney)

Nonparametric anova
1-way (Kruskal–Wallis)

2-way (Friedman)

Ordered alternative (Jonckheere–Terpstra)

Van der Waerden test

Bayesian inference

Bayesian probability
prior

posterior

Credible interval

Bayes factor

Bayesian estimator
Maximum posterior estimator

Correlation

Pearson product-moment

Partial correlation

Confounding variable

Coefficient of determination

Regression analysis

Errors and residuals

Regression validation

Mixed effects models

Simultaneous equations models

Multivariate adaptive regression splines (MARS)

Linear regression

Simple linear regression

Ordinary least squares

General linear model

Bayesian regression

Non-standard predictors

Nonlinear regression

Nonparametric

Semiparametric

Isotonic

Robust

Heteroscedasticity

Homoscedasticity

Generalized linear model

Exponential families

Logistic (Bernoulli) / Binomial / Poisson regressions

Partition of variance

Analysis of variance (ANOVA, anova)

Analysis of covariance

Multivariate ANOVA

Degrees of freedom

Categorical / Multivariate / Time-series / Survival analysis
Categorical

Cohen's kappa

Contingency table

Graphical model

Log-linear model

McNemar's test

Cochran–Mantel–Haenszel statistics

Multivariate

Regression

Manova

Principal components

Canonical correlation

Discriminant analysis

Cluster analysis

Classification

Structural equation model
Factor analysis

Multivariate distributions

Elliptical distributions
Normal

Time-series
General

Decomposition

Trend

Stationarity

Seasonal adjustment

Exponential smoothing

Cointegration

Structural break

Granger causality

Specific tests

Dickey–Fuller

Johansen

Q-statistic (Ljung–Box)

Durbin–Watson

Breusch–Godfrey

Time domain

Autocorrelation (ACF)
partial (PACF)

Cross-correlation (XCF)

ARMA model

ARIMA model (Box–Jenkins)

Autoregressive conditional heteroskedasticity (ARCH)

Vector autoregression (VAR)

Frequency domain

Spectral density estimation

Fourier analysis

Least-squares spectral analysis

Wavelet

Whittle likelihood

Survival
Survival function

Kaplan–Meier estimator (product limit)

Proportional hazards models

Accelerated failure time (AFT) model

First hitting time

Hazard function

Nelson–Aalen estimator

Test

Log-rank test

Applications
Biostatistics

Bioinformatics

Clinical trials / studies

Epidemiology

Medical statistics

Engineering statistics

Chemometrics

Methods engineering

Probabilistic design

Process / quality control

Reliability

System identification

Social statistics

Actuarial science

Census

Crime statistics

Demography

Econometrics

Jurimetrics

National accounts

Official statistics

Population statistics

Psychometrics

Spatial statistics

Cartography

Environmental statistics

Geographic information system

Geostatistics

Kriging

Category

Mathematics portal

Commons

WikiProject

Wikiversity has learning resources about Level of measurement

Retrieved from "https://en.wikipedia.org/w/index.php?title=Level_of_measurement&oldid=1217401021"

[Koch_2008-1] 
ISBN 978-1-4020-5613-0
.

[Stevens_1946-2] 
S2CID 4667599
.

[3] :10.1037/0033-2909.100.3.398
.

[Mosteller-4] 
ISBN 978-0201048544
.

[Chrisman-5] 
ISSN 1523-0406. – via Taylor & Francis
(subscription required)

[6] Nominal measures are based on sets and depend on categories, a la Aristotle: Chrisman, Nicholas (March 1995). "Beyond Stevens: A revised approach to measurement for geographic information". Retrieved 2014-08-25.

[7] ISBN 0-262-63-032-X

[8] PMID 21897729
.

[9] LCCN 68011394
. Although, formally speaking, interval measurement can always be obtained by specification, such specification is theoretically meaningful only if it is implied by the theory and model relevant to the measurement procedure.
William W. Rozeboom (January 1969). "Reviewed Work: Statistical Theories of Mental Test Scores". American Educational Research Journal. 6 (1): 112–116.
JSTOR 1162101
.

[10] William W. Rozeboom (January 1969). "Reviewed Work: Statistical Theories of Mental Test Scores". American Educational Research Journal. 6 (1): 112–116.
JSTOR 1162101
.

[10] ISBN 978-1-58488-814-7
. Although in practice IQ and most other human characteristics measured by psychological tests (such as anxiety, introversion, self esteem, etc.) are treated as interval scales, many researchers would argue that they are more appropriately categorized as ordinal scales. Such arguments would be based on the fact that such measures do not really meet the requirements of an interval scale, because it cannot be demonstrated that equal numerical differences at different points on the scale are comparable.

[11] ISBN 978-0-669-61382-7
. The I.Q. is essentially a rank; there are no true "units" of intellectual ability.

[12] ISBN 978-0-89079-585-9
. An IQ score is not an equal-interval score, as is evident in Table A.4 in the WISC-III manual.

[13] ISBN 978-0-521-54478-8
. When we come to quantities like IQ or g, as we are presently able to measure them, we shall see later that we have an even lower level of measurement—an ordinal level. This means that the numbers we assign to individuals can only be used to rank them—the number tells us where the individual comes in the rank order and nothing else.

[14] ISBN 978-1-56000-360-1
. Ideally, a scale of measurement should have a true zero-point and identical intervals. . . . Scales of hardness lack these advantages, and so does IQ. There is no absolute zero, and a 10-point difference may carry different meanings at different points of the scale.

[15] ISBN 978-0-19-852367-3
. In the jargon of psychological measurement theory, IQ is an ordinal scale, where we are simply rank-ordering people. . . . It is not even appropriate to claim that the 10-point difference between IQ scores of 110 and 100 is the same as the 10-point difference between IQs of 160 and 150

[Velleman_and_Wilkinson_1993-16] 
JSTOR 2684788
.

[Hand_2017-17] S2CID 148934577
.

[18] :10.15626/MP.2019.1916
.

[19] Nelder, J. A. (1990). The knowledge needed to computerise the analysis and interpretation of statistical information. In Expert systems and artificial intelligence: the need for information about data. Library Association Report, London, March, 23–27.

[20] van den Berg, G. (1991). Choosing an analysis method. Leiden: DSWO Press

[Wolman_2006-21] S2CID 21372776
.

[22] Percy Bridgman (1957) The Logic of Modern Physics

[23] S2CID 170941474
.

[24] Michell, J. (1999). Measurement in Psychology – A critical history of a methodological concept. Cambridge: Cambridge University Press.

[25] S2CID 46970420
.

[26] "What is the difference between categorical, ordinal and interval variables?". Institute for Digital Research and Education. University of California, Los Angeles. Archived from the original on 2016-01-25. Retrieved 7 February 2016.

[1]

[2]

[3]

[4]

[5]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[23]

[24]

[26]