Molecular clock

The molecular clock is a figurative term for a technique that uses the

sequences for DNA, RNA, or amino acid sequences for proteins

Early discovery and genetic equidistance

The notion of the existence of a so-called "molecular clock" was first attributed to Émile Zuckerkandl and Linus Pauling who, in 1962, noticed that the number of amino acid differences in hemoglobin between different lineages changes roughly linearly with time, as estimated from fossil evidence.^[1] They generalized this observation to assert that the rate of evolutionary change of any specified protein was approximately constant over time and over different lineages (known as the molecular clock hypothesis).

The genetic equidistance phenomenon was first noted in 1963 by Emanuel Margoliash, who wrote: "It appears that the number of residue differences between cytochrome c of any two species is mostly conditioned by the time elapsed since the lines of evolution leading to these two species originally diverged. If this is correct, the cytochrome c of all mammals should be equally different from the cytochrome c of all birds. Since fish diverges from the main stem of vertebrate evolution earlier than either birds or mammals, the cytochrome c of both mammals and birds should be equally different from the cytochrome c of fish. Similarly, all vertebrate cytochrome c should be equally different from the yeast protein."^[2] For example, the difference between the cytochrome c of a carp and a frog, turtle, chicken, rabbit, and horse is a very constant 13% to 14%. Similarly, the difference between the cytochrome c of a bacterium and yeast, wheat, moth, tuna, pigeon, and horse ranges from 64% to 69%. Together with the work of Emile Zuckerkandl and Linus Pauling, the genetic equidistance result led directly to the formal postulation of the molecular clock hypothesis in the early 1960s.^[3]

Similarly,

Pan troglodytes) albumin immunological cross-reactions suggested they were about equally different from Ceboidea (New World Monkey) species (within experimental error). This meant that they had both accumulated approximately equal changes in albumin since their shared common ancestor. This pattern was also found for all the primate comparisons they tested. When calibrated with the few well-documented fossil branch points (such as no Primate fossils of modern aspect found before the K-T boundary), this led Sarich and Wilson to argue that the human-chimp divergence probably occurred only ~4–6 million years ago.^[5]

Relationship with neutral theory

The observation of a clock-like rate of molecular change was originally purely phenomenological. Later, the work of Motoo Kimura^[6] developed the neutral theory of molecular evolution, which predicted a molecular clock. Let there be N individuals, and to keep this calculation simple, let the individuals be haploid (i.e. have one copy of each gene). Let the rate of neutral mutations (i.e. mutations with no effect on fitness) in a new individual be $\mu$ . The probability that this new mutation will become fixed in the population is then 1/N, since each copy of the gene is as good as any other. Every generation, each individual can have new mutations, so there are $\mu$ N new neutral mutations in the population as a whole. That means that each generation, $\mu$ new neutral mutations will become fixed. If most changes seen during molecular evolution are neutral, then fixations in a population will accumulate at a clock-rate that is equal to the rate of neutral mutations in an individual.

Nevertheless, there is some controversy regarding this view. Recently,

eukarya evolution^[8]

.

Calibration

To use molecular clocks to estimate divergence times, molecular clocks need to be "calibrated". This is because molecular data alone does not contain any information on absolute times. For viral phylogenetics and ancient DNA studies—two areas of evolutionary biology where it is possible to sample sequences over an evolutionary timescale—the dates of the intermediate samples can be used to calibrate the molecular clock. However, most phylogenies require that the molecular clock be calibrated using independent evidence about dates, such as the fossil record.^[9] There are two general methods for calibrating the molecular clock using fossils: node calibration and tip calibration.^[10]

Node calibration

Sometimes referred to as node dating, node calibration is a method for time-scaling

probability density can be used to represent the uncertainty about the age of the clade. These calibration densities can take the shape of standard probability densities (e.g. normal, lognormal, exponential, gamma) that can be used to express the uncertainty associated with divergence time estimates. ^[12] Determining the shape and parameters of the probability distribution is not trivial, but there are methods that use not only the oldest fossil but a larger sample of the fossil record of clades to estimate calibration densities empirically.^[15] Studies have shown that increasing the number of fossil constraints increases the accuracy of divergence time estimation.^[16]

Tip calibration

Sometimes referred to as tip dating, tip calibration is a method of molecular clock calibration in which fossils are treated as taxa and placed on the tips of the tree. This is achieved by creating a matrix that includes a molecular dataset for the extant taxa along with a morphological dataset for both the extinct and the extant taxa.^[14] Unlike node calibration, this method reconstructs the tree topology and places the fossils simultaneously. Molecular and morphological models work together simultaneously, allowing morphology to inform the placement of fossils.^[10] Tip calibration makes use of all relevant fossil taxa during clock calibration, rather than relying on only the oldest fossil of each clade. This method does not rely on the interpretation of negative evidence to infer maximum clade ages.^[14]

Expansion calibration

Demographic changes in populations can be detected as fluctuations in historical coalescent effective population size from a sample of extant genetic variation in the population using coalescent theory.^[17]^[18]^[19] Ancient population expansions that are well documented and dated in the geological record can be used to calibrate a rate of molecular evolution in a manner similar to node calibration. However, instead of calibrating from the known age of a node, expansion calibration uses a two-epoch model of constant population size followed by population growth, with the time of transition between epochs being the parameter of interest for calibration.^[20]^[21] Expansion calibration works at shorter, intraspecific timescales in comparison to node calibration, because expansions can only be detected after the most recent common ancestor of the species in question. Expansion dating has been used to show that molecular clock rates can be inflated at short timescales^[20] (< 1 MY) due to incomplete fixation of alleles, as discussed below^[22]^[23]

Total evidence dating

This approach to tip calibration goes a step further by simultaneously estimating fossil placement, topology, and the evolutionary timescale. In this method, the age of a fossil can inform its phylogenetic position in addition to morphology. By allowing all aspects of tree reconstruction to occur simultaneously, the risk of biased results is decreased.^[10] This approach has been improved upon by pairing it with different models. One current method of molecular clock calibration is total evidence dating paired with the fossilized birth-death (FBD) model and a model of morphological evolution.^[24] The FBD model is novel in that it allows for "sampled ancestors", which are fossil taxa that are the direct ancestor of a living taxon or lineage. This allows fossils to be placed on a branch above an extant organism, rather than being confined to the tips.^[25]

Methods

Bayesian methods can provide more appropriate estimates of divergence times, especially if large datasets—such as those yielded by phylogenomics—are employed.^[26]

Non-constant rate of molecular clock

Sometimes only a single divergence date can be estimated from fossils, with all other dates inferred from that. Other sets of species have abundant fossils available, allowing the hypothesis of constant divergence rates to be tested. DNA sequences experiencing low levels of

Myr in bacteria, mammals, invertebrates, and plants.^[27]

In the same study, genomic regions experiencing very high negative or purifying selection (encoding rRNA) were considerably slower (1% per 50 Myr).

In addition to such variation in rate with genomic position, since the early 1990s variation among taxa has proven fertile ground for research too,[28] even over comparatively short periods of evolutionary time (for example mockingbirds^[29]). Tube-nosed seabirds have molecular clocks that on average run at half speed of many other birds,^[30] possibly due to long generation times, and many turtles have a molecular clock running at one-eighth the speed it does in small mammals, or even slower.^[31] Effects of small population size are also likely to confound molecular clock analyses. Researchers such as Francisco J. Ayala have more fundamentally challenged the molecular clock hypothesis.^[32]^[33]^[34] According to Ayala's 1999 study, five factors combine to limit the application of molecular clock models:

Changing generation times (If the rate of new mutations depends at least partly on the number of generations rather than the number of years)
Population size (Genetic drift is stronger in small populations, and so more mutations are effectively neutral)
Species-specific differences (due to differing metabolism, ecology, evolutionary history, ...)
Change in function of the protein studied (can be avoided in closely related species by utilizing non-coding DNA sequences or emphasizing silent mutations)
Changes in the intensity of natural selection.

Molecular clock users have developed workaround solutions using a number of statistical approaches including

maximum likelihood techniques and later Bayesian modeling. In particular, models that take into account rate variation across lineages have been proposed in order to obtain better estimates of divergence times. These models are called relaxed molecular clocks^[35] because they represent an intermediate position between the 'strict' molecular clock hypothesis and Joseph Felsenstein's many-rates model^[36] and are made possible through MCMC techniques that explore a weighted range of tree topologies and simultaneously estimate parameters of the chosen substitution model. It must be remembered that divergence dates inferred using a molecular clock are based on statistical inference and not on direct evidence

.

The molecular clock runs into particular challenges at very short and very long timescales. At long timescales, the problem is

saturation. When enough time has passed, many sites have undergone more than one change, but it is impossible to detect more than one. This means that the observed number of changes is no longer linear with time, but instead flattens out. Even at intermediate genetic distances, with phylogenetic data still sufficient to estimate topology, signal for the overall scale of the tree can be weak under complex likelihood models, leading to highly uncertain molecular clock estimates.^[37]

At very short time scales, many differences between samples do not represent

alleles that were both present as part of a polymorphism in the common ancestor. The inclusion of differences that have not yet become fixed

leads to a potentially dramatic inflation of the apparent rate of the molecular clock at very short timescales.[23]^[38]

Uses

The molecular clock technique is an important tool in

fossils, such as the divergences between living taxa

has allowed the study of macroevolutionary processes in organisms that had limited fossil records. Phylogenetic comparative methods rely heavily on calibrated phylogenies.

References

^ Zuckerkandl E, Pauling (1962). "Molecular disease, evolution, and genic heterogeneity". In Kasha, M., Pullman, B (eds.). Horizons in Biochemistry. Academic Press, New York. pp. 189–225.
PMID 14077496
.

S2CID 14261833
.

PMID 4962458
.

S2CID 7349579
.

S2CID 4161261
.

ISSN 1471-0064
.

PMID 2068073
.

PMID 17047029
.

^
PMID 27325838
.

S2CID 17647010
.

^
PMID 22367748
.

PMID 12538260
.

^
PMID 26439502
.

S2CID 252353611
.

S2CID 3895351
.

PMID 1316531
.

S2CID 27134675
.

PMID 21753753
.

^
PMID 21926069
.

PMID 26683588
.

PMID 26333662
.

^
PMID 15814826
.

PMID 25009181
.

PMID 28173531
.

PMID 22628470
.

S2CID 8260277
.

S2CID 23887665
.

S2CID 51797284
.

S2CID 20390465
.

PMID 1584014
.

PMID 10070256. Archived from the original
on 16 December 2012.

S2CID 28166727
.
"No Missing Link? Evolutionary Changes Occur Suddenly, Professor Says". ScienceDaily (Press release). 12 February 2007.

PMID 31111152
.

PMID 16683862
.

S2CID 9791493
.

^ Marshall, D. C., et al. 2016. Inflation of molecular clock rates and dates: molecular phylogenetics, biogeography, and diversification of a global cicada radiation from Australasia (Hemiptera: Cicadidae: Cicadettini). Systematic Biology 65(1):16–34.

PMID 19661199
.

Further reading

Ho, S.Y.W., ed. (2020). The Molecular Evolutionary Clock: Theory and Practice. Springer, Cham.
S2CID 231672167
.

Kumar S (August 2005). "Molecular clocks: four decades of evolution". Nature Reviews. Genetics. 6 (8): 654–662.
S2CID 14261833
.

Morgan GJ (1998). "Emile Zuckerkandl, Linus Pauling, and the molecular evolutionary clock, 1959-1965". Journal of the History of Biology. 31 (2): 155–178.
S2CID 5660841
.

Zuckerkandl E, Pauling LB (1965). "Evolutionary divergence and convergence in proteins". In Bryson V, Vogel HJ (eds.). Evolving Genes and Proteins. Academic Press, New York. pp. 97–166.

External links

Allan Wilson and the molecular clock

Molecular clock explanation of the molecular equidistance phenomenon

Date-a-Clade service for the molecular tree of life

v
t
e
Chronology
Key topics

Archaeology

Astronomy

Geology

History
Big History

Paleontology

Time

Epochs
Calendar eras

Human Era

Ab urbe condita

Anno Domini / Common Era

Anno Mundi

Bosporan era

Bostran era

Byzantine era

Seleucid era

Era of Caesar (Iberia)

Before present

Hijri

Egyptian

Sothic cycle

Hindu units of time (Yuga)

Mesoamerican
Long Count

Short Count

Tzolk'in

Haab'

Regnal year

Anka year

Canon of Kings

English and British regnal year

Lists of kings

Limmu

Era names

Chinese

Japanese

Korean

Vietnamese

Calendars
Pre-Julian / Julian

Pre-Julian Roman

Original Julian

Proleptic Julian

Revised Julian

Gregorian

Gregorian

Proleptic Gregorian

Old Style and New Style dates

Adoption of the Gregorian calendar

Dual dating

Astronomical

Lunisolar (Hebrew, Hindu)

Solar

Lunar (Islamic)

Astronomical year numbering

Others

Chinese sexagenary cycle

Geologic Calendar

Iranian

ISO week date

Mesoamerican
Maya

Aztec

Winter count

New Earth Time

Astronomic time

Cosmic Calendar

Ephemeris

Galactic year

Metonic cycle

Milankovitch cycles

Geologic time
Concepts

Deep time

Geological history of Earth

Geological time units

Standards

Global Standard Stratigraphic Age (GSSA)

Global Boundary Stratotype Section and Point (GSSP)

Methods

Chronostratigraphy

Geochronology

Isotope geochemistry

Law of superposition

Luminescence dating

Samarium–neodymium dating

Chronological
dating
Absolute dating

Amino acid racemisation

Archaeomagnetic dating

Dendrochronology

Ice core

Incremental dating

Lichenometry

Paleomagnetism

Radiometric dating
Lead–lead

Potassium–argon

Radiocarbon

Uranium–lead

Tephrochronology

Luminescence dating

Thermoluminescence dating

Relative dating

Fluorine absorption

Nitrogen dating

Obsidian hydration

Seriation

Stratigraphy

Genetic methods

Molecular clock

Linguistic methods

Glottochronology

Related topics

Chronicle

New Chronology

Synchronoptic view

Timeline

Year zero

Floruit

Terminus post quem

ASPRO chronology

Retrieved from "https://en.wikipedia.org/w/index.php?title=Molecular_clock&oldid=1220572364"

[Zuckerkand62-1] Zuckerkandl E, Pauling (1962). "Molecular disease, evolution, and genic heterogeneity". In Kasha, M., Pullman, B (eds.). Horizons in Biochemistry. Academic Press, New York. pp. 189–225.

[2] PMID 14077496
.

[3] S2CID 14261833
.

[4] PMID 4962458
.

[5] S2CID 7349579
.

[Kimura68-6] S2CID 4161261
.

[7] ISSN 1471-0064
.

[8] PMID 2068073
.

[Benton01-9] PMID 17047029
.

[Donoghue02-10] 
PMID 27325838
.

[11] S2CID 17647010
.

[ReferenceA-12] 
PMID 22367748
.

[13] PMID 12538260
.

[O'Reilly03-14] 
PMID 26439502
.

[Claramunt2022-15] S2CID 252353611
.

[Zheng04-16] S2CID 3895351
.

[17] PMID 1316531
.

[18] S2CID 27134675
.

[19] PMID 21753753
.

[:0-20] 
PMID 21926069
.

[21] PMID 26683588
.

[22] PMID 26333662
.

[:1-23] 
PMID 15814826
.

[Heath05-24] PMID 25009181
.

[Gavryushkina06-25] PMID 28173531
.

[Dos_Reis2012-26] PMID 22628470
.

[Ochman87-27] S2CID 8260277
.

[Douzery03-28] S2CID 23887665
.

[Hunt01-29] S2CID 51797284
.

[Rheindt05-30] S2CID 20390465
.

[Avise92-31] PMID 1584014
.

[Ayala99-32] PMID 10070256. Archived from the original
on 16 December 2012.

[Schwartz06-33] S2CID 28166727
.
"No Missing Link? Evolutionary Changes Occur Suddenly, Professor Says". ScienceDaily (Press release). 12 February 2007.

[34] "No Missing Link? Evolutionary Changes Occur Suddenly, Professor Says". ScienceDaily (Press release). 12 February 2007.

[Pascual2019-34] PMID 31111152
.

[Drummond06-35] PMID 16683862
.

[Felsenstein01-36] S2CID 9791493
.

[37] Marshall, D. C., et al. 2016. Inflation of molecular clock rates and dates: molecular phylogenetics, biogeography, and diversification of a global cicada radiation from Australasia (Hemiptera: Cicadidae: Cicadettini). Systematic Biology 65(1):16–34.

[38] PMID 19661199
.

[1]

[2]

[3]

[5]

[6]

[8]

[9]

[10]

[12]

[15]

[16]

[14]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]