Paleogenomics

Source: Wikipedia, the free encyclopedia.

Paleogenomics is a field of science based on the reconstruction and analysis of genomic information in extinct species. Improved methods for the extraction of ancient DNA (aDNA) from museum artifacts, ice cores, and archeological or paleontological sites, together with next-generation sequencing technologies, have spurred this field. It is now possible to detect genetic drift, ancient population migrations and interrelationships, and the evolutionary history of extinct plant, animal and Homo species, and to identify phenotypic features across geographic regions. Scientists can also use paleogenomics to compare ancient ancestors against modern-day humans.[1] The rising importance of paleogenomics is evident from the fact that the 2022 Nobel Prize in Physiology or Medicine was awarded to the Swedish geneticist Svante Pääbo (born 1955), who worked on paleogenomics.

Background

Initially, aDNA sequencing involved cloning small fragments into bacteria, which proceeded with low efficiency due to the oxidative damage the aDNA suffered over millennia.

Next-generation sequencing (NGS) techniques made far more possible. Moreover, this technological revolution allowed the transition from paleogenetics to paleogenomics.[1]

Sequencing methods

Challenges and techniques

Second-generation (NGS) sequencing platforms and various library preparation methods are available for sequencing aDNA, along with many bioinformatics tools. With each of these methods, it is important to consider that aDNA can be altered post-mortem.[2]
Specific alterations arise from:

Specific patterns and onset of these alterations help scientists to estimate the sample's age.


Formerly, scientists diagnosed post-mortem damages using enzymatic reactions or

single strand breaks in the double helix of DNA and abasic sites (created by C->T mutation).
A single fragment of aDNA can be sequenced over its full length with high-throughput sequencing (HTS). From these data one can build a distribution representing a size decay curve, which enables a direct quantitative comparison of fragmentation across specimens from different locations and environmental conditions. From the decay curve it is possible to obtain the median fragment length of the aDNA. This length reflects the level of fragmentation after death, which generally increases with depositional temperature.[4]
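The size decay curve described above can be illustrated with a minimal Python sketch. It assumes that fragment lengths (in base pairs) have already been obtained from full-length reads, and that the declining part of the distribution is roughly exponential, so that a straight line can be fitted to the logarithm of the counts. The function name `size_decay_summary` and the input data are hypothetical and not taken from any published pipeline.

```python
import math
from collections import Counter
from statistics import median


def size_decay_summary(lengths):
    """Return (median_length, decay_rate) for a list of fragment lengths in bp.

    decay_rate is the negated least-squares slope of log(count) against
    length, fitted from the most frequent length upward. This assumes an
    exponential decline; it is None if fewer than two distinct lengths
    are available for the fit.
    """
    if not lengths:
        raise ValueError("no fragment lengths given")
    counts = Counter(lengths)
    med = median(lengths)
    # Most frequent length: the fit only uses the declining tail from here on.
    mode = max(counts, key=counts.get)
    xs = [length for length in sorted(counts) if length >= mode]
    ys = [math.log(counts[length]) for length in xs]
    if len(xs) < 2:
        return med, None
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return med, -sxy / sxx


# Hypothetical data: counts halve with every extra base pair from 30 bp up.
example = []
for i in range(6):
    example += [30 + i] * (2 ** (10 - i))
median_length, decay_rate = size_decay_summary(example)
```

A larger decay rate or a shorter median length would indicate stronger fragmentation, which is the quantity compared across specimens.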

Libraries

Two types of library can be prepared for aDNA sequencing, both using PCR for amplification:

  • Double-stranded aDNA library (dsDNA library)
  • Single-stranded aDNA library (ssDNA library)

The first one is created using the blunt-end approach. This technique uses two different adaptors: these adaptors ligate randomly to the ends of the fragment, which can then be amplified. Fragments that do not carry both adaptors cannot be amplified, which is a source of error. To reduce this error, Illumina T/A ligation was introduced: this method adds an A tail to the DNA sample to facilitate the ligation of T-tailed adaptors. This method optimizes the amplification of the aDNA.
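The loss caused by random adaptor ligation in the blunt-end approach can be estimated with a small Python sketch. It assumes, purely for illustration, that each fragment end receives one of the two adaptors independently and that the first adaptor is chosen with probability `p_first`; real ligation efficiencies are not given in the text and may differ. Both function names are hypothetical.

```python
import random


def amplifiable_fraction(p_first=0.5):
    """Expected fraction of fragments that carry one of each adaptor.

    Assumes each end independently receives the first adaptor with
    probability p_first and the second adaptor otherwise.
    """
    if not 0.0 <= p_first <= 1.0:
        raise ValueError("p_first must be between 0 and 1")
    return 2.0 * p_first * (1.0 - p_first)


def simulate_ligation(n_fragments, p_first=0.5, seed=0):
    """Simulate random ligation and return the amplifiable fraction."""
    rng = random.Random(seed)
    usable = 0
    for _ in range(n_fragments):
        left_is_first = rng.random() < p_first
        right_is_first = rng.random() < p_first
        # Only fragments with two different adaptors can be amplified.
        if left_is_first != right_is_first:
            usable += 1
    return usable / n_fragments


expected = amplifiable_fraction(0.5)
simulated = simulate_ligation(20000)
```

Under these assumptions about half of the fragments end up with the same adaptor on both ends and are lost.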

To obtain ssDNA libraries, DNA is first denatured with heat. The obtained ssDNA is then ligated to two adaptors in order to generate the complementary strand and finally PCR is applied.[4]

aDNA Enrichment

As aDNA extracts may contain DNA from bacteria or other microorganisms, the process requires enrichment. To separate the endogenous and exogenous fractions, various methods are employed:

  • Damaged template enrichment: used when constructing an ssDNA library, because this method targets DNA damage. After Bst polymerase fills the nick, the sample is treated with uracil DNA glycosylase and endonuclease VIII. These enzymes attack the abasic site. The undamaged DNA remains attached to streptavidin-coated paramagnetic beads and can be separated from the sample. This method is specific for samples from late Pleistocene Neanderthals.[5]
  • Extension-free target enrichment in solution: this method is based on target-probe hybridization. It requires DNA denaturation, after which overlapping tiled probes are placed along the target regions. PCR is then used for DNA amplification and finally the DNA is linked to a biotinylated adaptor. It is useful for samples of archaic hominin ancestry.
  • Solid-phase target enrichment: in this method real-time PCR methods are used in parallel with shotgun sequencing screening.
  • Whole-genome enrichment: used for sequencing the entire genome of single individuals. Whole-genome In-Solution Capture (WISC) is used.[6] This method starts with the preparation of a genome-wide RNA probe library from a species with a genome that is closely related to the target genome in the DNA sample.[4]
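How well an enrichment step separates the endogenous from the exogenous fraction is often summarized as the share of reads that map to the target genome before and after enrichment. The following Python sketch shows that arithmetic only; the read counts are invented, the function names are hypothetical, and treating mapped reads as endogenous is a simplifying assumption.

```python
def endogenous_fraction(mapped_reads, total_reads):
    """Share of sequenced reads that map to the target genome."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    if not 0 <= mapped_reads <= total_reads:
        raise ValueError("mapped_reads must lie between 0 and total_reads")
    return mapped_reads / total_reads


def fold_enrichment(fraction_before, fraction_after):
    """Ratio of the endogenous fraction after enrichment to the one before."""
    if fraction_before <= 0:
        raise ValueError("fraction_before must be positive")
    return fraction_after / fraction_before


# Invented example: 1% endogenous reads before capture, 20% afterwards.
before = endogenous_fraction(10_000, 1_000_000)
after = endogenous_fraction(200_000, 1_000_000)
gain = fold_enrichment(before, after)
```

In this invented example the endogenous share rises about twentyfold.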

Diversification of present-day non-African populations and anatomically modern humans

By now many studies in different fields have led to the conclusion that present-day non-African population is the result of the diversification in several different

modern humans and archaic humans, such as Neanderthal and Denisovan populations, supporting the “leaky replacement” model of Eurasian human population history. According to all these data, the divergence of the non-African human lineages occurred around 45,000–55,000 BP.[7] Besides that, in many cases ancient DNA has made it possible to track the historical processes that led, over time, to the present-day population genetic structure, which would have been difficult to do relying only on the analysis of present-day genomes. Among these still unresolved questions, some of the most studied are the identity of the first inhabitants of the Americas, the peopling of Europe and the origin of agriculture in Europe.[1]

Phenotypic variation in humans

Analysis of

biological adaptation
.

Skin colour

Migration of humans out of

selective pressure on the skin colour trait, favouring lighter skin colour at higher latitudes. The two most important genes involved in skin pigmentation are SLC24A5 and SLC45A2. Nowadays the “light skin” alleles of these genes are fixed in Europe, but they reached a relatively high frequency only fairly recently (about 5000 years ago).[7] Such a slow depigmentation process suggests that ancient Europeans could have faced the downsides of low vitamin D production, such as musculoskeletal and cardiovascular conditions. Another hypothesis is that pre-agricultural Europeans could have met their vitamin D requirements through their diet (since meat and fish contain some vitamin D).[8]

Adaptation to agricultural diet

One of the major examples of adaptation following the switch to an agricultural diet is the persistence of production of the lactase enzyme in adulthood. This enzyme is essential to digest the lactose present in milk and dairy products, and its absence leads to diarrhea following the consumption of these products. Lactase persistence is determined predominantly by a single-base mutation in the MCM6 gene, and ancient DNA data show that this mutation became common only within the past 5000 years, thousands of years after the beginning of dairying practices.[7] Thus, even in the case of lactase persistence there is a long delay between the onset of a new habit and the spread of the adaptive allele, so milk consumption may have been restricted to children or to lactose-reduced products.

Another example of mutation positively selected by the switch to agriculture is the number of AMY1 gene copies. AMY1 encodes for the starch-digesting enzyme

chimpanzees.[8]

The immune system

The human

selective pressure on different immune-associated genes. Migrations, for example, exposed humans to new habitats carrying new pathogens or pathogen vectors (e.g. mosquitos). The switch to agriculture also involved exposure to different pathogens and health conditions, due both to the increased population density and to living close to livestock. However, it is difficult to directly correlate particular ancient genome changes with improved resistance to particular pathogens, given the vastness and complexity of the human immune system. Besides directly studying changes in the human immune system, it is also possible to study the ancient genomes of pathogens, such as those causing
This suggests that in the ancient past plague may have been less virulent compared to more recent Y. pestis outbreaks.

A study of

inflammatory disease risk in post-Neolithic Europeans over the last 10,000 years, estimating nature, strength, and time of onset of selections due to pathogens.[10]

Plants and animals

Many non-hominin

seeds, pollen and wood. A correlation has been identified between ancient and extant barley. Another application was the detection of the domestication and adaptation processes of maize, which involved genes for drought tolerance and sugar content.[1]

Challenges and future perspectives

The analysis of ancient genomes of anatomically modern humans has, in recent years, completely revolutionized our way of studying population migrations, transformation and evolution. Nevertheless, much still remains unknown. The first and most obvious problem with this kind of approach, which will be partially overcome by the continuous improvement of ancient DNA extraction techniques, is the difficulty of recovering well-preserved ancient genomes. This challenge is particularly acute in Africa and Asia, where temperatures are higher than in colder regions of the world. Further, Africa is, among all the continents, the one that harbors the most

Bioethics

populations. In addition, paleogenomic studies have the potential to harm community or individual histories and identities, as well as to reveal compromising information about their descendants. For these reasons, this kind of study is still a sensitive subject. Paleogenomic studies can have negative consequences mainly because of the discrepancies between articulations of ethical principles and actual practices. In fact, ancestors’ remains are usually considered legally and scientifically as “artifacts” rather than “human subjects”, which is used to justify questionable behaviors and a lack of engagement from communities. Testing of ancestral remains is therefore used in disputes, treaty claims, repatriation, or other legal cases. Acknowledgement of the importance and sensitivity of this subject is leading towards ethical commitment and guidance applicable to different contexts, in order to preserve the dignity of ancestral remains and avoid ethical issues.
CRISPR/Cas9 technology is, however, strongly connected to many ethical issues.[1]

References

  1. ^ a b c d e f Lan T. and Lindqvist C. 2018. Paleogenomics: Genome-Scale Analysis of Ancient DNA and Population and Evolutionary Genomic Inferences. In: Population Genomics, Springer, Cham. pp 1-38.
  2. ^ PMID 2928314.
  3. .
  4. ^ a b c Orlando L., Gilbert MT., Willerslev E. 2015. Reconstructing ancient genomes and epigenomes. Nat. Rev. Genet. 16(7):395-408.
  5. PMID 25081630.
  6. .
  7. ^ a b c d e f Skoglund P. and Mathieson I. 2018. Ancient genomics of modern humans: the first decade. Annu. Rev. Genom. Hum. Genet. 19:1, 381-404.
  8. ^ a b c Marciniak S., Perry G. H. Harnessing ancient genomes to study the history of human adaptation. Nature Reviews Genetics volume 18, pages 659–674 (2017)
  9. S2CID 15705508.
  10. .
  11. ^ Bardill J., Bader A. C., Garrison N. A., Bolnick D. A., Raff J. A., Walker A., Malhi R. S., and the Summer Internship for INdigenous peoples in Genomics (SING) Consortium. Advancing the ethics of paleogenomics: Ancestral remains should not be regarded as "artifacts" but as human relatives who deserve respect.