DNA methylation
DNA methylation is a biological process by which
As of 2016, two nucleobases have been found on which natural, enzymatic DNA methylation takes place:
Unmodified base | ||||||||
Adenine, A | Cytosine, C | |||||||
Modified forms | ||||||||
N6-Methyladenine, 6mA | 5-Methylcytosine, 5mC | N4-Methylcytosine, 4mC | ||||||
Cytosine methylation is widespread in both
but has received considerably less attention.Methylation of cytosine to form
In plants and other organisms, DNA methylation is found in three different sequence contexts: CG (or
Conserved function of DNA methylation
The DNA methylation landscape of vertebrates is very particular compared to other organisms. In mammals, around 75% of CpG dinucleotides are methylated in somatic cells,[19] and DNA methylation appears as a default state that has to be specifically excluded from defined locations.[16][20] By contrast, the genome of most plants, invertebrates, fungi, or protists show "mosaic" methylation patterns, where only specific genomic elements are targeted, and they are characterized by the alternation of methylated and unmethylated domains.[21][22]
High CpG methylation in mammalian genomes has an evolutionary cost because it increases the frequency of spontaneous mutations. Loss of amino-groups occurs with a high frequency for cytosines, with different consequences depending on their methylation. Methylated C residues spontaneously deaminate to form T residues over time; hence CpG dinucleotides steadily deaminate to TpG dinucleotides, which is evidenced by the under-representation of CpG dinucleotides in the human genome (they occur at only 21% of the expected frequency).[23] (On the other hand, spontaneous deamination of unmethylated C residues gives rise to U residues, a change that is quickly recognized and repaired by the cell.)
CpG islands
In mammals, the only exception for this global CpG depletion resides in a specific category of GC- and CpG-rich sequences termed CpG islands that are generally unmethylated and therefore retained the expected CpG content.
Repression of CpG-dense promoters
DNA methylation was probably present at some extent in very early eukaryote ancestors. In virtually every organism analyzed, methylation in promoter regions correlates negatively with gene expression.[21][28] CpG-dense promoters of actively transcribed genes are never methylated, but, reciprocally, transcriptionally silent genes do not necessarily carry a methylated promoter. In mouse and human, around 60–70% of genes have a CpG island in their promoter region and most of these CpG islands remain unmethylated independently of the transcriptional activity of the gene, in both differentiated and undifferentiated cell types.[29][30] Of note, whereas DNA methylation of CpG islands is unambiguously linked with transcriptional repression, the function of DNA methylation in CG-poor promoters remains unclear; albeit there is little evidence that it could be functionally relevant.[31]
DNA methylation may affect the transcription of genes in two ways. First, the methylation of DNA itself may physically impede the binding of
Repression of transposable elements
DNA methylation is a powerful transcriptional repressor, at least in CpG dense contexts. Transcriptional repression of protein-coding genes appears essentially limited to very specific classes of genes that need to be silent permanently and in almost all tissues. While DNA methylation does not have the flexibility required for the fine-tuning of gene regulation, its stability is perfect to ensure the permanent silencing of transposable elements.[33] Transposon control is one of the most ancient functions of DNA methylation that is shared by animals, plants and multiple protists.[34] It is even suggested that DNA methylation evolved precisely for this purpose.[35]
Genome expansion
DNA methylation of transposable elements has been known to be related to genome expansion. However, the evolutionary driver for genome expansion remains unknown. There is a clear correlation between the size of the genome and CpG, suggesting that the DNA methylation of transposable elements led to a noticeable increase in the mass of DNA.[36]
Methylation of the gene body of highly transcribed genes
A function that appears even more conserved than transposon silencing is positively correlated with gene expression. In almost all species where DNA methylation is present, DNA methylation is especially enriched in the body of highly transcribed genes.
In mammals
During embryonic development
DNA methylation patterns are largely erased and then re-established between generations in mammals. Almost all of the methylations from the parents are erased, first during
Whereas DNA methylation is not necessary per se for transcriptional silencing, it is thought nonetheless to represent a "locked" state that definitely inactivates transcription. In particular, DNA methylation appears critical for the maintenance of mono-allelic silencing in the context of
Due to the phenomenon of
In cancer
In many disease processes, such as
Global hypomethylation has also been implicated in the development and progression of cancer through different mechanisms.[48] Typically, there is hypermethylation of tumor suppressor genes and hypomethylation of oncogenes.[49]
Generally, in progression to cancer, hundreds of genes are silenced or activated. Although silencing of some genes in cancers occurs by mutation, a large proportion of carcinogenic gene silencing is a result of altered DNA methylation (see DNA methylation in cancer). DNA methylation causing silencing in cancer typically occurs at multiple CpG sites in the CpG islands that are present in the promoters of protein coding genes.[citation needed]
Altered expressions of microRNAs also silence or activate many genes in progression to cancer (see microRNAs in cancer). Altered microRNA expression occurs through hyper/hypo-methylation of CpG sites in CpG islands in promoters controlling transcription of the microRNAs.
Silencing of DNA repair genes through methylation of CpG islands in their promoters appears to be especially important in progression to cancer (see methylation of DNA repair genes in cancer).
In atherosclerosis
Epigenetic modifications such as DNA methylation have been implicated in cardiovascular disease, including atherosclerosis. In animal models of atherosclerosis, vascular tissue, as well as blood cells such as mononuclear blood cells, exhibit global hypomethylation with gene-specific areas of hypermethylation. DNA methylation polymorphisms may be used as an early biomarker of atherosclerosis since they are present before lesions are observed, which may provide an early tool for detection and risk prevention.[50]
Two of the cell types targeted for DNA methylation polymorphisms are monocytes and lymphocytes, which experience an overall hypomethylation. One proposed mechanism behind this global hypomethylation is elevated homocysteine levels causing hyperhomocysteinemia, a known risk factor for cardiovascular disease. High plasma levels of homocysteine inhibit DNA methyltransferases, which causes hypomethylation. Hypomethylation of DNA affects genes that alter smooth muscle cell proliferation, cause endothelial cell dysfunction, and increase inflammatory mediators, all of which are critical in forming atherosclerotic lesions.[51] High levels of homocysteine also result in hypermethylation of CpG islands in the promoter region of the estrogen receptor alpha (ERα) gene, causing its down regulation.[52] ERα protects against atherosclerosis due to its action as a growth suppressor, causing the smooth muscle cells to remain in a quiescent state.[53] Hypermethylation of the ERα promoter thus allows intimal smooth muscle cells to proliferate excessively and contribute to the development of the atherosclerotic lesion.[54]
Another gene that experiences a change in methylation status in atherosclerosis is the monocarboxylate transporter (MCT3), which produces a protein responsible for the transport of lactate and other ketone bodies out of many cell types, including vascular smooth muscle cells. In atherosclerosis patients, there is an increase in methylation of the CpG islands in exon 2, which decreases MCT3 protein expression. The downregulation of MCT3 impairs lactate transport and significantly increases smooth muscle cell proliferation, which further contributes to the atherosclerotic lesion. An ex vivo experiment using the demethylating agent Decitabine (5-aza-2 -deoxycytidine) was shown to induce MCT3 expression in a dose dependent manner, as all hypermethylated sites in the exon 2 CpG island became demethylated after treatment. This may serve as a novel therapeutic agent to treat atherosclerosis, although no human studies have been conducted thus far.[55]
In heart failure
In addition to atherosclerosis described above, specific epigenetic changes have been identified in the failing human heart. This may vary by disease etiology. For example, in ischemic heart failure DNA methylation changes have been linked to changes in gene expression that may direct gene expression associated with the changes in heart metabolism known to occur.[56] Additional forms of heart failure (e.g. diabetic cardiomyopathy) and co-morbidities (e.g. obesity) must be explored to see how common these mechanisms are. Most strikingly, in failing human heart these changes in DNA methylation are associated with racial and socioeconomic status which further impact how gene expression is altered,[57] and may influence how the individual's heart failure should be treated.
In aging
In humans and other mammals, DNA methylation levels can be used to accurately estimate the age of tissues and cell types, forming an accurate epigenetic clock.[58]
A longitudinal study of twin children showed that, between the ages of 5 and 10, there was divergence of methylation patterns due to environmental rather than genetic influences.[59] There is a global loss of DNA methylation during aging.[49]
In a study that analyzed the complete DNA methylomes of CD4+ T cells in a newborn, a 26 years old individual and a 103 years old individual were observed that the loss of methylation is proportional to age.[60] Hypomethylated CpGs observed in the centenarian DNAs compared with the neonates covered all genomic compartments (promoters, intergenic, intronic and exonic regions).[60] However, some genes become hypermethylated with age, including genes for the estrogen receptor, p16, and insulin-like growth factor 2.[49]
In exercise
High intensity exercise has been shown to result in reduced DNA methylation in skeletal muscle.[61] Promoter methylation of PGC-1α and PDK4 were immediately reduced after high intensity exercise, whereas PPAR-γ methylation was not reduced until three hours after exercise.[61] At the same time, six months of exercise in previously sedentary middle-age men resulted in increased methylation in adipose tissue.[62] One study showed a possible increase in global genomic DNA methylation of white blood cells with more physical activity in non-Hispanics.[63]
In B-cell differentiation
A study that investigated the methylome of B cells along their differentiation cycle, using whole-genome bisulfite sequencing (WGBS), showed that there is a hypomethylation from the earliest stages to the most differentiated stages. The largest methylation difference is between the stages of germinal center B cells and memory B cells. Furthermore, this study showed that there is a similarity between B cell tumors and long-lived B cells in their DNA methylation signatures.[18]
In the brain
Two reviews summarize evidence that DNA methylation alterations in brain neurons are important in learning and memory.[64][65] Contextual fear conditioning (a form of associative learning) in animals, such as mice and rats, is rapid and is extremely robust in creating memories.[66] In mice[67] and in rats[68] contextual fear conditioning, within 1–24 hours, it is associated with altered methylations of several thousand DNA cytosines in genes of hippocampus neurons. Twenty four hours after contextual fear conditioning, 9.2% of the genes in rat hippocampus neurons are differentially methylated.[68] In mice,[67] when examined at four weeks after conditioning, the hippocampus methylations and demethylations had been reset to the original naive conditions. The hippocampus is needed to form memories, but memories are not stored there. For such mice, at four weeks after contextual fear conditioning, substantial differential CpG methylations and demethylations occurred in cortical neurons during memory maintenance, and there were 1,223 differentially methylated genes in their anterior cingulate cortex.[67] Mechanisms guiding new DNA methylations and new DNA demethylations in the hippocampus during memory establishment were summarized in 2022.[69] That review also indicated the mechanisms by which the new patterns of methylation gave rise to new patterns of messenger RNA expression. These new messenger RNAs were then transported by messenger RNP particles (neuronal granules) to synapses of the neurons, where they could be translated into proteins.[69] Active changes in neuronal DNA methylation and demethylation appear to act as controllers of synaptic scaling and glutamate receptor trafficking in learning and memory formation.[64]
DNA methyltransferases (in mammals)
In mammalian cells, DNA methylation occurs mainly at the C5 position of CpG dinucleotides and is carried out by two general classes of enzymatic activities – maintenance methylation and de novo methylation.[70]
Maintenance methylation activity is necessary to preserve DNA methylation after every cellular DNA replication cycle. Without the DNA methyltransferase (DNMT), the replication machinery itself would produce daughter strands that are unmethylated and, over time, would lead to passive demethylation. DNMT1 is the proposed maintenance methyltransferase that is responsible for copying DNA methylation patterns to the daughter strands during DNA replication. Mouse models with both copies of DNMT1 deleted are embryonic lethal at approximately day 9, due to the requirement of DNMT1 activity for development in mammalian cells.[citation needed]
It is thought that DNMT3a and DNMT3b are the de novo methyltransferases that set up DNA methylation patterns early in development. DNMT3L is a protein that is homologous to the other DNMT3s but has no catalytic activity. Instead, DNMT3L assists the de novo methyltransferases by increasing their ability to bind to DNA and stimulating their activity. Mice and rats have a third functional de novo methyltransferase enzyme named DNMT3C, which evolved as a paralog of Dnmt3b by tandem duplication in the common ancestral of Muroidea rodents. DNMT3C catalyzes the methylation of promoters of transposable elements during early spermatogenesis, an activity shown to be essential for their epigenetic repression and male fertility.[71][72] It is yet unclear if in other mammals that do not have DNMT3C (like humans) rely on DNMT3B or DNMT3A for de novo methylation of transposable elements in the germline. Finally, DNMT2 (TRDMT1) has been identified as a DNA methyltransferase homolog, containing all 10 sequence motifs common to all DNA methyltransferases; however, DNMT2 (TRDMT1) does not methylate DNA but instead methylates cytosine-38 in the anticodon loop of aspartic acid transfer RNA.[73]
Since many tumor suppressor genes are silenced by DNA methylation during
In plants
Significant progress has been made in understanding DNA methylation in the model plant Arabidopsis thaliana. DNA methylation in plants differs from that of mammals: while DNA methylation in mammals mainly occurs on the cytosine nucleotide in a CpG site, in plants the cytosine can be methylated at CpG, CpHpG, and CpHpH sites, where H represents any nucleotide but not guanine.[74] Overall, Arabidopsis DNA is highly methylated, mass spectrometry analysis estimated 14% of cytosines to be modified.[9]: abstract Later, bisulfite sequencing data estimated that around 25% of Arabidopsis CG sites are methylated, but these levels vary based on the geographic location of Arabidopsis accessions (plants in the north are more highly methylated than southern accessions).[75]
The principal Arabidopsis DNA methyltransferase enzymes, which transfer and covalently attach methyl groups onto DNA, are DRM2, MET1, and CMT3. Both the DRM2 and MET1 proteins share significant homology to the mammalian methyltransferases DNMT3 and DNMT1, respectively, whereas the CMT3 protein is unique to the plant kingdom. There are currently two classes of DNA methyltransferases: 1) the de novo class or enzymes that create new methylation marks on the DNA; 2) a maintenance class that recognizes the methylation marks on the parental strand of DNA and transfers new methylation to the daughter strands after DNA replication. DRM2 is the only enzyme that has been implicated as a de novo DNA methyltransferase. DRM2 has also been shown, along with MET1 and CMT3 to be involved in maintaining methylation marks through DNA replication.[76] Other DNA methyltransferases are expressed in plants but have no known function (see the Chromatin Database).
Genome-wide levels of DNA methylation vary widely between plant species, and Arabidopsis cytosines tend to be less densely methylated than those in other plants. For example, ~92.5% of CpG cytosines are methylated in Beta vulgaris.[77] The patterns of methylation also differ between cytosine sequence contexts; universally, CpG methylation is higher than CHG and CHH methylation, and CpG methylation can be found in both active genes and transposable elements, while CHG and CHH are usually relegated to silenced transposable elements.[78][74]
It is not clear how the cell determines the locations of de novo DNA methylation, but evidence suggests that for many (though not all) locations,
In insects
Diverse orders of insects show varied patterns of DNA methylation, from almost undetectable levels in flies to low levels in butterflies and higher in true bugs and some cockroaches (up to 14% of all CG sites in Blattella asahinai).[80]
Functional DNA methylation has been discovered in Honey Bees.[81][82] DNA methylation marks are mainly on the gene body, and current opinions on the function of DNA methylation is gene regulation via alternative splicing[83]
DNA methylation levels in Drosophila melanogaster are nearly undetectable.[84] Sensitive methods applied to Drosophila DNA Suggest levels in the range of 0.1–0.3% of total cytosine.[85] This low level of methylation[86] appears to reside in genomic sequence patterns that are very different from patterns seen in humans, or in other animal or plant species to date. Genomic methylation in D. melanogaster was found at specific short motifs (concentrated in specific 5-base sequence motifs that are CA- and CT-rich but depleted of guanine) and is independent of DNMT2 activity. Further, highly sensitive mass spectrometry approaches,[87] have now demonstrated the presence of low (0.07%) but significant levels of adenine methylation during the earliest stages of Drosophila embryogenesis.
In fungi
Many
Although brewers' yeast (
In other eukaryotes
DNA methylation is largely absent from Dictyostelium discoidium[92] where it appears to occur at about 0.006% of cytosines.[6] In contrast, DNA methylation is widely distributed in Physarum polycephalum[93] where 5-methylcytosine makes up as much as 8% of total cytosine[4]
In bacteria
One region of the DNA that keeps its hemimethylated status for longer is the origin of replication, which has an abundance of GATC sites. This is central to the bacterial mechanism for timing DNA replication. SeqA binds to the origin of replication, sequestering it and thus preventing methylation. Because hemimethylated origins of replication are inactive, this mechanism limits DNA replication to once per cell cycle.
Expression of certain genes, for example, those coding for pilus expression in E. coli, is regulated by the methylation of GATC sites in the promoter region of the gene operon. The cells' environmental conditions just after DNA replication determine whether Dam is blocked from methylating a region proximal to or distal from the promoter region. Once the pattern of methylation has been created, the pilus gene transcription is locked in the on or off position until the DNA is again replicated. In E. coli, these pili operons have important roles in virulence in urinary tract infections. It has been proposed[by whom?] that inhibitors of Dam may function as antibiotics.
On the other hand, DNA cytosine methylase targets CCAGG and CCTGG sites to methylate cytosine at the C5 position (C meC(A/T) GG). The other methylase enzyme, EcoKI, causes methylation of adenines in the sequences AAC(N6)GTGC and GCAC(N6)GTT.
In Clostridioides difficile, DNA methylation at the target motif CAAAAA was shown to impact
Molecular cloning
Most strains used by molecular biologists are derivatives of E. coli K-12, and possess both Dam and Dcm, but there are commercially available strains that are dam-/dcm- (lack of activity of either methylase). In fact, it is possible to unmethylate the DNA extracted from dam+/dcm+ strains by transforming it into dam-/dcm- strains. This would help digest sequences that are not being recognized by methylation-sensitive restriction enzymes.[97][98]
The restriction enzyme DpnI can recognize 5'-GmeATC-3' sites and digest the methylated DNA. Being such a short motif, it occurs frequently in sequences by chance, and as such its primary use for researchers is to degrade template DNA following PCRs (PCR products lack methylation, as no methylases are present in the reaction). Similarly, some commercially available restriction enzymes are sensitive to methylation at their cognate restriction sites and must as mentioned previously be used on DNA passed through a dam-/dcm- strain to allow cutting.[citation needed]
Detection
DNA methylation can be detected by the following assays currently used in scientific research:[99]
- Mass spectrometry is a very sensitive and reliable analytical method to detect DNA methylation. MS, in general, is however not informative about the sequence context of the methylation, thus limited in studying the function of this DNA modification.
- Methylation-Specific PCR (MSP), which is based on a chemical reaction of sodium bisulfite with DNA that converts unmethylated cytosines of CpG dinucleotides to uracil or UpG, followed by traditional PCR.[100] However, methylated cytosines will not be converted in this process, and primers are designed to overlap the CpG site of interest, which allows one to determine methylation status as methylated or unmethylated.
- Next-generation sequencing platform. The sequences obtained are then re-aligned to the reference genome to determine the methylation status of CpG dinucleotides based on mismatches resulting from the conversion of unmethylated cytosines into uracil.
- Enzymatic methyl-seq (EM-seq) works similarly to bisulfite sequencing, but uses enzymes, TET2, to deaminate unmethylated cytosine into uracil prior to sequencing. EM-seq libraries are less prone to DNA damage than bisulfite-treated libraries.[101]
- Reduced representation bisulfite sequencing, also known as RRBS knows several working protocols. The first RRBS protocol was called RRBS and aims for around 10% of the methylome, a reference genome is needed. Later came more protocols that were able to sequence a smaller portion of the genome and higher sample multiplexing. EpiGBS was the first protocol where you could multiplex 96 samples in one lane of Illumina sequencing and were a reference genome was no longer needed. A de novo reference construction from the Watson and Crick reads made population screening of SNP's and SMP's simultaneously a fact.
- The HELP assay, which is based on restriction enzymes' differential ability to recognize and cleave methylated and unmethylated CpG DNA sites.
- GLAD-PCR assay, which is based on a new type of enzymes – site-specific methyl-directed DNA endonucleases, which hydrolyze only methylated DNA.
- ChIP-on-chip assays, which is based on the ability of commercially prepared antibodies to bind to DNA methylation-associated proteins like MeCP2.
- Restriction landmark genomic scanning, a complicated and now rarely used assay based upon restriction enzymes' differential recognition of methylated and unmethylated CpG sites; the assay is similar in concept to the HELP assay.
- DNA microarrays (MeDIP-chip) or DNA sequencing(MeDIP-seq).
- Pyrosequencing of bisulfite treated DNA. This is the sequencing of an amplicon made by a normal forward primer but a biotinylated reverse primer to PCR the gene of choice. The Pyrosequencer then analyses the sample by denaturing the DNA and adding one nucleotide at a time to the mix according to a sequence given by the user. If there is a mismatch, it is recorded and the percentage of DNA for which the mismatch is present is noted. This gives the user a percentage of methylation per CpG island.
- Molecular break light assay for DNA adenine methyltransferase activity – an assay that relies on the specificity of the restriction enzyme DpnI for fully methylated (adenine methylation) GATC sites in an oligonucleotide labeled with a fluorophore and quencher. The adenine methyltransferase methylates the oligonucleotide making it a substrate for DpnI. Cutting of the oligonucleotide by DpnI gives rise to a fluorescence increase.[102][103]
- Methyl Sensitive Southern Blotting is similar to the HELP assay, although uses Southern blotting techniques to probe gene-specific differences in methylation using restriction digests. This technique is used to evaluate local methylation near the binding site for the probe.
- MethylCpG Binding Proteins (MBPs) and fusion proteins containing just the Methyl Binding Domain (MBD) are used to separate native DNA into methylated and unmethylated fractions. The percentage methylation of individual CpG islands can be determined by quantifying the amount of the target in each fraction.[citation needed] Extremely sensitive detection can be achieved in FFPE tissues with abscription-based detection.
- High Resolution Melt Analysis (HRM or HRMA), is a post-PCR analytical technique. The target DNA is treated with sodium bisulfite, which chemically converts unmethylated cytosines into uracils, while methylated cytosines are preserved. PCR amplification is then carried out with primers designed to amplify both methylated and unmethylated templates. After this amplification, highly methylated DNA sequences contain a higher number of CpG sites compared to unmethylated templates, which results in a different melting temperature that can be used in quantitative methylation detection.[104][105]
- Ancient DNA methylation reconstruction, a method to reconstruct high-resolution DNA methylation from ancient DNA samples. The method is based on the natural degradation processes that occur in ancient DNA: with time, methylated cytosines are degraded into thymines, whereas unmethylated cytosines are degraded into uracils. This asymmetry in degradation signals was used to reconstruct the full methylation maps of the Neanderthal and the Denisovan.[106] In September 2019, researchers published a novel method to infer morphological traits from DNA methylation data. The authors were able to show that linking down-regulated genes to phenotypes of monogenic diseases, where one or two copies of a gene are perturbed, allows for ~85% accuracy in reconstructing anatomical traits directly from DNA methylation maps.[107]
- Methylation Sensitive Single Nucleotide Primer Extension Assay (msSNuPE), which uses internal primers annealing straight 5' of the nucleotide to be detected.[108]
- Illumina Methylation Assay measures locus-specific DNA methylation using array hybridization. Bisulfite-treated DNA is hybridized to probes on "BeadChips." Single-base base extension with labeled probes is used to determine methylation status of target sites.[109] In 2016, the Infinium MethylationEPIC BeadChip was released, which interrogates over 850,000 methylation sites across the human genome.[110]
- Using nanopore sequencing, researchers have directly identified DNA and RNA base modifications at nucleotide resolution, including 5mC, 5hmC, 6mA, and BrdU in DNA, and m6A in RNA, with detection of other natural or synthetic epigenetic modifications possible through training basecalling algorithms.[111]
Differentially methylated regions (DMRs)
QDMR (Quantitative Differentially Methylated Regions) is a quantitative approach to quantify methylation difference and identify DMRs from genome-wide methylation profiles by adapting Shannon entropy.[120] The platform-free and species-free nature of QDMR makes it potentially applicable to various methylation data. This approach provides an effective tool for the high-throughput identification of the functional regions involved in epigenetic regulation. QDMR can be used as an effective tool for the quantification of methylation difference and identification of DMRs across multiple samples.[121]
Gene-set analysis (a.k.a. pathway analysis; usually performed tools such as DAVID, GoSeq or GSEA) has been shown to be severely biased when applied to high-throughput methylation data (e.g. MeDIP-seq, MeDIP-ChIP, HELP-seq etc.), and a wide range of studies have thus mistakenly reported hyper-methylation of genes related to development and differentiation; it has been suggested that this can be corrected using sample label permutations or using a statistical model to control for differences in the numbers of CpG probes / CpG sites that target each gene.[122]
DNA methylation marks
DNA methylation marks – genomic regions with specific methylation patterns in a specific biological state such as tissue, cell type, individual – are regarded as possible functional regions involved in gene transcriptional regulation. Although various human cell types may have the same genome, these cells have different methylomes. The systematic identification and characterization of methylation marks across cell types are crucial to understanding the complex regulatory network for cell fate determination. Hongbo Liu et al. proposed an entropy-based framework termed SMART to integrate the whole genome bisulfite sequencing methylomes across 42 human tissues/cells and identified 757,887 genome segments.[123] Nearly 75% of the segments showed uniform methylation across all cell types. From the remaining 25% of the segments, they identified cell type-specific hypo/hypermethylation marks that were specifically hypo/hypermethylated in a minority of cell types using a statistical approach and presented an atlas of the human methylation marks. Further analysis revealed that the cell type-specific hypomethylation marks were enriched through H3K27ac and transcription factor binding sites in a cell type-specific manner. In particular, they observed that the cell type-specific hypomethylation marks are associated with the cell type-specific super-enhancers that drive the expression of cell identity genes. This framework provides a complementary, functional annotation of the human genome and helps to elucidate the critical features and functions of cell type-specific hypomethylation.[citation needed]
The entropy-based Specific Methylation Analysis and Report Tool, termed "SMART", which focuses on integrating a large number of DNA methylomes for the de novo identification of cell type-specific methylation marks. The latest version of SMART is focused on three main functions including de novo identification of differentially methylated regions (DMRs) by genome segmentation, identification of DMRs from predefined regions of interest, and identification of differentially methylated CpG sites.[124]
In identification and detection of body fluids
DNA methylation allows for several tissues to be analyzed in one assay as well as for small amounts of body fluid to be identified with the use of extracted DNA. Usually, the two approaches of DNA methylation are either methylated-sensitive restriction enzymes or treatment with sodium bisulphite.[125] Methylated sensitive restriction enzymes work by cleaving specific CpG, cytosine and guanine separated by only one phosphate group, recognition sites when the CpG is methylated. In contrast, unmethylated cytosines are transformed to uracil and in the process, methylated cytosines remain methylated. In particular, methylation profiles can provide insight on when or how body fluids were left at crime scenes, identify the kind of body fluid, and approximate age, gender, and phenotypic characteristics of perpetrators.[126] Research indicates various markers that can be used for DNA methylation. Deciding which marker to use for an assay is one of the first steps of the identification of body fluids. In general, markers are selected by examining prior research conducted. Identification markers that are chosen should give a positive result for one type of cell. One portion of the chromosome that is an area of focus when conducting DNA methylation are tissue-specific differentially methylated regions, T-DMRs. The degree of methylation for the T-DMRs ranges depending on the body fluid.[126] A research team developed a marker system that is two-fold. The first marker is methylated only in the target fluid while the second is methylated in the rest of the fluids.[108] For instance, if venous blood marker A is un-methylated and venous blood marker B is methylated in a fluid, it indicates the presence of only venous blood. In contrast, if venous blood marker A is methylated and venous blood marker B is un-methylated in some fluid, then that indicates venous blood is in a mixture of fluids. Some examples for DNA methylation markers are Mens1(menstrual blood), Spei1(saliva), and Sperm2(seminal fluid).
DNA methylation provides a relatively good means of sensitivity when identifying and detecting body fluids. In one study, only ten nanograms of a sample was necessary to ascertain successful results.[127] DNA methylation provides a good discernment of mixed samples since it involves markers that give "on or off" signals. DNA methylation is not impervious to external conditions. Even under degraded conditions using the DNA methylation techniques, the markers are stable enough that there are still noticeable differences between degraded samples and control samples. Specifically, in one study, it was found that there were not any noticeable changes in methylation patterns over an extensive period of time.[126]
The detection of DNA methylation in cell-free DNA and other body fluids has recently become one of the main approaches to Liquid biopsy.[128] In particular, the identification of tissue-specific and disease-specific patterns allows for non-invasive detection and monitoring of diseases such as cancer.[129] If compared to strictly genomic approaches to liquid biopsy, DNA methylation profiling offers a larger number of differentially methylated CpG sites and differentially methylated regions (DMRSs), potentially enhancing its sensitivity. Signal deconvolution algorithms based on DNA methylation have been successfully applied to cell-free DNA and can nominate the tissue of origin of cancers of unknown primary, allograft rejection, and resistance to hormone therapy.[130]
Computational prediction
DNA methylation can also be detected by computational models through sophisticated algorithms and methods. Computational models can facilitate the global profiling of DNA methylation across chromosomes, and often such models are faster and cheaper to perform than biological assays. Such up-to-date computational models include Bhasin, et al.,[131] Bock, et al.,[132] and Zheng, et al.[133][134] Together with biological assay, these methods greatly facilitate the DNA methylation analysis.
See also
- 5-Hydroxymethylcytosine
- 5-Methylcytosine
- 7-Methylguanosine
- Decrease in DNA Methylation I (DDM1), a plant methylation gene
- Demethylating agent
- Differentially methylated regions
- DNA demethylation
- DNA methylation reprogramming
- Epigenetics, of which DNA methylation is a significant contributor
- Epigenetic clock, a method to calculate age based on DNA methylation
- Epigenome
- Genome
- Genomic imprinting, an inherited repression of an allele, relying on DNA methylation
- MethBase DNA Methylation database hosted on the UCSC Genome Browser
- MethDB DNA Methylation database
- N6-Methyladenosine
References
- PMC 1200409.
- PMID 4391887.
- PMC 341080.
- ^ PMID 5530731.
- PMID 1713034.
- ^ PMID 37081864.
- PMID 25299492.
- S2CID 82947750.
As a result of this process, known as repeat-induced point mutation (RIP), the wild-type Neurospora genome contains a small fraction of methylated DNA, the majority of the DNA remaining nonmethylated.
- ^ PMID 24640988.
- PMID 16479578.
- PMID 27027282.
- ^ Angéla Békési and Beáta G Vértessy "Uracil in DNA: error or signal?"
- PMID 27375676.
- PMID 12036582.
- PMID 11784085.
- ^ PMID 19829295.
- PMID 23828890.
- ^ PMID 26053498.
- S2CID 20307488.
- PMID 22170606.
- ^ Supplemental figures appear to be only accessible via the science.sciencemag.org paywall.
- PMID 17420183.
- ^ PMID 11237011.
- S2CID 4236677.
- PMID 3656447.
- PMID 20885785.
- PMID 16432200.
- ^ PMID 20395551.
- PMID 18514006.
- S2CID 22446734.
- S2CID 4403755.
- PMID 20875111.
- PMID 32561758.
- PMID 24630728.
- PMID 9260521.
- PMID 32719115.
- S2CID 34258335.
- PMID 20613842.
- S2CID 9328002.
- PMID 22404632. – via Annual Reviews (subscription required)
- ^ S2CID 247010798.
- PMID 7557385.
- S2CID 4311091.
- S2CID 205357042.
- PMID 23166394.
- PMID 29784032.
- PMID 19707296.
- ISBN 978-1-904455-88-2.
- ^ PMID 20448029.
- PMID 15131116.
- PMID 12881445.
- PMID 19285843.
- PMID 12163701.
- PMID 10727665.
- PMID 16116050.
- PMID 30089854.
- PMID 33769919.
- PMID 24138928.
- PMID 20505345.
- ^ PMID 22689993.
- ^ PMID 22405075.
- PMID 23825961.
- PMID 21178401.
- ^ PMID 26849493.
- PMID 28082740.
- PMID 10327235.
- ^ PMID 26656643.
- ^ PMID 28620075.
- ^ PMID 35098021.
- ^ Gratchev A. "Review on DNA Methylation". METHODS.info.
- S2CID 30907442.
- PMID 28854222.
- S2CID 39089541.
- ^ S2CID 256745172.
- PMID 25939354.
- PMID 12151602.
- PMID 27671052.
- S2CID 206525166.
- ^ PMID 12169664.
- PMID 28025279.
- S2CID 31709665.
- ISBN 9780128025864.
- PMID 23852726.
- PMID 6266924.
- S2CID 4427540.
- PMID 24558263.
- PMID 25936838.
- PMID 6330093.
- JSTOR 3761319.
- PMID 22276181.
- S2CID 4380222.
- PMID 1713034.
- S2CID 83941623.
- ^ Matthew J. Blow, Tyson A. Clark, Chris G. Daum, Adam M. Deutschbauer, Alexey Fomenkov, Roxanne Fries, Jeff Froula, Dongwan D. Kang, Rex R. Malmstrom, Richard D. Morgan, Janos Posfai, Kanwar Singh, Axel Visel, Kelly Wetmore, Zhiying Zhao, Edward M. Rubin, Jonas Korlach, Len A. Pennacchio, Richard J. Roberts: The Epigenomic Landscape of Prokaryotes. In: PLoS Genet. 12(2), Feb 2016, S. e1005854. doi:10.1371/journal.pgen.1005854. PMID 26870957. PMC 4752239
- PMID 34402642.
- PMID 31768029.
- PMID 8200522.
- ^ "Making unmethylated (dam-/dcm-) DNA". Archived from the original on 2011-01-06.
- .
- PMID 24107250.
- PMID 33028374.
- PMID 17726531.
- PMID 17263334.
- PMID 17289753.
- PMID 19454604.
- S2CID 28665590.
- S2CID 202676502.
- ^ PMID 26829227.
- ^ "Infinium Methylation Assay | Interrogate single CpG sites". www.illumina.com. Retrieved 2020-01-10.
- ^ "Infinium MethylationEPIC Kit | Methylation profiling array for EWAS". www.illumina.com. Retrieved 2020-01-10.
- ^ "Epigenetics and methylation analysis". Oxford Nanopore Technologies. Archived from the original on 2021-09-01. Retrieved 2021-09-27.
- PMID 18577705.
- S2CID 22243051.
- PMID 19151715.
- S2CID 17089710.
- PMID 18600261.
- PMID 19881528.
- PMID 18577732.
- PMID 18413340.
- ^ "QDMR: a quantitative method for identification of differentially methylated regions by entropy". bioinfo.hrbmu.edu.cn. Archived from the original on 2015-10-23. Retrieved 2013-03-09.
- PMID 21306990.
- PMID 23732277.
- PMID 26635396.
- ^ Hongbo L (2016). "SMART 2: A Comprehensive Analysis Tool for Bisulfite Sequencing Data". fame.edbc.org.
- PMID 25488609.
- ^ PMID 25732744.
- S2CID 7438775.
- S2CID 51965986.
- PMID 33506766.
- PMID 30498206.
- S2CID 14487630.
- PMID 16520826.
- ^ Zheng H, Jiang SW, Wu H (2011). "Enhancement on the predictive power of the prediction model for human genomic DNA methylation". International Conference on Bioinformatics and Computational Biology (BIOCOMP'11).
- PMID 23369266.
Further reading
- Law JA, Jacobsen SE (March 2010). "Establishing, maintaining and modifying DNA methylation patterns in plants and animals". Nature Reviews. Genetics. 11 (3): 204–220. PMID 20142834.
- Straussman R, Nejman D, Roberts D, Steinfeld I, Blum B, Benvenisty N, et al. (May 2009). "Developmental programming of CpG island methylation profiles in the human genome". Nature Structural & Molecular Biology. 16 (5): 564–571. S2CID 8804930.
- Patra SK (April 2008). "Ras regulation of DNA-methylation and cancer". Experimental Cell Research. 314 (6): 1193–1201. PMID 18282569.
- Patra SK, Patra A, Rizzi F, Ghosh TC, Bettuzzi S (June 2008). "Demethylation of (Cytosine-5-C-methyl) DNA and regulation of transcription in the epigenetic pathways of cancer development". Cancer and Metastasis Reviews. 27 (2): 315–334. S2CID 22435914.
- Nagpal G, Sharma M, Kumar S, Chaudhary K, Gupta S, Gautam A, Raghava GP (February 2014). "PCMdb: pancreatic cancer methylation database". Scientific Reports. 4: 4197. PMID 24569397.
External links
- DNA+Methylation at the U.S. National Library of Medicine Medical Subject Headings (MeSH)
- ENCODE threads explorer Non-coding RNA characterization. Nature (journal)
- PCMdb Pancreatic Cancer Methylation Database.
- SMART Specific Methylation Analysis and Report Tool
- Human Methylation Mark Atlas
- DiseaseMeth Archived 2020-01-27 at the Wayback Machine Human disease methylation database
- EWAS Atlas A knowledgebase of epigenome-wide association studies