Short interspersed nuclear element
Short interspersed nuclear elements (SINEs) are non-autonomous,
The internal regions of SINEs originate from
In essence, short interspersed nuclear elements are genetic parasites which have evolved very early in the history of eukaryotes to utilize protein machinery within the organism as well as to co-opt the machinery from similarly parasitic genomic elements. The simplicity of these elements make them remarkably successful at persisting and amplifying (through retrotransposition) within the genomes of eukaryotes. These "parasites" which have become ubiquitous in genomes can be very deleterious to organisms as discussed below. However, eukaryotes have been able to integrate short-interspersed nuclear elements into different signaling, metabolic and regulatory pathways and SINEs have become a great source of genetic variability. They seem to play a particularly important role in the regulation of gene expression and the creation of RNA genes. This regulation extends to chromatin re-organization and the regulation of genomic architecture. The different lineages, mutations, and activities among eukaryotes make short-interspersed nuclear elements a useful tool in phylogenetic analysis.
Classification and structure
SINEs are classified as non-LTR
Internal structure
SINEs are characterized by their different modules, which are essentially a sectioning of their sequence. SINEs can, but do not necessarily have to possess a head, a body, and a tail. The head, is at the
Transcription
Short-interspersed nuclear elements are transcribed by
Effects on gene expression
Changes in chromosome structure influence
In fact Usmanova et al. 2008 suggested that short-interspersed nuclear elements can serve as direct signals in
In addition to directly affecting chromatin structure, there are a number of ways in which SINEs can potentially regulate gene expression. For example, long non-coding RNA can directly interact with transcriptional repressors and activators, attenuating or modifying their function.[15] This type of regulation can occur in different ways: the RNA transcript can directly bind to the transcription factor as a co-regulator; also, the RNA can regulate and modify the ability of co-regulators to associate with the transcription factor.[15] For example, Evf-2, a certain long non-coding RNA, has been known to function as a co-activator for certain homeobox transcription factors which are critical to nervous system development and organization.[16] Furthermore, RNA transcripts can interfere with the functionality of the transcriptional complex by interacting or associating with RNA polymerases during the transcription or loading processes.[15] Moreover, non-coding RNAs like SINEs can bind or interact directly with the DNA duplex coding the gene and thus prevent its transcription.[15]
Also, many non-coding RNAs are distributed near protein-coding genes, often in the reverse direction. This is especially true for short-interspersed nuclear elements as seen in Usmanova et al. These non-coding RNAs, which lie adjacent to or overlap gene-sets provide a mechanism by which transcription factors and machinery can be recruited to increase or repress the transcription of local genes. The particular example of SINEs potentially recruiting the YY1
In conclusion, non-coding RNAs such as SINEs are capable of affecting gene expression on a multitude of different levels and in different ways. Short-interspersed nuclear elements are believed to be deeply integrated into a complex regulatory network capable of fine-tuning gene expression across the eukaryotic genome.
Propagation and regulation
The RNA coded by the short-interspersed nuclear element does not code for any protein product but is nonetheless
LINE-1 (L1) is transcribed and retrotransposed most frequently in the
SINEs are known to share sequence homology with LINES which gives a basis by which the LINE machinery can reverse transcribe and integrate SINE transcripts.[22] Alternately, some SINEs are believed to use a much more complex system of integrating back into the genome; this system involves the use random double-stranded DNA breaks (rather than the endonuclease coded by related long-interspersed nuclear elements creating an insertion-site).[22] These DNA breaks are utilized to prime reverse transcriptase, ultimately integrating the SINE transcript back into the genome.[22] SINEs nonetheless depend on enzymes coded by other DNA elements and are thus known as non-autonomous retrotransposons as they depend on the machinery of LINEs, which are known as autonomous retrotransposons.<[23]
The theory that short-interspersed nuclear elements have evolved to utilize the retrotransposon machinery of long-interspersed nuclear elements is supported by studies which examine the presence and distribution of LINEs and SINEs in taxa of different species.[24] For example, LINEs and SINEs in rodents and primates show very strong homology at the insertion-site motif.[24] Such evidence is a basis for the proposed mechanism in which integration of the SINE transcript can be co-opted with LINE-coded protein products. This is specifically demonstrated by a detailed analysis of over 20 rodent species profiled LINEs and SINEs, mainly L1s and B1s respectively; these are families of LINEs and SINEs found at high frequencies in rodents along with other mammals.[24] The study sought to provide phylogenetic clarity within the context of LINE and SINE activity.
The study arrived at a candidate taxa believed to be the first instance of L1 LINE extinction; it expectedly discovered that there was no evidence to suggest that B1 SINE activity occurred in species which did not have L1 LINE activity.[24] Also, the study suggested that B1 short-interspersed nuclear element silencing in fact occurred before L1 long-interspersed nuclear element extinction; this is due to the fact that B1 SINEs are silenced in the genus most-closely related to the genus which does not contain active L1 LINEs (though the genus with B1 SINE silencing still contains active L1 LINEs).[24] Another genus was also found which similarly contained active L1 long-interspersed nuclear elements but did not contain B1 short-interspersed nuclear elements; the opposite scenario, in which active B1 SINEs were present in a genus which did not possess active L1 LINEs was not found.[24] This result was expected and strongly supports the theory that SINEs have evolved to co-opt the RNA-binding proteins, endonucleases, and reverse-transcriptases coded by LINEs. In taxa which do not actively transcribe and translate long-interspersed nuclear elements protein-products, SINEs do not have the theoretical foundation by which to retrotranspose within the genome. The results obtained in Rinehart et al. are thus very supportive of the current model of SINE retrotransposition.
Effects of SINE transposition
Insertion of a SINE upstream of a coding region may result in
Common SINEs
Short-interspersed nuclear elements are believed to have
Apart from mammals, SINEs can reach high copy numbers in a range of species, including nonbony vertebrates (elephant shark) and some fish species (coelacanths).[27] In plants, SINEs are often restricted to closely related species and have emerged, decayed, and vanished frequently during evolution.[28] Nevertheless, some SINE families such as the Au-SINEs[29] and the Angio-SINEs[30] are unusually widespread across many often unrelated plant species.
Diseases
There are >50 human diseases associated with SINEs.
microRNAs
The role of short-interspersed nuclear elements in gene regulation within cells has been supported by multiple studies. One such study examined the correlation between a certain family of SINEs with
With such evidence suggesting that short-interspersed nuclear elements have been evolutionary sources for microRNA loci generation it is important to further discuss the potential relationships between the two as well as the mechanism by which the microRNA regulates RNA degradation and more broadly, gene expression. A microRNA is a non-coding RNA generally 22 nucleotides in length.[32] This non-protein coding oligonucleotide is itself coded by longer nuclear DNA sequence usually transcribed by RNA polymerase II which is also responsible for the transcription of most mRNAs and snRNAs in eukaryotes.[33] However, some research suggests that some microRNAs that possess upstream short-interspersed nuclear elements are transcribed by RNA polymerase III which is widely implicated in ribosomal RNA and tRNA, two transcripts vital to mRNA translation.[34] This provides an alternate mechanism by which short-interspersed nuclear elements could be interacting with or mediating gene-regulatory networks involving microRNAs.
The regions coding miRNA can be independent RNA-genes often being anti-sense to neighboring protein-coding genes, or can be found within the introns of protein-coding genes.[35] The co-localization of microRNA and protein-coding genes provides a mechanistic foundation by which microRNA regulates gene-expression. Furthermore, Scarpato et al. reveals (as discussed above) that genes predicted to possess short-interspersed nuclear elements (SINEs) through sequence analysis were targeted and hybridized by microRNAs significantly greater than other genes.[31] This provides an evolutionarily path by which the parasitic SINEs were co-opted and utilized to form RNA-genes (such as microRNAs) which have evolved to play a role in complex gene-regulatory networks.
The microRNAs are transcribed as part of longer RNA strands of generally about 80 nucleotides which through complementary base-pairing are able to form hairpin loop structures[36] These structures are recognized and processed in the nucleus by the nuclear protein DiGeorge Syndrome Critical Region 8 (DGCR8) which recruits and associates with the Drosha protein.[37] This complex is responsible for cleaving some of the hair-pin structures from the pre-microRNA which is transported to the cytoplasm. The pre-miRNA is processed by the protein DICER into a double stranded 22 nucleotide.[38] Thereafter, one of the strands is incorporated into a multi-protein RNA-induced silencing complex (RISC).[39] Among these proteins are proteins from the Argonaute family which are critical to the complex's ability to interact with and repress the translation of the target mRNA.[40]
Understanding the different ways in which microRNA regulates gene-expression, including mRNA-translation and degradation is key to understanding the potential evolutionary role of SINEs in gene-regulation and in the generation of microRNA loci. This, in addition to SINEs' direct role in regulatory networks (as discussed in SINEs as long non-coding RNAs) is crucial to beginning to understand the relationship between SINEs and certain diseases. Multiple studies have suggested that increased SINE activity is correlated with certain gene-expression profiles and post-transcription regulation of certain genes.[41][42][43] In fact, Peterson et al. 2013 demonstrated that high SINE RNA expression correlates with post-transcriptional downregulation of BRCA1, a tumor suppressor implicated in multiple forms of cancer, namely breast cancer.[43] Furthermore, studies have established a strong correlation between transcriptional mobilization of SINEs and certain cancers and conditions such as hypoxia; this can be due to the genomic instability caused by SINE activity as well as more direct-downstream effects.[42] SINEs have also been implicated in countless other diseases. In essence, short-interspersed nuclear elements have become deeply integrated in countless regulatory, metabolic and signaling pathways and thus play an inevitable role in causing disease. Much is still to be known about these genomic parasites but it is clear they play a significant role within eukaryotic organisms.
SINEs and pseudogenes
The activity of SINEs however has genetic vestiges which do not seem to play a significant role, positive or negative, and manifest themselves in the genome as pseudogenes. SINEs however should not be mistaken as RNA pseudogenes.[1] In general, pseudogenes are generated when processed mRNAs of protein-coding genes are reverse-transcribed and incorporated back into the genome (RNA pseudogenes are reverse transcribed RNA genes).[44] Pseudogenes are generally functionless as they descend from processed RNAs independent of their evolutionary-context which includes introns and different regulatory elements which enable transcription and processing. These pseudogenes, though non-functional may in some cases still possess promoters, CpG islands, and other features which enable transcription; they thus can still be transcribed and may possess a role in the regulation of gene expression (like SINEs and other non-coding elements).[44] Pseudogenes thus differ from SINEs in that they are derived from transcribed- functional RNA whereas SINEs are DNA elements which retrotranspose by co-opting RNA genes transcriptional machinery. However, there are studies which suggest that retro-transposable elements such as short-interspersed nuclear elements are not only capable of copying themselves in alternate regions in the genome but are also able to do so for random genes too.[45][46] Thus SINEs can be playing a vital role in the generation of pseudogenes, which themselves are known to be involved in regulatory networks. This is perhaps another means by which SINEs have been able to influence and contribute to gene-regulation.
References
- ^ PMID 23203982.
- .
- ^ PMID 17126948.
- ^ PMID 22406018.
- S2CID 32132898.
- PMID 17307271.
- PMID 9461397.
- PMID 12368238.
- S2CID 21123216.
- PMID 17304537.
- PMID 18000552.
- ^ PMID 18664128.
- S2CID 19399858.
- PMID 11486036.
- ^ S2CID 22274894.
- PMID 16705037.
- PMID 26996597.
- PMID 23497673.
- PMID 16877819.
- ^ PMID 18680436.
- ^ S2CID 10510149.
- ^ S2CID 22129236.
- S2CID 23872541.
- ^ S2CID 36518754.
- PMID 19763152.
- ^ PMID 16339378.
- PMID 25577199.
- PMID 21673742.
- S2CID 7840648.
- PMID 31610059.
- ^ S2CID 16759020.
- S2CID 205210153.
- PMID 15372072.
- PMID 18778799.
- S2CID 43262684.
- PMID 15525708.
- S2CID 4421030.
- PMID 14744438.
- PMID 12000786.
- PMID 19342379.
- PMID 26339299.
- ^ PMID 19508390.
- ^ PMID 24705161.
- ^ PMID 3909943.
- S2CID 32151696.
- PMID 15531153.