LTR retrotransposon
LTR retrotransposons are class I
Their size ranges from a few hundred base pairs to 25kb, for example the Ogre retrotransposon in the pea genome.
In plant genomes, LTR retrotransposons are the major repetitive sequence class, for example, constituting more than 75% of the maize genome.[1] LTR retrotransposons make up about 8% of the human genome and approximately 10% of the mouse genome.[2]
Structure and propagation
LTR retrotransposons have
All functional LTR-retrotransposons encode a minimum of two genes,
Typically, LTR-retrotransposon mRNAs are produced by the host RNA pol II acting on a promoter located in their 5’ LTR. The Gag and Pol genes are encoded in the same mRNA. Depending on the host species, two different strategies can be used to express the two polyproteins: a fusion into a single open reading frame (ORF) that is then cleaved or the introduction of a frameshift between the two ORFs.[8] Occasional ribosomal frameshifting allows the production of both proteins, while ensuring that much more Gag protein is produced to form virus-like particles.
Types
Ty1-copia retrotransposons
Ty1-copia retrotransposons are abundant in species ranging from single-cell
At least two classification systems exist for the subdivision of Ty1-copia retrotransposons into five lineages:[10][11] Sireviruses/Maximus, Oryco/Ivana, Retrofit/Ale, TORK (subdivided in Angela/Sto, TAR/Fourf, GMR/Tork), and Bianca.
Sireviruses/Maximus retrotransposons contain an additional putative envelope gene. This lineage is named for the founder element SIRE1 in the Glycine max genome,[12] and was later described in many species such as Zea mays,[13] Arabidopsis thaliana,[14] Beta vulgaris,[15] and Pinus pinaster.[16] Plant Sireviruses of many sequenced plant genomes are summarized at the MASIVEdb Sirevirus database.[17]
Ty3-retrotransposons (formally gypsy)
Ty3-retrotransposons are widely distributed in the plant kingdom, including both gymnosperms and angiosperms. They encode at least four protein domains in the order: protease, reverse transcriptase, ribonuclease H, and integrase. Based on structure, presence/absence of specific protein domains, and conserved protein sequence motifs, they can be subdivided into several lineages:
Errantiviruses contain an additional defective envelope ORF with similarities to the retroviral envelope gene. First described as Athila-elements in Arabidopsis thaliana,[18][19] they have been later identified in many species, such as Glycine max[20] and Beta vulgaris.[21]
Chromoviruses contain an additional chromodomain (chromatin organization modifier domain) at the C-terminus of their integrase protein.[22][23] They are widespread in plants and fungi, probably retaining protein domains during evolution of these two kingdoms.[24] It is thought that the chromodomain directs retrotransposon integration to specific target sites.[25] According to sequence and structure of the chromodomain, chromoviruses are subdivided into the four clades CRM, Tekay, Reina and Galadriel. Chromoviruses from each clade show distinctive integration patterns, e.g. into centromeres or into the rRNA genes.[26][27]
Ogre-elements are gigantic Ty3-retrotransposons reaching lengths up to 25 kb.[28] Ogre elements have been first described in Pisum sativum.[29]
Metaviruses describe conventional Ty3-gypsy retrotransposons that do not contain additional domains or ORFs.
The Sushi family of Ty3 long terminal repeat retrotransposons were first identified in teleost fish and Sushi-like neogenes were subsequently identified in mammals.[30] Mammalian retrotransposon-derived transcripts (MARTs) cannot transpose but have retained open reading frames, demonstrate high levels of evolutionary conservation and are subject to selective pressures, which suggests some have become neofunctionalized genes with new cellular functions.[31] Retrotransposon gag-like-3 (RTL3/ZCCHC5/MART3) is one of eleven Sushi-like neogenes identified in the human genome.[32]
BEL/pao family
The BEL/pao family is found in animals.[33]
Endogenous retroviruses (ERV)
Although retroviruses are often classified separately, they share many features with LTR retrotransposons. A major difference with Ty1-copia and Ty3-gypsy retrotransposons is that retroviruses have an envelope protein (ENV). A retrovirus can be transformed into an LTR retrotransposon through inactivation or deletion of the domains that enable extracellular mobility. If such a retrovirus infects and subsequently inserts itself in the genome in germ line cells, it may become transmitted vertically and become an Endogenous Retrovirus.[5]
Terminal repeat retrotransposons in miniature (TRIMs)
Some LTR retrotransposons lack all of their coding domains. Due to their short size, they are referred to as terminal repeat retrotransposons in miniature (TRIMs).[34][35] Nevertheless, TRIMs can be able to retrotranspose, as they may rely on the coding domains of autonomous Ty1-copia or Ty3-gypsy retrotransposons. Among the TRIMs, the Cassandra family plays an exceptional role, as the family is unusually wide-spread among higher plants.[36] In contrast to all other characterized TRIMs, Cassandra elements harbor a 5S rRNA promoter in their LTR sequence.[37] Due to their short overall length and the relatively high contribution of the flanking LTRs, TRIMs are prone to re-arrangements by recombination.[38]
References
- PMID 19936065.
- PMID 15003117.
- ^ "OSF". osf.io. Retrieved 2022-11-04.
- PMID 15725362.
- ^ S2CID 32132898.
- PMID 21189452.
- S2CID 32132898.
- PMID 14623998.
- ^ "What is Transposon's Target Site Duplication (TSD)?". 22 May 2019.
- PMID 17556529.
- PMID 19883502.
- PMID 9618510.
- PMID 21967390.
- S2CID 27249664.
- S2CID 24883110.
- S2CID 28629941.
- PMID 22545773.
- S2CID 8296300.
- PMID 9611185.
- PMID 11779837.
- PMID 22804913.
- PMID 10889217.
- PMID 14739248.
- PMID 20377908.
- PMID 18256242.
- PMID 21371312.
- PMID 23448600.
- PMID 17052864.
- S2CID 13412101.
- PMID 33043724.
- PMID 33043724.
- PMID 33043724.
- )
- PMID 11717436.)
{{cite journal}}
: CS1 maint: multiple names: authors list (link - S2CID 7035882.
- )
- PMID 18408163.
- PMID 33009553.