Auxiliary metabolic genes
Auxiliary metabolic genes (AMGs) are found in many
Classes
AMGs employ diverse functions including pathways not involved in metabolism despite what the name suggests. They are categorized in two classes based on their presence in the Kyoto Encyclopedia of Genes and Genomes (KEGG).[7] AMGs do not encompass metabolic genes involved in typical viral functions, such as nucleotide and protein metabolism since their functions achieve direct viral reproduction, rather than augmenting host function to indirectly enhance it.[8]
Class I
Class I AMGs encode for metabolism pathways in the cell and are found in
Class II
Class II AMGs encode for peripheral functions absent from the KEGG metabolic pathways. This includes genes typically involved in transport and assembly.[8] Major representatives of this class are involved in balancing TCA cycle intermediates.[7] Additionally, the acquisition of biogenic elements outside of carbon like phosphate, governed by pstS, are prevalent for this class.[11] Confidence of AMG identification for Class II AMGs is reduced without a database for reference.[12]
Abundance
Virus survival through inclusion of AMGs is governed by the laws of natural selection and has been made highly selective through co-evolution with their hosts.[13] As such, the AMGs that confer a fitness advantage to the virus's ability to infect a host and reproduce will be more abundant. AMG abundance is largely dictated by the lifestyle of the virus, environmental conditions surrounding it, and host characteristics.[6]
Lifestyle
Lytic and lysogenic viruses have different lifestyles which impact what AMGs they acquire. Lytic viruses tend to use AMGs to repurpose host cell metabolism and steal nutrients when in high cell density. Therefore, AMGs related to metabolism and transport are found more abundantly in lytic viruses.[14] Lytic viruses also encompass a more diverse set of AMGs than lysogenic viruses, in part due to their larger host range and higher infection frequency. Temperate viruses, on the other hand, may employ AMGs to improve host fitness and virulence due to their often longer lifespan in the cell as a prophage.[15] Gene density in these viruses is higher when compared to their lytic counterparts. Higher rates of HGT in lysogenic viruses allows for more AMG transfer but also lowers overall gene diversity.[6]
Photosynthesis capacity has also been correlated to AMG diversity. Aphotic viral communities possess greater AMG diversity than those in the photic zone.[16]
Environmental conditions
Pathways utilizing nutrients found in low concentrations in the local environment are generally found in higher abundance in the virus. In marine environments, AMGs can confer fitness advantages for both host and viruses under relatively nutrient-limited conditions compared to sediment and strong ultraviolet stress of water.[6] In sunlit versus dark ocean waters, AMGs in distinct pathways are unequally distributed to reprogram host energy production and viral replication based on available nutrients.[17] In sedimentary environments, carbon and sulfur metabolism AMGs are typically more prevalent to outcompete other organisms for the abundant resources.[18]
Host factors
A virus's host range determines which host it can acquire AMGs from. Additionally, the abundance of a host surrounding a virus will affect its likelihood to acquire genes from the host. Virus populations increasingly occupy lytic lifestyles as bacterial production increases.[14] The strong evolutionary connection between viruses and their hosts makes AMG acquisition mirror the host's own adaptation to its environment over time.[6]
Synechococcus and Prochlorococcus are the most abundant picocyanobacteria, accounting for up to 50% of primary production in the marine environment.[19] As such, many AMGs characterized have been discovered in phages of these host systems.
Identification
DRAM-v[20] is the standard for AMG annotation of metagenome assembled genomes (MAGs) identified as viruses.[21] DRAM-v searches the following databases for AMGs that match the input MAGs: Pfam, KEGG, UniProt, CAZy, MEROPS, VOGDB, and NCBI Viral RefSeq.[20] KEGG can then be referenced to classify annotated AMGs through VIBRANT.[22]
Cellular contamination
Since AMGs originate in hosts, distinguishing host and viral genes is critical for their study. This is not easily achieved as cultivation of viral-host systems in a laboratory setting proves challenging if even possible.[8] Additionally, filtering out cellular sequences before entry in bioinformatic pipelines is not possible with cellular gene transfer agents and membrane vesicles are unable to distinguish from viruses due to their many shared properties at this step of analysis.[23][24] The extent to which they have contaminated existing viral databases is unknown.[8] Some genes have distinctions between host and viral versions such as cyanophage photosynthesis easing the task of computational distinction. The most definitive way developed to determine gene origin has been identification of taxonomically informative genes colocalized on assembled contigs. ViromeQC[25] can display contamination for the dataset overall and DRAM-v assigns a confidence score for the AMG being on a viral MAG.[20] Viral identification is most popularly performed by VIBRANT,[22] VirSorter2,[26] DeepVirFinder,[27] and CheckV.[28]
Genomic context
AMGs are not randomly distributed throughout genomes. Current research is being done to determine the genes that most commonly surround specific AMGs.[29] Hyperplastic regions including the region between genes g15-g18 has been classified as locales where multiple AMGs have been inserted.[30] Possible AMG contexts can be divided into locally collinear blocks (LCBs), or homologous regions shared by multiple viruses without rearrangements.[31] AMGs have been found in just one or up to 14 LCBs. Those found in more diverse contexts have also shown up in variable locales within the LCB.[29]
Acquisition mechanisms
Horizontal gene transfer (HGT) from host to virus allows for AMGs to be acquired. Gene transfer from host eukaryotes to viruses occur about twice as frequently as virus to host gene transfers due to a higher number viral recipients than donors. The vast majority of gene transfer occurs in double-stranded DNA viruses since they have large and flexible genomes, co-evolution with eukaryotes, and wide host breadth. Additionally, unicellular hosts more commonly transfer genes.[13]
Mechanisms of action
Transcriptional regulation
AMGs may influence gene expression by modulating the activity of transcription factors, which control the rate at which specific genes are transcribed into mRNA, thereby impacting the levels of corresponding proteins involved in metabolic pathways.
Enzyme modulation
Certain AMGs encode proteins that directly interact with enzymes involved in metabolic reactions. This interaction can either enhance or inhibit enzyme activity, leading to changes in the rate of metabolic flux through specific pathways.
Signaling pathways
AMGs may be integrated into cellular signaling pathways, influencing the transmission of signals related to energy status, nutrient availability, or stress. By modulating these signaling pathways, AMGs can indirectly regulate metabolic processes.
Ecological implications
Biogeochemicalc cycling
AMGs have a large impact on biogeochemical cycles in multiple environments through nutrient degradation, mineralization, transportation, assimilation, and transformation.[6] By enhancing the metabolic capabilities of their hosts, bacteriophages contribute to the recycling of organic matter, influencing the availability of nutrients for other organisms in the ecosystem. Lytic viruses in particular have been shown to increase ammonium oxidation, nitric oxide reduction, nitrification, and denitrification to balance nutrient levels in nitrogen polluted environments.[6] Nutrient-enriched wetlands contain AMGs related to sulfur transport and metabolism.[32] AMG modification of host processes is another means other than the viral shunt by which viruses can directly impact biogeochemical cycles.[33]
Community structure
The ability of AMGs modulating the metabolic capacities of their hosts can influence the abundance and distribution of specific microbial taxa.[6] In turn, this shapes the overall composition of microbial communities, with potential cascading effects on higher trophic levels.[citation needed]
Adaptation to environment
AMGs play a crucial role in microbial adaptation to environmental changes. In extreme environments, AMGs can encode for alternate energy pathways such as subunits of dissimilatory sulfite reductase.[34] The ability of viruses to confer new metabolic traits to their hosts enhances the resilience of microbial communities facing shifts in temperature, nutrient availability, or other environmental stressors.[6] AMGs can also serve as a genetic pool in shaping the evolution of their hosts.[35]
References
- .
- PMID 27693926.
- S2CID 4411495.
- PMID 21844365.
- PMID 24666644.
- ^ PMID 36333738.
- ^ PMID 27088500.
- ^ S2CID 32998525.
- PMID 16802857.
- PMID 21844365.
- PMID 15828858.
- PMID 25093636.
- ^ S2CID 245616252.
- ^ PMID 26296067.
- PMID 28291233.
- PMID 25002514.
- PMID 24200126.
- PMID 30013236.
- ISSN 0967-0637.
- ^ PMID 32766782.
- PMID 34178438.
- ^ PMID 32522236.
- PMID 23140888.
- PMID 25314322.
- S2CID 208191024.
- PMID 33522966.
- PMID 34084563.
- PMID 33349699.
- ^ S2CID 205652754.
- PMID 19508343.
- PMID 15231754.
- S2CID 256192280.
- S2CID 207894289.
- S2CID 692770.
- S2CID 4397295.