Protein subcellular localization prediction
Protein subcellular localization prediction (or just protein localization prediction) involves the prediction of where a protein resides in a cell, its subcellular localization.
In general, prediction tools take as input information about a protein, such as a
Prediction of protein subcellular localization is an important component of
Background
Experimentally determining the subcellular localization of a protein can be a laborious and time consuming task. Immunolabeling or tagging (such as with a green fluorescent protein) to view localization using fluorescence microscope are often used. A high throughput alternative is to use prediction.
Through the development of new approaches in computer science, coupled with an increased dataset of proteins of known localization, computational tools can now provide fast and accurate localization predictions for many organisms. This has resulted in subcellular localization prediction becoming one of the challenges being successfully aided by bioinformatics, and machine learning.
Many prediction methods now exceed the accuracy of some high-throughput laboratory methods for the identification of protein subcellular localization.[1][2][3] Particularly, some predictors have been developed[4] that can be used to deal with proteins that may simultaneously exist, or move between, two or more different subcellular locations. Experimental validation is typically required to confirm the predicted localizations.
Tools
In 1999
The development of protein subcellular location prediction has been summarized in two comprehensive review articles.[13][14] Recent tools and an experience report can be found in a recent paper by Meinken and Min (2012).
Application
Knowledge of the subcellular localization of a protein can significantly improve target identification during the
Bacterial cell surface and secreted proteins are also of interest for their potential as vaccine candidates or as diagnostic targets. Aberrant subcellular localization of proteins has been observed in the cells of several diseases, such as cancer and Alzheimer's disease. Secreted proteins from some archaea that can survive in unusual environments have industrially important applications.
By using prediction a high number of proteins can be assessed in order to find candidates that are trafficked to the desired location.
Databases
The results of subcellular localization prediction can be stored in databases. Examples include the multi-species database Compartments, FunSecKB2, a fungal database;[15] PlantSecKB, a plant database;[16] MetazSecKB, an animal and human database;[17] and ProtSecKB, a protist database.[18]
References
- PMID 32142105.
- PMID 16288665.
- S2CID 234484678.
- S2CID 226104.
- ^ "Protein Subcellular Localization Prediction". www.ncbi.nlm.nih.gov. Retrieved 2016-12-31.
- PMID 21483473.
- PMID 19651102.
- PMID 20596258.
- S2CID 62781755.
- S2CID 234484678.
- PMID 32142105.
- S2CID 233449747.
- ^ Nakai, K. Protein sorting signals and prediction of subcellular localization. Adv. Protein Chem., 2000, 54, 277-344.
- ^ Chou, K. C.; Shen, H. B. Review: Recent progresses in protein subcellular location prediction" Anal. Biochem 2007, 370, 1-16.
- ^ "FunSecKB2 (The Fungal Secretome and Subcellular Proteome KnowledgeBase 2.1)". bioinformatics.ysu.edu. Archived from the original on 2016-04-10. Retrieved 2017-09-17.
- ^ "PlantSecKB (The Plant Secretome and Subcellular Proteome KnowledgeBase)". bioinformatics.ysu.edu. Archived from the original on 2016-04-06. Retrieved 2017-09-17.
- ^ "MetazSecKB (The Metazoa (Human & Animal) Protein Subcelluar Location, Secretome and Subcellular Proteome Database)". bioinformatics.ysu.edu. Archived from the original on 2016-04-06. Retrieved 2017-09-17.
- ^ "ProtSecKB (The Protist Secretome and Subcellular Proteome KnowledgeBase)". proteomics.ysu.edu. Retrieved 2017-09-17.
Further reading
- Bork P, Dandekar T, Diaz-Lazcoz Y, Eisenhaber F, Huynen M, Yuan Y (Nov 1998). "Predicting function: from genes to genomes and back". Journal of Molecular Biology. 283 (4): 707–25. PMID 9790834.
- Nakai K (2000). "Protein sorting signals and prediction of subcellular localization". Advances in Protein Chemistry. 54: 277–344. PMID 10829231.
- Emanuelsson O (Dec 2002). "Predicting protein subcellular localisation from amino acid sequence information". Briefings in Bioinformatics. 3 (4): 361–76. PMID 12511065.
- Schneider G, Fechner U (Jun 2004). "Advances in the prediction of protein targeting signals". Proteomics. 4 (6): 1571–80. S2CID 7217647.
- Gardy JL, Brinkman FS (Oct 2006). "Methods for predicting bacterial protein subcellular localization". Nature Reviews. Microbiology. 4 (10): 741–51. S2CID 62781755.
- Chou KC, Shen HB (Nov 2007). "Recent progress in protein subcellular location prediction". Analytical Biochemistry. 370 (1): 1–16. PMID 17698024.