Protein function prediction

Protein function prediction methods are techniques that

proteins. These proteins are usually ones that are poorly studied or predicted based on genomic sequence data. These predictions are often driven by data-intensive computational procedures. Information may come from nucleic acid sequence homology, gene expression profiles, protein domain structures, text mining of publications, phylogenetic profiles, phenotypic profiles, and protein-protein interaction. Protein function is a broad term: the roles of proteins range from catalysis of biochemical reactions to transport to signal transduction, and a single protein may play a role in multiple processes or cellular pathways.^[1]

Generally, function can be thought of as, "anything that happens to or through a protein".

Gene Ontology Consortium provides a useful classification of functions, based on a dictionary of well-defined terms divided into three main categories of molecular function, biological process and cellular component.^[2] Researchers can query this database with a protein name or accession number

to retrieve associated Gene Ontology (GO) terms or annotations based on computational or experimental evidence.

While techniques such as

yeast two-hybrid system can be used to experimentally demonstrate the function of a protein, advances in sequencing technologies have made the rate at which proteins can be experimentally characterized much slower than the rate at which new sequences become available.^[3] Thus, the annotation of new sequences is mostly by prediction through computational methods, as these types of annotation can often be done quickly and for many genes or proteins at once. The first such methods inferred function based on homologous proteins with known functions (homology-based function prediction). The development of context-based and structure based methods have expanded what information can be predicted, and a combination of methods can now be used to get a picture of complete cellular pathways based on sequence data.^[3] The importance and prevalence of computational prediction of gene function is underlined by an analysis of 'evidence codes' used by the GO database: as of 2010, 98% of annotations were listed under the code IEA (inferred from electronic annotation) while only 0.6% were based on experimental evidence.^[4]

Homology-based methods

Retrieved from "https://en.wikipedia.org/w/index.php?title=Protein_function_prediction&oldid=1174268335"

[Rost-1] 
S2CID 8800506
.

[2] PMID 10802651
.

[gabaldon-3] 
S2CID 18032660
.

[4] PMID 21330331
.

[5] S2CID 42949514
.

[whisstock-6] 
S2CID 27123114
.

[Platt2000-7] PMID 10737789
.

[8] PMID 12051862
.

[9] PMID 14568541
.

[10] PMID 19920124
.

[pmid23161684-11] PMID 23161684
.

[sleator-12] 
S2CID 8932206
.

[13] PMID 19858104
.

[14] PMID 11099261
.

[15] S2CID 16509924
.

[16] PMID 10592235
.

[17] PMID 15215455
.

[18] PMID 9796821
.

[DeepAlign-19] PMID 23486213
.

[20] PMID 21155016
.

[biomedcentral2013-21] 
PMID 23514271
.

[22] PMID 26773655
.

[23] S2CID 26066208
.

[24] PMID 14681376
.

[25] PMID 25343578
.

[:4-26] 
PMID 16878974
.

[27] S2CID 20273975
.

[eisenberg-28] 
S2CID 4398864
.

[29] PMID 21051344
.

[marcotte-30] 
PMID 10427000
.

[overbeek-31] PMID 10077608
.

[32] PMID 12695325
.

[33] PMID 10613842
.

[34] PMID 22824328
.

[pavlidis-35] 
PMID 23936626
.

[Eksi-36] PMID 24244129
.

[37] PMID 24951248
.

[38] S2CID 3009359
.

[sharan-39] 
PMID 17353930
.

[mostafavi-40] 
PMID 18613948
.

[41] PMID 16420673
.

[42] PMID 18613946
.

[43] PMID 27924014
.

[44] PMID 27081850
.

[45] PMID 34076241
.

[1]

[2]

[3]

[4]

[5]

[7]

[8]

[9]

[12]

[13]

[14]

[15]

[6]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[32]

[33]

[34]

[35]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

v t e Proteins: key methods of study
Experimental	Protein purification Green fluorescent protein Western blot Protein immunostaining Protein sequencing Gel electrophoresis/Protein electrophoresis Protein immunoprecipitation Peptide mass fingerprinting/Protein mass spectrometry Dual-polarization interferometry Microscale thermophoresis Chromatin immunoprecipitation Surface plasmon resonance Isothermal titration calorimetry X-ray crystallography Protein NMR Cryo-electron microscopy Freeze-fracture electron microscopy
Bioinformatics	Protein structure prediction Protein function prediction Protein–protein docking Protein structural alignment Protein ontology Protein–protein interaction prediction
Assay	Enzyme assay Protein assay Secretion assay
Display techniques	Bacterial display mRNA display Phage display Ribosome display Yeast display
Super-resolution microscopy	Photoactivated localization microscopy Vertico SMI

Protein function prediction

Homology-based methods

Sequence motif-based methods

Structure-based methods

Protein structure prediction

Computational solvent mapping

Genome context-based methods

Gene expression and location-based methods

Network-based methods

Integrated networks

Tools and databases for protein function prediction

See also

References

External links