Protein Engineering, Vol. 12, No. 2, 107-118, February 1999
© 1999 Oxford University Press
Protein subcellular location prediction
Computer-Aided Drug Discovery, Pharmacia & Upjohn, Kalamazoo, MI 49007-4940, USA
The function of a protein is closely correlated with its subcellular location. With the rapid increase in new protein sequences entering data banks, we are confronted with a challenge: is it possible to use a bioinformatic approach to help expedite the determination of protein subcellular locations? To explore this problem, proteins were classified, according to their subcellular locations, into the following 12 groups: (1) chloroplast, (2) cytoplasm, (3) cytoskeleton, (4) endoplasmic reticulum, (5) extracell, (6) Golgi apparatus, (7) lysosome, (8) mitochondria, (9) nucleus, (10) peroxisome, (11) plasma membrane and (12) vacuole. Based on this classification scheme, which covers almost all the organelles and subcellular compartments in an animal or plant cell, a covariant discriminant algorithm was proposed to predict the subcellular location of a query protein from its amino acid composition. Results obtained through self-consistency, jackknife and independent dataset tests indicated that the rates of correct prediction by the current algorithm are significantly higher than those of existing methods. It is anticipated that the classification scheme and concept, together with the prediction algorithm, can expedite the functional characterization of new proteins and can also be of use in prioritizing genes and proteins identified by genomic efforts as potential molecular targets for drug design.
Keywords: amino acid composition/bioinformatics/covariant discriminant/organelles/subcellular compartments
1 To whom correspondence should be addressed. E-mail: kuo-chen.chou{at}am.pnu.com
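The abstract describes predicting a location from amino acid composition with a covariant discriminant, but does not give the formula. The following is a minimal sketch, not the authors' implementation. It assumes the discriminant is the squared Mahalanobis distance to a class mean plus the log-determinant of that class's covariance matrix, and it adds a small ridge term to keep the covariance invertible. The function names, the `ridge` parameter and the toy sequences are all hypothetical.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def composition(sequence):
    """Return the 20-component amino acid composition (fractions summing to 1)."""
    seq = [r for r in sequence.upper() if r in AMINO_ACIDS]
    if not seq:
        raise ValueError("sequence contains no standard residues")
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]


def class_stats(vectors, ridge=1e-3):
    """Mean vector and covariance matrix of one location class.

    Assumption: `ridge` is added to the diagonal so the matrix stays
    invertible; this regularization is not taken from the article.
    """
    n, dim = len(vectors), len(vectors[0])
    mean = [sum(v[i] for v in vectors) / n for i in range(dim)]
    cov = [[sum((v[i] - mean[i]) * (v[j] - mean[j]) for v in vectors) / max(n - 1, 1)
            + (ridge if i == j else 0.0)
            for j in range(dim)] for i in range(dim)]
    return mean, cov


def solve_and_logdet(matrix, rhs):
    """Solve matrix @ y = rhs by Gaussian elimination; also return ln|det(matrix)|."""
    n = len(rhs)
    a = [row[:] + [b] for row, b in zip(matrix, rhs)]  # augmented copy
    logdet = 0.0
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        logdet += math.log(abs(a[col][col]))
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n + 1):
                a[r][c] -= factor * a[col][c]
    y = [0.0] * n
    for r in reversed(range(n)):  # back substitution
        y[r] = (a[r][n] - sum(a[r][c] * y[c] for c in range(r + 1, n))) / a[r][r]
    return y, logdet


def discriminant(x, mean, cov):
    """Assumed form: squared Mahalanobis distance plus ln|det(cov)|."""
    d = [xi - mi for xi, mi in zip(x, mean)]
    y, logdet = solve_and_logdet(cov, d)
    return sum(di * yi for di, yi in zip(d, y)) + logdet


def predict(sequence, training):
    """Pick the location whose class gives the smallest discriminant value.

    `training` maps a location name to a list of example sequences.
    """
    x = composition(sequence)
    scores = {}
    for location, seqs in training.items():
        mean, cov = class_stats([composition(s) for s in seqs])
        scores[location] = discriminant(x, mean, cov)
    return min(scores, key=scores.get)


# Toy, made-up sequences for illustration only (not real proteins).
toy_training = {
    "nucleus": ["KKKRRRKKAA", "KRKRKRKRAG", "RRRKKKRKGA"],
    "plasma membrane": ["LLLIIIVVAA", "LIVLIVLIAG", "VVVLLLIIGA"],
}
print(predict("KKRRKRKRAA", toy_training))
```

A jackknife test of the kind mentioned in the abstract could be run with this sketch by removing each training sequence in turn, calling `predict` on it with the remaining data, and counting the correct assignments.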