Protein Engineering, Vol 11, 739-747, Copyright © 1998 by Oxford University Press
IN Shindyalov and PE Bourne
A new algorithm is reported which builds an alignment between two protein
structures. The algorithm involves a combinatorial extension (CE) of an
alignment path defined by aligned fragment pairs (AFPs) rather than the
more conventional techniques using dynamic programming and Monte Carlo
optimization. AFPs, as the name suggests, are pairs of fragments, one from
each protein, which confer structure similarity. AFPs are based on local
geometry, rather than global features such as orientation of secondary
structures and overall topology. Combinations of AFPs that represent
possible continuous alignment paths are selectively extended or discarded
thereby leading to a single optimal alignment. The algorithm is fast and
accurate in finding an optimal structure alignment and hence suitable for
database scanning and detailed analysis of large protein families. The
method has been tested and compared with results from Dali and VAST using a
representative sample of similar structures. Several new structural
similarities not detected by these other methods are reported. Specific
one-on-one alignments and searches against all structures as found in the
Protein Data Bank (PDB) can be performed via the Web at
http://cl.sdsc.edu/ce.html.
ARTICLES
Protein structure alignment by incremental combinatorial extension (CE) of the optimal path
San Diego Supercomputer Center, CA 92186, USA.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. J. Huang, D. Hang, L. J. Lu, L. Tong, M. B. Gerstein, and G. T. Montelione Targeting the Human Cancer Pathway Protein Interaction Network by Structural Genomics Mol. Cell. Proteomics, October 1, 2008; 7(10): 2048 - 2060. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Csaba, F. Birzele, and R. Zimmer Protein structure alignment considering phenotypic plasticity Bioinformatics, August 15, 2008; 24(16): i98 - i104. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. K. Baral, N. Jajcanin-Jozic, S. Deller, P. Macheroux, M. Abramic, and K. Gruber The First Structure of Dipeptidyl-peptidase III Provides Insight into the Catalytic Mechanism and Mode of Substrate Binding J. Biol. Chem., August 8, 2008; 283(32): 22316 - 22324. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Hu, D. Wu, J. Chen, J. Ding, H. Jiang, and X. Shen The Catalytic Intermediate Stabilized by a "Down" Active Site Loop for Diaminopimelate Decarboxylase from Helicobacter pylori: ENZYMATIC CHARACTERIZATION WITH CRYSTAL STRUCTURE ANALYSIS J. Biol. Chem., July 25, 2008; 283(30): 21284 - 21293. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. F. Gherardini and M. Helmer-Citterich Structure-based function prediction: approaches and applications Brief Funct Genomic Proteomic, July 3, 2008; (2008) eln030v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gao and J. Skolnick DBD-Hunter: a knowledge-based method for the prediction of DNA-protein interactions Nucleic Acids Res., July 1, 2008; 36(12): 3978 - 3992. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Mosca and T. R. Schneider RAPIDO: a web server for the alignment of protein structures in the presence of conformational changes Nucleic Acids Res., July 1, 2008; 36(suppl_2): W42 - W46. [Abstract] [Full Text] [PDF] |
||||
![]() |
A.-L. Abraham, E. P. C. Rocha, and J. Pothier Swelfe: a detector of internal repeats in sequences and structures Bioinformatics, July 1, 2008; 24(13): 1536 - 1537. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Bauer, P. E. Bourne, A. Formella, C. Frommel, C. Gille, A. Goede, A. Guerler, A. Hoppe, E.-W. Knapp, T. Poschel, et al. Superimpose: a 3D structural superposition server Nucleic Acids Res., July 1, 2008; 36(suppl_2): W47 - W54. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Nigham, L. Tucker-Kellogg, I. Mihalek, C. Verma, and D. Hsu pFlexAna: detecting conformational changes in remotely related proteins Nucleic Acids Res., July 1, 2008; 36(suppl_2): W246 - W251. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Valle, A. N. Mabbett, G. C. Ulett, A. Toledo-Arana, K. Wecker, M. Totsika, M. A. Schembri, J.-M. Ghigo, and C. Beloin UpaG, a New Member of the Trimeric Autotransporter Family of Adhesins in Uropathogenic Escherichia coli J. Bacteriol., June 15, 2008; 190(12): 4147 - 4161. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Wu, T. Hu, L. Zhang, J. Chen, J. Du, J. Ding, H. Jiang, and X. Shen Residues Asp164 and Glu165 at the substrate entryway function potently in substrate orientation of alanine racemase from E. coli: Enzymatic characterization with crystal structure analysis Protein Sci., June 1, 2008; 17(6): 1066 - 1076. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Houtz, R. Magnani, N. R. Nayak, and L. M. A. Dirk Co- and post-translational modifications in Rubisco: unanswered questions J. Exp. Bot., May 1, 2008; 59(7): 1635 - 1645. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Courville, E. Urbankova, C. Rensing, R. Chaloupka, M. Quick, and M. F. M. Cellier Solute Carrier 11 Cation Symport Requires Distinct Residues in Transmembrane Helices 1 and 6 J. Biol. Chem., April 11, 2008; 283(15): 9651 - 9658. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. G. Hocking, A. P. Herbert, D. Kavanagh, D. C. Soares, V. P. Ferreira, M. K. Pangburn, D. Uhrin, and P. N. Barlow Structure of the N-terminal Region of Complement Factor H and Conformational Implications of Disease-linked Sequence Variations J. Biol. Chem., April 4, 2008; 283(14): 9475 - 9487. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei, B.-H. Kim, and N. V. Grishin PROMALS3D: a tool for multiple protein sequence and structure alignments Nucleic Acids Res., April 1, 2008; 36(7): 2295 - 2300. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Araiso, S. Palioura, R. Ishitani, R. L. Sherrer, P. O'Donoghue, J. Yuan, H. Oshikane, N. Domae, J. DeFranco, D. Soll, et al. Structural insights into RNA-dependent eukaryal and archaeal selenocysteine formation Nucleic Acids Res., March 27, 2008; 36(4): 1187 - 1199. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Konagurthu, P. J. Stuckey, and A. M. Lesk Structural search and retrieval using a tableau representation of protein folding patterns Bioinformatics, March 1, 2008; 24(5): 645 - 651. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.-Y. Lu, Y.-C. Hsieh, M.-Y. Liu, Y.-H. Lin, C.-J. Chen, and Y.-S. Yang Identification and Characterization of Two Amino Acids Critical for the Substrate Inhibition of Human Dehydroepiandrosterone Sulfotransferase (SULT2A1) Mol. Pharmacol., March 1, 2008; 73(3): 660 - 668. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Zhang, W. Liu, T. Hu, L. Du, C. Luo, K. Chen, X. Shen, and H. Jiang Structural Basis for Catalytic and Inhibitory Mechanisms of {beta}-Hydroxyacyl-acyl Carrier Protein Dehydratase (FabZ) J. Biol. Chem., February 29, 2008; 283(9): 5370 - 5379. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Di Lello, L. M. Miller Jenkins, C. Mas, C. Langlois, E. Malitskaya, A. Fradet-Turcotte, J. Archambault, P. Legault, and J. G. Omichinski p53 and TFIIE{alpha} share a common binding site on the Tfb1/p62 subunit of TFIIH PNAS, January 8, 2008; 105(1): 106 - 111. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wu, Y. Yang, J. Zhang, P. Ji, W. Du, P. Jiang, D. Xie, H. Huang, M. Wu, G. Zhang, et al. Domain-swapped Dimerization of the Second PDZ Domain of ZO2 May Provide a Structural Basis for the Polymerization of Claudins J. Biol. Chem., December 7, 2007; 282(49): 35988 - 35999. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Zemla, B. Geisbrecht, J. Smith, M. Lam, B. Kirkpatrick, M. Wagner, T. Slezak, and C. E. Zhou STRALCP structure alignment-based clustering of proteins Nucleic Acids Res., December 3, 2007; 35(22): e150 - e150. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Sommer, O. Muller, F. S. Domingues, O. Sander, J. Weickert, and T. Lengauer Moment invariants as shape recognition technique for comparing protein binding sites Bioinformatics, December 1, 2007; 23(23): 3139 - 3146. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Nanatani, T. Fujiki, K. Kanou, M. Takeda-Shitaka, H. Umeyama, L. Ye, X. Wang, T. Nakajima, T. Uchida, P. C. Maloney, et al. Topology of AspT, the Aspartate:Alanine Antiporter of Tetragenococcus halophilus, Determined by Site-Directed Fluorescence Labeling J. Bacteriol., October 1, 2007; 189(19): 7089 - 7097. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-L. Chung, J. E. Beaver, E. D. Scheeff, and P. E. Bourne Con-Struct Map: a comparative contact map analysis tool Bioinformatics, September 15, 2007; 23(18): 2491 - 2492. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ritco-Vonsovici, A. Ababou, and M. Horton Molecular plasticity of beta-catenin: New insights from single-molecule measurements and MD simulation Protein Sci., September 1, 2007; 16(9): 1984 - 1998. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-I A. Wang, G. Guncar, J. K. Forwood, T. Teh, A.-M. Catanzariti, G. J. Lawrence, F. E. Loughlin, J. P. Mackay, H. J. Schirra, P. A. Anderson, et al. Crystal Structures of Flax Rust Avirulence Proteins AvrL567-A and -D Reveal Details of the Structural Basis for Flax Disease Resistance Specificity PLANT CELL, September 1, 2007; 19(9): 2898 - 2912. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Yeam, J. R. Cavatorta, D. R. Ripoll, B.-C. Kang, and M. M. Jahn Functional Dissection of Naturally Occurring Amino Acid Substitutions in eIF4E That Confers Recessive Potyvirus Resistance in Plants PLANT CELL, September 1, 2007; 19(9): 2913 - 2928. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Fasnacht, J. Zhu, and B. Honig Local quality assessment in homology models using statistical potentials and support vector machines Protein Sci., August 1, 2007; 16(8): 1557 - 1568. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. R. Dalton and R. M. Jackson An evaluation of automated homology modelling methods at low target template sequence similarity Bioinformatics, August 1, 2007; 23(15): 1901 - 1908. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Tung and J.-M. Yang fastSCOP: a fast web server for recognizing protein structural domains and SCOP superfamilies Nucleic Acids Res., July 13, 2007; 35(suppl_2): W438 - W443. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. K. Dalton, D. Ako-Adjei, P. S. Murray, D. Murray, and V. M. Vogt Electrostatic Interactions Drive Membrane Association of the Human Immunodeficiency Virus Type 1 Gag MA Domain J. Virol., June 15, 2007; 81(12): 6434 - 6445. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Zhang, W. Liu, J. Xiao, T. Hu, J. Chen, K. Chen, H. Jiang, and X. Shen Malonyl-CoA: acyl carrier protein transacylase from Helicobacter pylori: Crystal structure and its interaction with acyl carrier protein Protein Sci., June 1, 2007; 16(6): 1184 - 1192. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Gewehr, V. Hintermair, and R. Zimmer AutoSCOP: automated prediction of SCOP classifications using unique pattern-class mappings Bioinformatics, May 15, 2007; 23(10): 1203 - 1210. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Qiu, M. Hue, A. Ben-Hur, J.-P. Vert, and W. S. Noble A structural alignment kernel for protein structures Bioinformatics, May 1, 2007; 23(9): 1090 - 1098. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Papadopoulos and R. Agarwala COBALT: constraint-based alignment tool for multiple protein sequences Bioinformatics, May 1, 2007; 23(9): 1073 - 1079. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Sun, S. Zhao, H. Yu, G. Gao, and J. Luo ABCGrid: Application for Bioinformatics Computing Grid Bioinformatics, May 1, 2007; 23(9): 1175 - 1177. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Berndt, T. Kottke, H. Breitkreuz, R. Dvorsky, S. Hennig, M. Alexander, and E. Wolf A Novel Photoreaction Mechanism for the Circadian Blue Light Photoreceptor Drosophila Cryptochrome J. Biol. Chem., April 27, 2007; 282(17): 13011 - 13021. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Y. Niv, D. R. Ripoll, J. A. Vila, A. Liwo, E. S. Vanamee, A. K. Aggarwal, H. Weinstein, and H. A. Scheraga Topology of Type II REases revisited; structural classes and the common conserved core Nucleic Acids Res., April 1, 2007; 35(7): 2227 - 2237. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei and N. V. Grishin PROMALS: towards accurate multiple sequence alignments of distantly related proteins Bioinformatics, April 1, 2007; 23(7): 802 - 808. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ababou, M. Gautel, and M. Pfuhl Dissecting the N-terminal Myosin Binding Site of Human Cardiac Myosin-binding Protein C: STRUCTURE AND MYOSIN BINDING OF DOMAIN C2 J. Biol. Chem., March 23, 2007; 282(12): 9204 - 9215. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Fariselli, I. Rossi, E. Capriotti, and R. Casadio The WWWH of remote homolog detection: The state of the art Brief Bioinform, March 1, 2007; 8(2): 78 - 87. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Nieto, I. Cherny, S. K. Khoo, M. G. de Lacoba, W. T. Chan, C. C. Yeo, E. Gazit, and M. Espinosa The yefM-yoeB Toxin-Antitoxin Systems of Escherichia coli and Streptococcus pneumoniae: Functional and Structural Correlation J. Bacteriol., February 15, 2007; 189(4): 1266 - 1278. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Rangwala and G. Karypis Incremental window-based protein sequence alignment algorithms Bioinformatics, January 15, 2007; 23(2): e17 - e23. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Birzele, J. E. Gewehr, G. Csaba, and R. Zimmer Vorolign--fast structural alignment using Voronoi contacts Bioinformatics, January 15, 2007; 23(2): e205 - e211. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Carugo Statistical validation of the root-mean-square-distance, a measure of protein structural proximity Protein Eng. Des. Sel., January 12, 2007; (2007) gzl051v2. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Leslin, A. Abyzov, and V. A. Ilyin TOPOFIT-DB, a database of protein structural alignments based on the TOPOFIT method Nucleic Acids Res., January 12, 2007; 35(suppl_1): D317 - D321. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Aoyagi, D. Zhai, C. Jin, A. E. Aleshin, B. Stec, J. C. Reed, and R. C. Liddington Vaccinia virus N1L protein resembles a B cell lymphoma-2 (Bcl-2) family protein Protein Sci., January 1, 2007; 16(1): 118 - 124. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Lisewski and O. Lichtarge Rapid detection of similarity in protein structure and function through contact metric distances Nucleic Acids Res., December 2, 2006; 34(22): e152 - e152. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. P. Lewis, T. Jebara, and W. S. Noble Support vector machine learning from heterogeneous data: an empirical analysis using protein sequence and structure Bioinformatics, November 15, 2006; 22(22): 2753 - 2760. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Jeffries, S. C. Graham, P. H. Stokes, C. A. Collyer, J. M. Guss, and J. M. Matthews Stabilization of a binary protein complex by intein-mediated cyclization Protein Sci., November 1, 2006; 15(11): 2612 - 2618. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Larson, J. L. Jenkins, J. P. Schuermann, Y. Zhou, D. F. Becker, and J. J. Tanner Crystal structures of the DNA-binding domain of Escherichia coli proline utilization A flavoprotein and analysis of the role of Lys9 in DNA recognition Protein Sci., November 1, 2006; 15(11): 2630 - 2641. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Sadekar, J. Raymond, and R. E. Blankenship Conservation of Distantly Related Membrane Proteins: Photosynthetic Reaction Centers Share a Common Structural Core Mol. Biol. Evol., November 1, 2006; 23(11): 2001 - 2007. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Chivian and D. Baker Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection Nucleic Acids Res., October 18, 2006; 34(17): e112 - e112. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. H. C. Godoi, R. S. Galhardo, D. D. Luche, M.-A. Van Sluys, C. F. M. Menck, and G. Oliva Structure of the Thiazole Biosynthetic Enzyme THI1 from Arabidopsis thaliana J. Biol. Chem., October 13, 2006; 281(41): 30957 - 30966. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Neuwald Hypothesis: bacterial clamp loader ATPase activation through DNA-dependent repositioning of the catalytic base and of a trans-acting catalytic threonine Nucleic Acids Res., October 6, 2006; 34(18): 5280 - 5290. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Armougom, O. Poirot, S. Moretti, D. G. Higgins, P. Bucher, V. Keduas, and C. Notredame APDB: a web server to evaluate the accuracy of sequence alignments using structural information. Bioinformatics, October 1, 2006; 22(19): 2439 - 2440. [Abstract] [Full Text] [PDF] |
||||














