Skip Navigation

Protein Engineering Design and Selection 2006 19(12):571-578; doi:10.1093/protein/gzl046
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Nakahara, T.
Right arrow Articles by Nishimura, S.-I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nakahara, T.
Right arrow Articles by Nishimura, S.-I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oxfordjournals.org

Computational design and experimental evaluation of glycosyltransferase mutants: engineering of a blood type B galactosyltransferase with enhanced glucosyltransferase activity

Taku Nakahara1, Ole Hindsgaul2, Monica M. Palcic2 and Shin-Ichiro Nishimura1,3,4

1 Laboratory of Advanced Chemical Biology, Graduate School of Advanced Life Science, Frontier Research Center for the Post-Genome Science and Technology, Hokkaido University Sapporo 001-0021, Japan 2 Carlsberg Laboratory, Gamle Carlsberg Vej 10, DK-2500 Copenhagen Valby Denmark 3 Drug-Seeds Discovery Research Laboratory, National Institute of Advanced Industrial Science and Technology (AIST) Sapporo 062-8517, Japan.

4To whom correspondence should be addressed. E-mail: shin{at}glyco.sci.hokudai.ac.jp


    Abstract
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 Acknowledgements
 References
 
Glycosyltransferases are an enormous and diverse class of enzyme encompassing 1% of all sequenced genomes. They catalyze the transfer of a monosaccharide from an activated donor such as a sugar-nucleotide to an acceptor molecule. Though the primary sequences of glycosyltransferases have little homology, X-ray structural studies on glycosyltransferases have revealed that there are two main folds and that the orientation of the sugar donors with respect to the folds is highly conserved. It seems that glycosyltransferases have evolved diversified specificities toward donor sugars by changing the amino acids around the monosaccharide moiety without altering the orientation of the nucleotide moiety. In this study, we designed new glycosyltransferases with altered donor specificities by use of a novel empirical model called the Epimer Propensity Index (EPI). The EPI was constructed using 221 carbohydrate–protein complex structures in the Protein Data Bank with either galactose or glucose in the complex. The blood type B synthesizing glycosyltransferase GTB, a galactosyltransferase was our target enzyme. Two GTB mutants designed to exhibit enhanced glucosyltransferase activity were cloned, expressed and characterized experimentally. The predicted GTB mutants, Ser185Asn and Ser185Cys, exhibited 4.3- and 4.8-fold elevations in kcat/Km for UDP-Glc relative to that of wild-type enzyme.

Keywords: carbohydrate–protein complex/computational design/glycosyltransferase


    Introduction
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 Acknowledgements
 References
 
Sugar chains by themselves contain no genetic information. They are synthesized, recognized and degraded by proteins. Various functions of sugar chains have emerged through evolution. Understanding the evolution of the proteins that interact with sugar chains would be very helpful in understanding glycobiological evolution as well as for the engineering of synthetically useful enzymes.

Glycosyltransferases are enzymes that transfer a monosaccharide residue from a sugar donor, such as a sugar-nucleotide, to an acceptor molecule. Glycosyltransferases have been classified into 79 distinct families based on their amino acid sequences (CAZy, January 2006 release; Coutinho et al., 2003Go). This classification utilizes BLAST (Altschul et al., 1997Go) so that homologous sequence pairs with a BLAST e-value better than 10–5 are grouped together. Despite the numerous classes based on amino acid sequences, the three-dimensional crystal structures of all glycosyltransferases solved to date have only two folds termed the GT-A fold and GT-B fold (Bourne and Henrissat, 2001Go; Davies et al., 2005Go). In all GT-A fold glycosyltransferases, a DXD motif anchors the pyrophosphate moiety of the sugar-nucleotide donor via a divalent cation such that the location of the sugar donor on the fold is conserved [sialyltransferases are an exception because they utilize CMP-sialic acid, which has monophosphate moiety instead of pyrophosphate moiety (Chiu et al., 2004Go)]. This suggests that glycosyltransferases evolved to recognize various donors by changing amino acids around the sugar moiety of the donor, without changing the major mode of interaction with the nucleotide moiety. This idea is supported by the human blood type glycosyltransferases. The human genome codes for two types of blood group glycosyltransferase genes A and B. Blood type A glycosyltransferase (GTA) is an N-acetylgalactosaminyltransferase which transfers N-acetylgalactosamine from a UDP-GalNAc donor to the disaccharide H-antigen to synthesize the A-antigen. Blood type B glycosyltransferase (GTB) is a galactosyltransferase that transfers a galactose from UDP-Gal to the same H-antigen to synthesize the B-antigen. Blood type O is a consequence of a nucleotide deletion or substitution yielding inactive enzyme. There are only four amino acid differences between GTA and GTB, and only two of these (Leu/Met-266 and Gly/Ala-268 of GTA/GTB) are critical for the donor specificity (Seto et al., 1997Go, 1999Go). There are also several reports of donor specificity engineering of glycosyltransferases accomplished by changing a few amino acids (Ouzzine et al., 2002Go; Kubo et al., 2004Go; Ramakrishnan et al., 2005Go). In these examples of donor specificity engineering, the amino acids targeted for mutagenesis were selected on the basis of at least partial structural information on the enzyme. The ability to quantitatively predict the donor specificity of the mutated enzyme in silico would greatly facilitate this process.

In this study, we aim to establish a protein design method to engineer the donor specificities of glycosyltransferases by mimicking the evolutionary process of glycosyltransferases. The primary focus is on the orientation (configuration) of the hydroxyl groups on the sugar being transferred. For example, OH-4' of D-glucose in the 4C1 chair is directed in the equatorial orientation. For D-galactose, the 4C1 chair conformer is same as the D-glucose 4C1 chair conformer except that OH-4' is in the axial orientation. The purpose of this study was to develop an index to distinguish between a protein + equatorial-OH sugar complex and a protein + axial-OH sugar complex, and to apply this index to the design of novel glycosyltransferases.

There is abundant information about protein–sugar interactions in the Protein Data Bank (PDB; Berman et al., 2000Go) where many sugar structures are co-crystallized with proteins. In a previous study, all of the sugar structures in the PDB were extracted by our original program getCARBO in order to compile a database of these structures (Glycoconjugate Data Bank:Structures available at http://glycoconjugate.jp/structures.html and http://daisy2.nagahama-i-bio.ac.jp/structures). There were 23 608 sugar residues in the PDB (September 2005 release, 32 658 entries). Details on the program and the database will be reported elsewhere. In this study, we utilize this database to establish an empirical index to distinguish between protein + equatorial-OH sugar complexes and protein + axial-OH sugar complexes. We name this index the Epimer Propensity Index (EPI). EPIs were constructed using protein–[sugar bearing equatorial-OH group] complexes and protein–[sugar bearing axial-OH group] complexes, separately. Differences between EPIequatorial and EPIaxial ({Delta}EPI) can be used as an index to predict whether the OH group of a putative protein–sugar complex would prefer to be equatorial or axial.

Here, we develop a {Delta}EPI field for the 4'-OH group of pyranoses in the 4C1 chair conformer. As a first test case, the {Delta}EPI field was applied to wild-type ß-1,4-galactosyltransferase I (ß4GalT-I) and its Arg228Lys mutant which exhibits enhanced glucosyltransferase activity (Ramakrishnan et al., 2005Go). We then applied this method to GTB to predict mutants with enhanced glucosyltransferase activity. Based on our prediction, we experimentally examined whether these mutants transferred glucose more effectively than the wild-type enzyme.


    Materials and methods
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 Acknowledgements
 References
 
Dataset preparation

Extraction of sugar structures from the PDB. A scheme for dataset preparation is depicted in Figure 1. The structures of sugars were extracted from the PDB using the program, ‘getCARBO’. Details on this program will be reported elsewhere (TN, in preparation). The full dataset of sugar structures in the PDB can be seen in the Glycoconjugate Data Bank:structures. The PDB released in September 2005 that contains 32 658 entries was used. The nomenclature of the sugar structures in the PDB followed IUPAC recommendations (IUPAC Nomenclature of carbohydrates, 1996Go).


Figure 1
View larger version (35K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. Dataset preparation for {Delta}EPI field construction. The status of the dataset appears in boxes. Processes used for dataset selection are enclosed in ovals. Dashed boxes indicate the dataset use. ‘getCARBO’ is a program developed in-house for extraction of sugar structures from PDB files. Details of ‘Non-redundant dataset preparation’ are explained in the Materials and methods section. A total of 525 non-redundant sugar residue dataset (representative sugar residues) were selected from 23 608 sugar residues in the PDB. For {Delta}EPI field evaluation, 221 representative sugar residues with more than one protein atom within 4Å of the C4 atom were prepared.

 
Preparation of a non-redundant sugar residue set. All of the retrieved sugar structures were clustered according to protein sequence, sugar residue structure and sugar location in the protein structure. In this study, only PDB entries with resolutions better than 2.0Å were used. Protein sequences of the PDB entries that contain sugar residue(s) were clustered by using blastclust (Altschul et al., 1997Go). The threshold of this clustering was 60% identity. In each protein sequence cluster, all of the combinations of sequence pairs were aligned by using bl2seq (Altschul et al., 1997Go). Based on the alignments, pairs of proteins were superimposed on the corresponding C{alpha} atoms. If two sugar residues with the same IUPAC name had the center-of-mass of each sugar residue located within 4Å in the superimposed coordinates, then these two sugars were clustered into the same group. The sugar residue with the best resolution within a sugar residue cluster was selected to represent that cluster. These representative sugar residues were used as the non-redundant sugar residue set.

{Delta}EPI field construction

Weighting atom types. Protein atom distributions around the OH group of sugar molecules were assigned to grid points in three-dimensional Cartesian coordinate space. In this study, only the 4'-OH group of 4C1 chair conformers were examined, however, this methodology can be extended to other OH-groups. The dataset for {Delta}EPI field construction consisted of representative sugar residues in the 4C1 chair conformation with their 4'-OH group free (i.e. non-glycosylated). The grid points were defined every 0.2 Å along the three axes and spread within 4Å from the C4 atom of the sugar ring. A given protein–sugar complex was superimposed on the reference system according to three atoms: C3, C4 and C5 (Figure 2A). The C4 atom of the ring was set to the origin, the C3 atom was set along the x-axis, and the C5 atom lay on the xy-plane. Protein atoms were classified into four types as follows: hydrogen bond acceptor, hydrogen bond donor, both acceptor and donor, and neutral (Jiang et al., 2002Go). Each atom type contains the following protein atoms (atom nomenclature used here follows that used in the PDB); acceptor of hydrogen bond: backbone O, OD of ASN, OE of GLN, carboxyl oxygen in ASP and GLU; donor of hydrogen bond: NE of TRP, ND of ASN, NE of GLN, and N of LYS and ARG; both acceptor and donor: hydroxyl oxygen, ND and NE of HIS, carboxyl oxygen in C-terminal, and oxygen of water molecule; neutral: carbon and sulfur atoms and N of PRO. The weights of each atom type were assigned on the grid points according to the position of the protein atoms on the reference system. The weight of a protein atom assigned on a grid point follows a Gaussian curve:

Formula 1(1)
where wp(a,r) is the weight of protein atom type a assigned on a grid point p, and r is a distance (Å) between a and p. The term rcut defines a reach of weighting. We used 2.0Å for rcut. We only assigned exp{–(r/rcut)} > 0.01 (weighting cut-off).


Figure 2
View larger version (21K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 2. (A) Reference system of {Delta}EPI field. All the representative protein–sugar complexes were superimposed on this reference system. The C4 atom of the ring was set to the origin, the C3 atom was set along the x-axis, and the C5 atom lay on the xy-plane. The shaded area indicates the range of grid point assignment, within 4Å from the C4 atom. (B) Protein atom type distributions around equatorial 4'-OH and axial-4'OH groups. Atom type distribution of equatorial-4'OH sugar complexes (left) and axial-4'OH sugar complexes (right) are depicted. All the sugar residues used were in the 4C1 conformation. Spheres are protein atoms within 4Å from the C4 atom. Sphere colors for hydrogen bond acceptor, donor, both and neutral were red, blue, green and yellow, respectively. This figure was created using PyMOL (DeLano, 2002Go).

 
Construction of EPI field. Weights on the grid points were normalized by dividing by the summation of all the weights assigned on all the grid points so that the total weight assigned on the grid space is 1. Here, we define W(a,p) as a summation of a on p. Probability of a on grid point p is defined as

Formula 2 (2)

Then, we define an empirical potential, named EPI:

Formula 3(3)
where f(a, p) is the normalized weight of atom type a assigned on grid point p, and C is a term of the partition function. The EPI field of protein–sugar that bears equatorial 4'-OH group complexes and that of protein–sugar which bears axial 4'-OH group complexes were constructed separately (Figure 2B). We prepared the {Delta}EPI field by subtracting EPIaxial from EPIequatorial:

Formula 4(4)

In the above equation, we assume C of EPIequatorial and that of EPIaxial are equal. {Delta}EPI score of a given protein–sugar complex is calculated by

Formula 5(5)
where N is the total number of protein atoms in the given complex, P is the number of grid point, {Delta}EPIi,p is {Delta}EPI value of protein atom i at grid point p, and r is a distance between i and p. A positive value of the {Delta}EPI score indicates a OH group of focused position of a sugar complexed with a protein tends to be equatorial, and a negative value of the {Delta}EPI score indicates a OH group of that tends to be axial.

Evaluation of {Delta}EPI field. A new dataset was prepared for evaluation of the constructed {Delta}EPI field. The new dataset contained the representative sugar residues which have more than one protein atom within 4Å of the C4 atom. Separation of protein + equatorial 4'-OH sugar complex and protein + axial 4'-OH sugar complex by {Delta}EPI was quantified by a square-root correlation ratio ({eta}):

Formula 6 (6)
where Nall is the total number of protein–sugar complexes, Nequatorial is the total number of protein + equatorial 4'-OH sugar complexes, Naxial is protein + axial 4'-OH sugar complexes, µall is average {Delta}EPI score of the all protein–sugar complexes, µequatorial is average {Delta}EPI score of the all protein + equatorial 4'-OH sugar complexes, µaxial is average {Delta}EPI score of the all protein + axial 4'-OH sugar complexes, and {sigma}all is standard deviation of all protein–sugar complexes. {eta} ranges between 0 and 1 and higher values of {eta} indicate better discrimination between equatorial/axial by the {Delta}EPI field. For cross-validation, the dataset for {Delta}EPI evaluation was randomly divided into two groups (group 1 and group 2). {Delta}EPI field constructed by using group 1 and {Delta}EPI field constructed by group 2 were prepared, and cross-validations (using group 1 as samples evaluated by {Delta}EPI field of group 2 and vice versa) were also done.

{Delta}EPI evaluation of ß4GalT-I. The structures used to evaluate {Delta}EPI values of ß4GalT-I were 1O0R (wild-type; Ramakrishnan et al., 2002Go) and 1YRO (Arg228Lys, glucosyltransferase enhanced mutant; Ramakrishnan et al., 2005Go). 1O0R is the entry for wild-type ß4GalT-I complexed with UDP-Gal. 1YRO is the entry for the Arg228Lys mutant complexed with UDP-Gal and {alpha}-lactalbumin. The resolution of 1O0R is 2.30Å and that of 1YRO is 1.90Å. Calculations of the {Delta}EPI score were done with all atoms in these PDB entries.

Molecular modeling of blood type B glycosyltransferase and its mutants

Modeling of the missing loop and donor sugar of wild-type GTB. The missing loop of GTB (Glu174-Arg198) and the missing sugar moiety of the donor sugar were modeled using the molecular modeling platform MOE (Chemical Computing Group Inc.) with the MMFF94s force field (Halgren, 1996Go). The main template for the GTB model was the crystal structure of the blood group-B glycosyltransferase (PDB ID: 1LZJ [PDB] ; Patenaude et al., 2002Go). The missing loop was complemented by the bovine {alpha}1,3-galactosyltransferase complex with UDP-Gal (PDB ID: 1G93 [PDB] ; Gastinel et al., 2001Go). The missing loop on 1LZJ (Glu174-Arg198) was built with the homology modeling program in MOE using Lys188-Ala212 of 1G93 as the template for the missing loop (Figure 4). The sugar moiety of the donor was modeled in according to the position of the sugar moiety of the donor in 1G93. The 4'-OH group was inverted to equatorial to build the wild-type GTB + UDP-Glc complex.

Modeling of mutant GTBs. Molecular modeling of GTB mutants was carried out using a function implemented in MOE. Only the side chain at position 185 of the wild-type GTB + UDP-Glc was changed to other amino acids except proline. In addition to the first modeled mutant structure, the same mutant with the {chi} angle of the modeled residue rotated by +120° and –120° were prepared. Each model was optimized by molecular mechanics using the force field MMFF94s. All the atoms remained fixed except those of the sugar moiety, amino acids on Glu174-Arg198 and Asp302, and all the hydrogen atoms in the system.

Enzymatic characterization of GTB mutants

Preparation of mutated GTB plasmids. A wild-type GTB plasmid that codes for the soluble catalytic domain (amino acids 63-354) was used as a template for mutagenesis. The Ser185Asn mutant construct was prepared by two-fragment site-directed mutagenesis. The PCR primer for the 5' terminal was 5'-ATATGAATTCATGGTTTCCCTGCCGCGTATGGTTTACCCGCAGCCGAA-3', named MIN2, having an EcoRI restriction site. The PCR primer for the 3' terminal was 5'-ATAATTAAGCTTCTATCACGGGTTACGAACAGCCTGGTGGTTTTT-3', named PCR-3B, having a HindIII restriction site. The internal mutagenesis primers for Ser185Asn were the following: forward primer, 5'-GTTGGCAGGACGTTAACATGCGTCGTATGGAAATGATCAGCGAC-3'; reverse primer, 5'-CATACGACGCATGTTAACGTCCTGCCAACGTTTGTAGGCGCCCAC-3'. The internal mutagenesis primers for Ser185Asp were the following: forward primer, 5'-GTTGGCAGGACGTTGACATGCGTCGTATGGAAATGATCAGCGAC-3'; reverse primer, 5'-CATACGACGCATGTCAACGTCCTGCCAACGTTTGTAGGCGCCCAC-3'. The internal mutagenesis primers for Ser185Glu were the following: forward primer, 5'-GTTGGCAGGACGTTGAGATGCGTCGTATGGAAATGATCAGCGAC-3'; reverse primer, 5'-CATACGACGCATCTCAACGTCCTGCCAACGTTTGTAGGCGCCCAC-3'.The mutated fragments were inserted into a pCW{Delta}lac vector (gift of W. Wakarchuk). The Ser185Cys mutant construct was prepared by using the QuikChange site-directed mutagenesis kit (Stratagene). 5'- and 3'-terminal primers were MIN2 and PCR-3B. The internal mutagenesis primers were the following: forward primer, 5'-GTTGGCAGGACGTTTGCATGCGTCGTATGGAA-3'; reverse primer, 5'-TTCCATACGACGCATGCAAACGTCCTGCCAAC-3'. The prepared mutated plasmids were transformed into Escherichia coli BL21-Gold competent cells (Stratagene).

Isolation and enzymatic characterization of wild-type and mutant GTBs. Recombinant GTBs were expressed in E.coli BL21-Gold cells and purified by sequential chromatography on SP-Sepharose Fast Flow (cation-exchange chromatography) and UDP-hexanolamine-agarose (affinity chromatography) (Seto et al., 2000Go; Marcus et al., 2003Go). Storage buffer for the purified enzymes contained 50 mM MOPS buffer (pH 7.0), 0.5 M NaCl, 1 mM dithiothreitol, 5 mM MnCl2 and 10 mM mercaptoethanol. Protein concentration was estimated with the Bradford method (Bio-Rad) using bovine gamma globulin as a protein standard. Specific activities were determined in a standard assay with 495 µM Fuc{alpha}1-2Galß-O(CH2)7CH3 as an acceptor and 266 µM of either UDP-Gal or UDP-Glc donor as reported previously (Palcic et al., 1988Go; Seto et al., 1997Go). Tritium-labeled donors were used. For kinetic characterization, seven reaction mixtures which contained different concentrations of donor, 500 µM acceptor and the enzyme were incubated at 37°C for an appropriate time (depending on the activity of the enzyme). The assay buffer contained 50 mM MOPS buffer (pH 7.0), 20 mM MnCl2 and 1 mg/ml bovine serum albumin. The reactions were stopped by dilution with water. Radioactive products of the enzyme reaction were isolated by reverse-phase extraction (Sep-Pak plus C-18 cartridge, Millipore-Waters), and quantified by counting on a Beckman LS 6500 counter after the addition of 10 ml EcoLite+ liquid scintillation cocktail. The kinetic parameters kcat and Km were calculated by non-linear regression analysis of the Michaelis–Menten equation using a PRISM program (GraphPad Software).


    Results
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 Acknowledgements
 References
 
Dataset preparation

To construct an empirical potential, a non-redundant sugar residue set was prepared (Figure 1). This was based on structural information for sugar residues in the PDB that was retrieved using the getCARBO program we have developed. The program getCARBO extracts sugar residue structures from a PDB format file. It detects and annotates sugar residue structures by using coordinates and element type of atoms in a PDB file. Of the 32 658 PDB entries that were processed by getCARBO, 5673 of the entries contained 23 608 sugar residues in 15 868 glycan structures. Only ligand sugar residues were used because we focused on recognition of sugars by proteins; the set contained 16 101 residues in 11 776 glycans.

To construct a reliable empirical potential, only PDB entries with resolutions better than 2Å were used. Only 2457 of 5673 PDB entries had the required resolution. A non-redundant sugar residue set was prepared by using protein sequence similarity, IUPAC nomenclature of sugar structures and sugar residue locations on homologous proteins. Initially, sequences of proteins that form complexes with sugar residue(s) were clustered using blastclust (Altschul et al., 1997Go). Sequences that showed more than 60% identity were clustered. A total of 3311 protein sequences of 2457 PDB entries were clustered into 843 sequence clusters. All the combinations of protein sequence pairs in a sequence cluster were aligned using bl2seq (Altschul et al., 1997Go). All the protein sequence pairs were superimposed on the corresponding C{alpha} atoms in the alignment. In superimposed structures, sugar residues with center-of-masses that were within 4.0Å were clustered. The sugar residue with the best resolution in a cluster was selected as a representative sugar residue for that cluster. After these processes, 2629 representative sugar residues were selected.

In this work, we focused on the 4'-OH group of the 4C1 chair conformer. There were only 525 representative sugar residues that satisfied the following conditions: (i) 4C1 chair conformer, (ii) 4'-OH group was not glycosidically bonded. These included 410 O4'-equatorial sugar residues and 115 O4'-axial sugar residues. These 525 sugar residues were used to construct the {Delta}EPI field.

To evaluate the {Delta}EPI field, only 221 of 525 sugar residues that had more than one protein atom within 4Å from the C4 atom were used (See supplementary material for the list of the 221 structures). We excluded 304 sugar residues because these had only one or no protein atom within 4Å of the C4 atom. {Delta}EPI scores of these sugar residues were very small. A substantial number of these barely recognized 4'-OH groups were not suitable for evaluation of discrimination ability of {Delta}EPI ({eta}). These 221 sugar residues contained 154 O4'-equatorial sugar residues and 67 O4'-axial sugar residues.

Construction and evaluation of {Delta}EPI

An empirical potential field, the {Delta}EPI field of 4'-OH of 4C1 chair conformers, was constructed as described in Materials and methods. A total of 525 representative sugar residues and protein atoms around these sugar residues were superimposed on the reference system depicted in Figure 2A. The protein atoms were classified into four groups based on their hydrogen bonding properties; donor, acceptor, both donor and acceptor, and neutral. The weights of protein atoms were assigned on grid points using a Gaussian curve function (Equation 1). All of the weights were normalized and converted to {Delta}EPI fields by Equations 3 and 4. The {Delta}EPI score of a given protein–sugar complex was calculated using Equation 5.

The {Delta}EPI field was evaluated by using a squared-root correlation ratio ({eta}, Equation 6) of protein + equatorial-4'-OH sugar complexes and protein + axial-4'-OH sugar complexes. {eta} is an index that expresses the degree of separation between protein + equatorial 4'-OH sugar complexes and protein + axial 4'-OH sugar complexes. Self-evaluation and cross-validations were done. For cross-validations, the evaluation dataset (221 sugar residues) was divided into two groups and {Delta}EPI fields for each group were constructed. In self-evaluations, {eta} was 0.716 (Figure 3). In cross-validations, {eta} of group 2 samples evaluated by group 1 {Delta}EPI field was 0.693, and {eta} of group 1 samples evaluated by group 2 {Delta}EPI field was 0.629.


Figure 3
View larger version (16K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 3. {Delta}EPI score distributions of protein + equatorial 4'OH sugar complexes and protein + axial 4'OH sugar complexes. The horizontal axis indicates the {Delta}EPI score of each sugar–protein complex. The summation of each sugar complexes were normalized to 1, and the normalized ratio of each sugar complex are indicated by bars (blue bars for the protein + equatorial 4'OH sugar complexes and red bars for the protein + axial 4'OH sugar complexes). The dataset for this evaluation consists of 221 representative sugar residues that includes 154 protein + equatorial 4'OH sugar complexes and 67 protein + axial 4'OH sugar complexes. The discrimination ability of the {Delta}EPI field was scaled by the square-root correlation ratio ({eta}, Equation 6). The {eta} value of self-evaluation was 0.716 and {eta}s of cross-validation were 0.691 and 0.629.

 
{Delta}EPI evaluation of ß4GalT-I and its mutant

ß4GalT-I is a glycosyltransferase that transfers galactose from UDP-Gal to OH-4 of N-acetylglucosamine. A mutant of this enzyme (Arg228Lys) is known to have enhanced glucosyltransferase activity. Both wild-type (PDB ID: 1O0R) and mutant (1YRO) crystal structures had been solved.

Calculation of the {Delta}EPI score was done using all atoms in a PDB file including water molecules. The {Delta}EPI score of the wild-type enzyme was –2163, and that of the Arg228Lys mutant was –536 (Table I). The glucosyltransferase-enhanced mutant had a {Delta}EPI score 1627 points higher than the wild-type ß4GalT-I enzyme.


View this table:
[in this window]
[in a new window]

 
Table I. Values of Epimer Propensity Indexes and kinetic parameters for GTB, ß4GalT-I and their mutants

 
Molecular modeling and {Delta}EPI evaluation for GTB and its mutants

In the structure of GTB deposited in the PDB (PDB ID: 1LZJ [PDB] ), there is no electron density for a flexible loop (Glu174-Arg198, Figure 4). We therefore modeled in these residues using homology modeling. The galactose moiety of the sugar donor was also added. The structure of a homologous galactosyltransferase, {alpha}1,3-galactosyltransferase (PDB ID:1G93) was used as a template structure for the homology modeling. This latter structure was solved complexed with UDP-Gal. Amino acids which interact with the galactose moiety of the donor on the modeled fragment of {alpha}1,3-galactosyltransferase are completely conserved between GTB and {alpha}1,3-galactosyltransferase. The direction of the 4'-OH group was inverted to equatorial and the entire modeled structure was optimized using molecular mechanics with the MMFF94s force field.


Figure 4
View larger version (22K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 4. An amino acid sequence alignment of the modeled GTB and the template bovine {alpha}1,3-galactosyltransferase. The alignment was generated by clustalw (Thompson et al., 1994Go). Arrowheads indicate amino acids interacting with the galactose moiety of the donor sugar in the structure of bovine {alpha}1,3-galactosyltransferase (1G93). The red box indicates modeled fragments build by homology modeling.

 
Ser185 was selected as a mutagenesis target because this amino acid is located the closest to the 4'-OH position of the donor sugar. We therefore modeled GTB mutants in which Ser185 was transformed to 18 different amino acids. We did not mutate to proline because proline mutation often causes substantial rearrangement of nearby amino acids. Three model structures were prepared for each mutant structure whose {chi} angle of the mutated side chains were different (0°, +120° and –120°), except alanine and glycine. These initial models were relaxed and the most energetically stable structure was selected as a mutant model.

Mutated GTB is required to (i) exhibit enhanced glucosyltransferase activity, and (ii) be energetically stable. {Delta}EPI values and potential energies of each mutant model were calculated (supplementary data), and only aspartic acid, glutamic acid and aspargine met these criteria. The {Delta}EPI score of the wild-type was –260, and that of the Ser185Asn, Ser185Asp, Ser185Glu mutant were 201, 267, 940, respectively.

Characterization of glucosyltransferase- activity-enhanced GTB

We prepared expression constructs for the selected mutants (Ser185Asn, Ser185Asp and Ser185Glu) and expressed them in E.coli. The Ser185Cys mutant was also expressed because it had been cloned previously for another purpose. According to the modeled structure, the {Delta}EPI score of the Ser185Cys mutant was –189, a slightly positive shift from that of wild-type enzyme. The potential energy of this mutant was 17.0 kcal/mol higher than that of wild-type GTB.

Wild-type and mutated GTB were expressed and purified. Enzymatic activity was initially quantified by measuring the transfer of monosaccharide from radiolabeled donor substrate in a standard assay. The Ser185Asp and Ser185Glu mutants had very low activity with UDP-Gal, 4.9 mU/mg and 1.9 mU/mg, respectively, which are 0.09 and 0.04% of the specific activity of wild-type GTB (5.2 U/mg). The ratio of the mutants specific activities for UDP-Glc/UDP-Gal were 0.27 and 0.55 compared to 0.007 for wild-type GTB. While they both exhibit a great enhancement in Glc transfer, their absolute activities were so low that the mutants were not characterized further. The Ser185Asn and Ser185Cys mutants had considerably higher specific activities with UDP-Gal, therefore, kinetic parameters were determined using seven different concentrations of donor with less than 10% substrate consumption in the assay. The kinetic parameters for wild-type and mutant enzymes using UDP-Gal and UDP-Glc as donors are summarized in Table I.

Wild-type GTB had a kcat of 0.00053 s–1 with UDP-Glc donor and a Km of 188 µM. This is only 0.01% of the kcat of UDP-Gal (5.2 s–1) and 0.0028% of kcat/Km. The Ser185Asn mutant had a kcat of 0.0014 s–1 for UDP-Glc which is 2.7 times higher than that of wild-type enzyme and a Km of 120 µM. The kcat for UDP-Gal was reduced to 0.52 s–1, which is 10 times lower than that of wild-type enzyme. The ratio of glucosyltransferase/galactosyltransferase activity was 27 times greater than wild-type enzyme based on kcat and 135 times great based on kcat/Km.

The Ser185Cys mutant exhibited 3.8 times greater kcat with UDP-Glc (0.0020 s–1) compared to wild-type enzyme while the kcat for UDP-Gal was 57 times lower than wild-type enzyme. The ratio of glucosyltransferase/galactosyltransferase activity increased 219-fold in comparison with the wild-type activity based on kcat and 168-fold based on kcat/Km.


    Discussion
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 Acknowledgements
 References
 
In this study, we introduce a new index, {Delta}EPI, for discrimination between protein–sugar complexes that bear equatorial-4'-OH and axial-4'-OH groups, and our experimental results showed it can predict donor specificities for ß4GalT-I and GTB.

To construct {Delta}EPI, we selected a non-redundant sugar residue set from all of the sugar structures in the PDB. A total of 525 of 23 608 sugar residues were selected as the representative sugar residue dataset (Figure 1). The atom distributions are shown in Figure 2B. Hydrogen bonding atoms (red/blue/green spheres in Figure 2) distributed around the hydroxyl group of each form, and neutral atoms (yellow spheres) distributed on the opposite side of the hydroxyl group. High density of neutral atoms around the B face of axial-4'-OH sugars is well known (Sujatha et al., 2004Go), and it was reproduced in this study. Interestingly, a high density of neutral atoms was also found on the A face of equatorial-4'-OH sugars.

The quality of carbohydrate structure information in the PDB is controversial. Lütteke et al. have pointed out the existence of many erroneous carbohydrate structures in the PDB and service a validation program at their website GLYCOSCIENCES.de (Lütteke and von der Lieth, 2004Go; Lütteke et al., 2004Go). In this study, we extracted carbohydrate structures with getCARBO, a program that extracts carbohydrate structures in a PDB format file. getCARBO extracts carbohydrate structures based on atom types and the coordinates of a molecule. Because of this, the carbohydrate residues used for EPI field construction do not contain errors from the viewpoint of their atom types and coordinates.

Using this dataset, the {Delta}EPI field was constructed and this interaction field could distinguish epimer propensity in self-evaluation (Figure 3, {eta} = 0.716). In cross-validations, {eta} was 0.691 when group 1 {Delta}EPI field evaluates group 2 samples and 0.629 when group 2 {Delta}EPI field evaluates group 1 samples. Because the {eta}s values were reasonably high and cross-validations showed about the same high {eta} points, this {Delta}EPI field was considered to give a good index to discriminate the epimer propensity for the protein–sugar complexes existing in the PDB.

Calculation of the {Delta}EPI score was applied to two specific glycosyltransferases, ß4GalT-I and GTB. In the case of ß4GalT-I, the wild-type structure complexed with UDP-Gal, and the Arg228Lys mutant structure complexed with UDP-Gal, were known. The Arg228Lys mutant had been shown to transfer glucose more efficiently than galactose. By mutating the arginine to lysine, the side chain conformation of Glu317, which is proximal to the lysine, was changed. In this altered conformation, the glutamic acid became a steric hindrance to OH4' of the galactose moiety of the donor. Additionally, the mutation created a space which is suitable for binding to equatorial O4 of glucose (Ramakrishnan et al., 2005Go). The {Delta}EPI score of the wild-type ß4GalT-I was –2163 and that of Arg228Lys mutant was –536 (Table I). The {Delta}EPI score increased by 1626 points in the Arg228Lys mutant, and the transfer of UDP-Glc was increased 2.8 times based on kcat and 4.5 times based on kcat/Km in the Arg228Lys mutant (Table I). The {Delta}EPI score therefore correctly reflected the enhanced glucosyltransfease activity. The {Delta}EPI score for the wild-type reflects the position of guanidine group of Arg288 which is favorable for galactose binding, and the positive shift of {Delta}EPI score by mutating this residue is a consequence of an absence of the guanidine group.

In the case of GTB, we had to model the GTB + UDP-Gal complex because there were missing residues in the structure deposited in the PDB. The missing internal loop that interacts with the sugar moiety of the donor was modeled using homology modeling. The structure of bovine {alpha}1,3-galactosyltrasferase was used as the template for the missing loop on GTB since the amino acids which interact with the donor sugar are completely conserved between GTB and {alpha}1,3-galactosyltrasferase on the modeled fragment (Figure 4). The location of the missing sugar moiety of the donor sugar was modeled in according to the location of the sugar moiety commonly seen in other GT-A fold glycosyltransferase structures. The manner of donor sugar binding in the GT-A fold glycosyltransferases is highly conserved. The nucleotide pyrophosphate moiety is anchored to the conserved DXD motif via a divalent cation and the sugar moiety is bent backwards towards the base moiety.

Homology modeling of the GTB structure complexed with UDP-Gal has been previously carried out by Heissigerova et al. In our model GTB was complexed with UDP-Glc and we retained water molecules in the template structure. Interactions between the sugar moiety of the donor and protein amino acids are slightly different in the models. In Heissigerova et al., Ser185 is hydrogen bonded to O6' of the donor sugar, while in our model, Ser185 is closer to O4' than O6'. This difference is attributed to the water molecule in our model that is sandwiched between Arg188 and Asp302 (Figure 5). This water molecule is also seen in the almost same position on the structure of the {alpha}1,3-galactosyltransferase (1G93, HOH 84). And Ser199 on 1G93 is closer to O4' of donor sugar than O6', as reproduced in our model.


Figure 5
View larger version (20K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 5. Stereo view of the donor sugar and its interactions with amino acids in the model. The magenta sphere is the manganese ion. Hydrogen bonds between donor sugar and amino acids are shown as dashed lines.

 
We selected Ser185 as a target of mutagenesis and designed mutant structures in silico. Then, we calculated potential energies and {Delta}EPI values for them. Ser185Asn, Ser185Asp and Ser185Glu were selected as promising mutants. Enzymatic characterizations of these promising mutants and Ser185Cys mutant, which had already been constructed for another purpose, were carried out. Shifts of {Delta}EPI value of Ser185Asn, Ser185Cys, Ser185Asp, Ser185Glu mutagenesis were 461, 71, 527, and 1200, respectively.

Experimental characterization of these mutants revealed that Ser185Asp and Ser185Glu had very low activity; therefore, we did not determine their kinetic parameters. However, the ratios of specific activities for UDP-Glc/UDP-Gal enhancement were 39 times and 79 times for Ser185Asp and Ser185Glu, respectively, which correlate to those of {Delta}EPI. Further characterizations were carried out only for Ser185Asn and Ser185Cys. Experimental determination of the kinetic parameters of the predicted mutants revealed that kcat of the Ser185Asn mutant with UDP-Glc was 2.7 times higher than that of wild-type enzyme. The positive value of the {Delta}EPI score for this enzyme correctly predicted this glucosyltransferase enhancement. In the model of this mutant, the N{sigma} atom of the Asn185 side chain was located favorably for protein–equatorial 4'OH interaction (Figure 6). We speculate that this is the reason for the high positive {Delta}EPI score and for the high glucosyltransferase activity of this mutant. In the model of the Ser185Cys mutant, the {chi} angle of the Cys side chain was different from that of the wild-type Ser side chain. This creates space for the equatorial-4'OH group of UDP-Glc and leads to a rearrangement of the orientation between the donor sugar and surrounding amino acids. It results in slight positive-shift of the {Delta}EPI score and may be the explanation for the 3.8 times higher activity with UDP-Glc.


Figure 6
View larger version (25K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 6. Interactions between the donor sugar moiety and GTB mutant models. Molecular surfaces of the amino acids Ser/Asn/Cys185, Arg188 and Asp302 are shown. The space for O4' in the Ser185 wild-type enzyme is very small. The Ser185Asn mutant model has space for O4' of the sugar moiety and the N{sigma} atom of Asn185 is located in a position suitable for hydrogen bonding with O4'. The Ser185Cys mutant model also has space for O4' of the sugar moiety.

 
The aim of this work was to develop a method to design glycosyltransferases that exhibits novel donor specificity. Protein–sugar complexes existing in the PDB and, as a test case, ß4GalT-I and its mutant, were favorably evaluated by the {Delta}EPI field presented in this work. In the case of GTB, two mutants (Ser185Asn and Ser185Cys), that were computationally predicted to have enhanced glucosyltransferase activities, were experimentally revealed to exhibit the predicted activities. However, the degree of positive shift of {Delta}EPI was not consistent with the experimental results. The enhancement of glucosyltransferase activity was larger in the Ser185Cys mutant despite the fact that the positive shift of the {Delta}EPI score was larger in the Ser185Asn mutant. Because the missing sugar moiety of the donor sugar and the amino acids around that sugar moiety were built by molecular modeling, the structure used in this work may differ somewhat from the true structure contributing to the poor correlation. A better template structure containing the missing loop structure or more advanced side chain modeling may help to improve the quantitation. Even though there still remains room for improvement of the {Delta}EPI evaluation, the predicted GTB mutants in this work did exhibit improved glucosyltransferase activities. Improvement of glucosyltransferase activity of the GTB mutants compared favorably with that for the ß4GalT-I mutant (Table I).

Several empirical approaches for protein–ligand interaction have been reported (Boer et al., 2001Go; Shionyu-Mitsuyama et al., 2003Go). These approaches were used to investigate the interaction sites on a protein or to search for ligands that bind to a protein, but not to design a new protein. In the present work, we specifically sought to design a new glycosyltransferase with altered donor specificity. Because the binding mode of the donor sugar is conserved among GT-A fold glycosyltransferases, we assumed that new donor specificity can be created by changing amino acid(s) around the donor sugar without changing the major binding mode. This would mimic the evolutionary emergence of the polyspecificity of glycosyltransferases. The {Delta}EPI score was designed to evaluate such an evolutionary model of glycosyltransferases.

In summary, this report presents a new methodology to engineer altered donor specificity into glycosyltransferases. An empirical score, termed {Delta}EPI, successfully evaluated sugar–OH protein interactions using three-dimensional structures of proteins. By using the {Delta}EPI score, mutant galactosyltransferases (GTB mutants, Ser185Asn and Ser185Cys) were predicted to transfer glucose more efficiently. The predicted mutants were prepared, and experimentally confirmed to transfer glucose more efficiently.


    Footnotes
 
Edited By Stephen Mayo


    Acknowledgements
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 Acknowledgements
 References
 
We thank Dr Hannelore Peters for the gift of the S185C mutant and Ms Sidsel Ehlers for expert technical assistance in the characterization of the GTB mutants.


    References
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 Acknowledgements
 References
 
Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J. (1997) Nucleic Acids Res. 25:3389–3402.[Abstract/Free Full Text]

Berman H.M., Westbrook J., Feng Z., Gilliland G., Bhat T.N., Weissig H., Shindyalov I.N., Bourne P.E. (2000) Nucleic Acids Res. 28:235–242.[Abstract/Free Full Text]

Boer D.R., Kroon J., Cole J.C., Smith B., Verdonk M.L. (2001) J. Mol. Biol. 312:275–287.[CrossRef][Web of Science][Medline]

Bourne Y. and Henrissat B. (2001) Curr. Opin. Struct. Biol. 11:593–600.[CrossRef][Web of Science][Medline]

Chiu C.P.C., Watts A.G., Lairson L.L., Gilbert M., Lim D., Wakarchuk W.W., Withers S.G., Strynadka N.C. (2004) Nat. Struct. Mol. Biol. 11:163–170.[CrossRef][Web of Science][Medline]

Coutinho P.M., Deleury E., Davies G.J., Henrissat B. (2003) J. Mol. Biol. 328:307–317.[CrossRef][Web of Science][Medline]

Davies G.J., Gloster T.M., Henrissat B. (2005) Curr. Opin. Struct. Biol. 15:637–645.[CrossRef][Web of Science][Medline]

DeLano W.L. (2002) DeLano Scientific. (San Carlos, CA, USA).

Gastinel L.N., Bignon C., Misra A.K., Hindsgaul O., Shaper J.H., Joziasse D.H. (2001) EMBO J. 20:638–649.[CrossRef][Web of Science][Medline]

Halgren T.A. (1996) J. Comp. Chem. 17:490–519.[CrossRef]

Heissigerova H., Breton C., Moravcova J., Imberty A. (2003) Glycobiology 13:377–386.[Abstract/Free Full Text]

IUPAC. (1997) Nomenclature of carbohydrates (Recommendations 1996). Adv. Carbohydr. Chem. Biochem. 52:43–177.[Web of Science][Medline]

IUPAC. (1997) Nomenclature of carbohydrates (Recommendations 1996). Carbohydr. Res. 297:1–90.[CrossRef][Web of Science][Medline]

IUPAC. (1997) Nomenclature of carbohydrates (Recommendations 1996). J. Carbohydr. Chem. 16:1191–1280.

IUPAC. (1996) Nomenclature of carbohydrates (Recommendations 1996). Pure Appl. Chem. 68:1919–2008.[CrossRef][Web of Science]

Jiang L., Gao Y., Mao F., Liu Z., Lai L. (2002) Proteins 46:190–196.[CrossRef][Web of Science][Medline]

Kubo A., Arai Y., Nagashima S., Yoshikawa T. (2004) Arch. Biochem. Biophys. 429:198–203.[CrossRef][Web of Science][Medline]

Lütteke T. and von der Lieth C.W. (2004) BMC Bioinformatics 5:69.[CrossRef][Medline]

Lütteke T., Frank M., von der Lieth C.W. (2004) Carbohydr. Res. 339:1015–1020.[CrossRef][Web of Science][Medline]

Marcus S.L., Polakowski R., Seto N.O., Leinala E., Borisova S., Blancher A., Roubinet F., Evans S.V., Palcic M.M. (2003) J. Biol. Chem. 278:12403–12405.[Abstract/Free Full Text]

Ouzzine M., Gulberti S., Levoin N., Netter P., Magdalou J., Fournel-Gigleux S. (2002) J. Biol. Chem. 277:25439–25445.[Abstract/Free Full Text]

Palcic M.M., Heerze L.D., Pierce M., Hindsgaul O. (1988) Glycoconj. J. 5:49–63.[CrossRef][Web of Science]

Patenaude S.I., Seto N.O., Borisova S.N., Szpacenko A., Marcus S.L., Palcic M.M., Evans S.V. (2002) Nat. Struct. Biol. 9:685–690.[CrossRef][Web of Science][Medline]

Ramakrishnan B., Balaji P.V., Qasba P.K. (2002) J. Mol. Biol. 318:491–502.[CrossRef][Web of Science][Medline]

Ramakrishnan B., Boeggeman E., Qasba P.K. (2005) Biochemistry 44:3202–3210.[CrossRef][Medline]

Seto N.O., Palcic M.M., Compston C.A., Li H., Bundle D.R., Narang S.A. (1997) J. Biol. Chem. 272:14133–14138.[Abstract/Free Full Text]

Seto N.O., Compston C.A., Evans S.V., Bundle D.R., Narang S.A., Palcic M.M. (1999) Eur. J. Biochem. 259:770–775.[Web of Science][Medline]

Seto N.O., Compston C.A., Szpacenko A., Palcic M.M. (2000) Carbohydr. Res. 324:161–169.[CrossRef][Web of Science][Medline]

Shionyu-Mitsuyama C., Shirai T., Ishida H., Yamane T. (2003) Protein Eng. 16:467–478.[Abstract/Free Full Text]

Sujatha M.S., Sasidhar Y.U., Balaji P.V. (2004) Protein Sci. 13:2502–2514.[CrossRef][Web of Science][Medline]

Thompson J.D., Higgins D.G., Gibson T.J. (1994) Nucleic Acids Res. 22:4673–4680.[Abstract/Free Full Text]

Received July 6, 2006; revised October 3, 2006; accepted October 10, 2006.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Biol. Chem.Home page
J. A. Alfaro, R. B. Zheng, M. Persson, J. A. Letts, R. Polakowski, Y. Bai, S. N. Borisova, N. O. L. Seto, T. L. Lowary, M. M. Palcic, et al.
ABO(H) Blood Group A and B Glycosyltransferases Recognize Substrate via Specific Conformational Changes
J. Biol. Chem., April 11, 2008; 283(15): 10097 - 10108.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Nakahara, T.
Right arrow Articles by Nishimura, S.-I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nakahara, T.
Right arrow Articles by Nishimura, S.-I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?