Skip Navigation



PEDS Advance Access published online on October 28, 2008

Protein Engineering Design and Selection, doi:10.1093/protein/gzn059
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
22/3/217    most recent
gzn059v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Dudgeon, K.
Right arrow Articles by Christ, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dudgeon, K.
Right arrow Articles by Christ, D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2008. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oxfordjournals.org

Short Communication

Sequence determinants of protein aggregation in human VH domains

Kip Dudgeon1,3, Kristoffer Famm2,3 and Daniel Christ1,4

1Garvan Institute of Medical Research, 384 Victoria Street, Darlinghurst, Sydney, NSW 2010, Australia 2Centre for Protein Engineering, Medical Research Council Centre, Hills Road, Cambridge CB2 2QH, UK

4 To whom correspondence should be addressed. E-mail: d.christ{at}garvan.org.au


    Abstract
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Funding
 References
 
Human antibody variable heavy (VH) domains tend to aggregate upon denaturation, for instance, by heat or acid. We have previously demonstrated that domains resisting protein aggregation can be selected from CDR-only repertoires by phage display. Here we analysed their sequences to identify determinants governing protein aggregation. We found that, while many different CDR sequences conferred aggregation–resistance, certain physico-chemical properties were strongly selected for. Thus, hydrophobicity and beta-sheet propensity were significantly lower among the selected domains, whereas net negative charge was increased. Our results provide guidelines for the design of human VH repertoires with reduced levels of protein aggregation.

Keywords: antibodies/combinatorial repertoire/molecular evolution/phage display/protein aggregation

Many human proteins aggregate, particularly in response to non-physiological conditions such as heat and acidic pH. This is also the case for human antibody domains, such as variable heavy (VH) chains, which tend to aggregate in isolation. This is in marked contrast to VH from other species, such as camels, which naturally occur in the absence of light-chain partners. Structural studies of camelid single domains have revealed specific features that are believed to improve their solubility (Desmyter et al., 1996Go; Spinelli et al., 1996Go). These include increased hydrophilicity of surface patches that form the light chain interface in other species. Consequently, mutation of residues in this interface has been shown to improve solubility and to reduce aggregation of human VH domains (Davies and Riechmann, 1994Go; Barthelemy et al., 2008Go).

However, other studies have shown that similar aggregation–resistant properties can be observed among human VH domains lacking mutations in the light chain interface (Jespers et al., 2004bGo). These domains had been derived from a synthetic antibody library based on a single human framework, but different CDR regions, highlighting the effect of variable regions on the biophysical properties of human VH domains. Other studies have reported similar results, with CDR sequences shown to affect expression yields, retention on gel-filtration columns and thermodynamic stability (Jespers et al., 2004aGo; Famm et al., 2008Go; Barthelemy et al., 2008Go).

We have previously reported a method that allowed the selection of human variable heavy chains resisting aggregation from a synthetic single framework CDR-only library (Jespers et al., 2004aGo; Famm and Winter, 2006Go). This method was based on the display of VH domains on phage, followed by heat-induced unfolding and capture of aggregation–resistant domains using protein A, a conformation-specific superantigen. The domains selected by this method not only resisted aggregation when heated above their unfolding temperature, but also displayed favourable expression and purification yields (Jespers et al., 2004aGo). However, while we noted that many of the selected domains exhibited acidic isoelectric points, detailed sequence characteristics remained unknown. Here we describe the CDR-sequence determinants governing protein aggregation in human VH domains.


    Material and Methods
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Funding
 References
 
Selection for aggregation–resistance on phage

Phages from a VH CDR-only library based on the human V3-23/D47 framework [library 1, 2 x 109 clones (Jespers et al., 2004aGo)] were purified from culture supernatant by two precipitations with polyethylene-glycol/NaCl solution [as previously described (Lee et al., 2007Go)]. Phages were subjected to a single round of heat treatment [in contrast to three rounds of treatment used by Jespers et al. (Jespers et al., 2004aGo)]. Approximately 1010 phages were heated in 10 µl of phosphate-buffered saline (PBS) at 80°C for 10 min and cooled to 4°C (heat-selected repertoire). As a control, phages were incubated in PBS at room temperature (unselected repertoire). Phages from both repertoires were diluted into PBS supplemented with 2% (w/v) marvel milk powder (MPBS) and captured on a protein A coated MaxiSorp Immuno-plate (Nunc) (which had been coated over night with protein A at 10 µg/ml in PBS at room temperature and blocked using 2% MPBS). The selected phages were added to the blocked wells for 2 h at room temperature. After 10 washes with PBS, protein A-bound phages were eluted in 1 mg/ml trypsin in PBS for 10 min at room temperature. The eluted phages were used to infect exponentially growing Escherichia coli TG1 bacteria (Gibson, 1984Go) (with an OD600 of approximately 0.5) by incubation for 30 min at 37°C. Bacteria were pelleted by centrifugation, plated on TYE agar plates supplemented with 15 µg/ml tetracycline and grown at 37°C over night.

Phage ELISA for aggregation–resistance

Aggregation–resistance of clones was analysed by measuring retention of signal in a phage ELISA format (McCafferty et al., 1990Go). Wells of a Nunc Maxisorp Immuno-plate were coated over night with protein A at a concentration of 2.5 µg/ml in PBS. The plate was washed once with PBS and blocked with 2% MPBS. Single colonies were picked from agar plates and grown over night in 2 x TY medium supplemented with 15 µg/ml tetracycline in 96-well round bottom plates shaking at 37°C. Cells were removed by centrifugation and phages were biotinylated directly in the culture supernatant by adding biotin-PEO4-N-hydroxysuccinimide (Pierce; 50 µM final concentration). For heat selection, supernatant was incubated at 80°C for 10 min and then cooled to 4°C. Supernatant was added to the blocked ELISA wells. After three washes with PBS, bound phage particles were detected using an Extravidin-HRP conjugate (Sigma) and 3,3',5,5'-tetramethylbenzidine (TMB) substrate.

Sequencing of repertoires

Regions of the phage genome encoding the VH domains were amplified by PCR from bacterial cultures using primers 5'-CGGTTTATCAGCTTGCTTTCG-3' and 5'-GTTTTGTCGTCTTTCCAGACG-3' and subjected to DNA sequence analysis.


    Results
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Funding
 References
 
We first selected aggregation–resistant domains by heating a large CDR-only VH library (2 x 109 clones) (Jespers et al., 2004aGo) on phage: the phages were heated to 80°C, followed by capture of domains resistant to heat-induced aggregation by protein A (heat-selected repertoire). As a control, the VH library was incubated at room temperature and captured by protein A (unselected repertoire). We next analysed the aggregation propensity of the two repertoires by phage ELISA. For this purpose, phages were heated and aggregation–resistant clones detected with an anti-phage conjugate after protein A capture. Using this assay, we identified 40 clones from each repertoire, which differed considerably in their aggregation–resistance (with an average retention of ELISA signal after heating of 88% for the heat-selected clones and 42% for the unselected clones).

As the clones from the two repertoires strongly differed in their resistance to aggregation, we wondered whether this property was reflected in their sequence characteristics. We therefore determined their sequences and calculated amino acid frequencies at all individual CDR positions. This revealed considerable diversity at all of the analysed positions (Fig. 1A and B). However, further analyses found that the frequencies of amino acids at several positions differed significantly between the two repertoires, indicating selection pressure associated with the heat-denaturation step (Fig. 1C). This included strong enrichment (p < 0.01) of valine at position 29, glycine at position 35 and serine at position 102. A considerable preference (p < 0.001) was observed for valine at position 100l and in particular for glutamate at position 32 (p < 10–6). We also found that certain amino acids were less frequent among the selected domains, in particular arginine and lysine. However, with the exception of arginine at position 52, these biases were generally weak (p < 0.05). Many of these positions are located within CDR 1. In total, we detected six positions, three in CDR 1 (29, 32, 35), one in CDR 2 (52) and two in CDR 3 (100l, 102) that differed strongly (p < 0.01) in their amino acid composition between the two repertoires.


Figure 1
View larger version (51K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. CDR amino acid frequencies of VH repertoires. Frequencies were calculated from sequences of 40 clones from the unselected VH repertoire (A), as well as 40 clones selected by heating on phage (B). Amino acids are indicated by single letter codes and frequencies correlate with letter size. Amino acids are colour-coded according to physico-chemical properties: green (G, S, T, Y, C, Q, N) indicates polar residues, blue (K, R, H) basic residues, red (D, E) acidic residues, black (F, V, L, A, M, I, W, P) hydrophobic residues. Numbering of CDR residues according to Kabat and Wu (Kabat and Wu, 1991Go). Residues 99–100 k are shown as a single, averaged position to account for variable CDR 3-length in repertoires. (A) and (B) were generated using WebLogo (Crooks et al., 2004Go). (C) shows amino acid frequencies (in percent) at positions displaying significant sequence bias. Upper values refer to the unselected repertoire, lower values to the heat-selected repertoire. Shading indicates statistically significant bias as follows: light gray p < 0.05; gray p < 0.01; dark gray p < 0.001; black p < 10–6. p-values were calculated using {chi}2 statistics with one degree of freedom; amino acids rare at a given position in the unselected repertoire (frequency of 10% or less) were excluded from the analysis.

 
We next determined physico-chemical properties of longer stretches of the proteins, both at the domain-level (Table I, first row), as well as for individual CDRs (Table I, rows 2–4). This revealed strong increase of net negative charge for the heat-selected VH domains, as well as reduced hydrophobicity. This is in agreement with the enrichment of acidic and polar residues at individual positions (as above). We also observed that beta-sheet propensity of the domains was significantly reduced. Similar trends were observed for individual CDRs, particularly CDR 1 and, to a lesser degree, CDR 3 and CDR 2 (Table I; rows 2–4).


View this table:
[in this window]
[in a new window]

 
Table I. Physico-chemical properties of VH domains and their CDR regions

 
In addition to heat-induced aggregation, we also analysed the sequence features of a repertoire of VH domains that had previously been selected by incubation at acidic pH (Famm et al., 2008Go). In contrast to the domains selected by heat, we did not observe physico-chemical properties to differ significantly from the unselected repertoire (data not shown). This is in agreement with our earlier findings that these proteins resist acid-induced aggregation mainly through the selection of specific point mutations (such as Arg28) (Famm et al., 2008Go).


    Discussion
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Funding
 References
 
The use of phage display for the selection of antibody domains from a large combinatorial repertoire has allowed us to identify CDR-sequence determinants governing aggregation–resistance. For the VH domains that had been selected by heating on phage, we detected significant enrichment of amino acids at several positions, particularly within CDR 1. This included glycine 35, a residue previously shown to increase the solubility of VH domains (Jespers et al., 2004bGo; Barthelemy et al., 2008Go). However, other substitutions had not been previously described, including a strong preference for glutamate at position 32.

We also wondered whether we could detect changes that may not be obvious from the analyses of amino acid frequencies at single positions. Such changes may originate from the effect of combined amino acid changes at multiple positions or may be obscured by co-variation. Further analysis did indeed reveal properties that had not been obvious from the analyses of single positions. Thus, hydrophobicity and beta-sheet propensity were significantly lower among the selected domains, while net negative charge was increased. All of the observed properties have previously been implicated in protein aggregation and the formation of amyloid-like aggregates (Dobson, 1999Go). These properties are also the basis of algorithms that aim to predict the formation of protein aggregates (DuBay et al., 2004Go; Fernandez-Escamilla et al., 2004Go). Indeed, we observed that these algorithms predicted lower levels of protein aggregation for heat-selected VH repertoires (data not shown).

The effect of charge on protein aggregation and solubility has long been known and proteins display minimal solubility near their isoelectric points (Loeb, 1921Go). Our observation that increased negative charge plays a major role in aggregation–resistance is in agreement with reports on the engineering of proteins other than VH. In a recent study, Whitesides and co-workers reported that increasing negative charge in alpha-amylase (by chemical acetylation of lysines) significantly improved resistance against thermal and chemical inactivation (Shaw et al., 2008Go). Liu and co-workers reported the engineering of green fluorescent protein (GFP) into a highly aggregation–resistant form by the introduction of negatively charged surface residues (Lawrence et al., 2007Go). In contrast to our results, they also observed similar effects for a highly positively charged GFP variant (although not for other proteins, such as glutathione-S-transferase).

In summary, we found that CDRs with highly diverse sequences, but common physico-chemical properties, can give rise to human variable domains resisting protein aggregation. This is in agreement with previous results, demonstrating that repertoires of aggregation–resistant domains can be generated by combinatorial assembly of pre-selected CDR regions (Christ et al., 2006Go). Although good antigen binders were obtained in these experiments, pre-selection did lead to a considerable loss of CDR-diversity (Christ et al., 2007Go). Our results here suggest an alternative path to the construction of such repertoires. Thus, the identification of sequence determinants governing protein aggregation may open up the path towards the design of human VH repertoires with reduced levels of protein aggregation. This may include the use of trinucleotide mutagenesis (Virnekas et al., 1994Go) to limit diversity at restrictive positions (such as CDR 1, residue 32) and the targeting of diversity to other regions (such as CDR 2) that display little sequence bias in our analyses.


    Funding
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Funding
 References
 
The work was funded by the United Kingdom Medical Research Council (K.F.), the Garvan Institute of Medical Research, the Australian National Health and Medical Research Council and the Cancer Institute NSW (K.D. and D.C.).


    Footnotes
 
3 Both authors contributed equally to this work Back

Edited by P. Holliger


    References
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Funding
 References
 
Barthelemy P.A., Raab H., Appleton B.A., Bond C.J., Wu P., Wiesmann C., Sidhu S.S. J. Biol. Chem. (2008) 283:3639–3654.[Abstract/Free Full Text]

Chiti F., Stefani M., Taddei N., Ramponi G., Dobson C.M. Nature (2003) 424:805–808.[CrossRef][Web of Science][Medline]

Christ D., Famm K., Winter G. Nucleic Acids Res. (2006) 34:e108.[Abstract/Free Full Text]

Christ D., Famm K., Winter G. Protein Eng. Des. Sel. (2007) 20:413–416.[Abstract/Free Full Text]

Cowan R., Whittaker R.G. Pept. Res. (1990) 3:75–80.[Medline]

Crooks G., Hon G., Chandonia J., Brenner S. Genome Res. (2004) 14:1188–1190.[Abstract/Free Full Text]

Davies J., Riechmann L. FEBS Lett. (1994) 339:285–290.[CrossRef][Web of Science][Medline]

Desmyter A., Transue T.R., Ghahroudi M.A., Thi M.H., Poortmans F., Hamers R., Muyldermans S., Wyns L. Nat. Struct. Biol. (1996) 3:803–811.[CrossRef][Web of Science][Medline]

Dobson C.M. Trends Biochem. Sci. (1999) 24:329–332.[CrossRef][Web of Science][Medline]

DuBay K.F., Pawar A.P., Chiti F., Zurdo J., Dobson C.M., Vendruscolo M. J. Mol. Biol. (2004) 341:1317–1326.[CrossRef][Web of Science][Medline]

Famm K., Winter G. Protein Eng. Des. Sel. (2006) 19:479–481.[Abstract/Free Full Text]

Famm K., Hansen L., Christ D., Winter G. J. Mol. Biol. (2008) 376:926–931.[CrossRef][Web of Science][Medline]

Fernandez-Escamilla A.M., Rousseau F., Schymkowitz J., Serrano L. Nat. Biotechnol. (2004) 22:1302–1306.[CrossRef][Web of Science][Medline]

Fersht A.R. Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding (1998) Freeman.

Gibson T.J. (1984) Cambridge: University of Cambridge. PhD thesis.

Jespers L., Schon O., Famm K., Winter G. Nat. Biotechnol. (2004) a 22:1161–1165.[CrossRef][Web of Science][Medline]

Jespers L., Schon O., James L.C., Veprintsev D., Winter G. J. Mol. Biol. (2004) b 337:893–903.[CrossRef][Web of Science][Medline]

Kabat E.A., Wu T.T. J. Immunol. (1991) 147:1709–1719.[Abstract]

Lawrence M.S., Phillips J.P., Liu D.R. J. Am. Chem. Soc. (2007) 129:10110–10112.[CrossRef][Web of Science][Medline]

Lee C.M., Iorno N., Sierro F., Christ D. Nat. Protoc. (2007) 2:3001–3008.[CrossRef][Medline]

Loeb J. J. Gen. Physiol. (1921) 4:547–555.

McCafferty J., Griffiths A.D., Winter G., Chiswell D.J. Nature (1990) 348:552–554.[CrossRef][Web of Science][Medline]

Shaw B.F., Schneider G.F., Bilgicer B., Kaufman G.K., Neveu J.M., Lane W.S., Whitelegge J.P., Whitesides G.M. Protein Sci. (2008) 17:1446–1455.[CrossRef][Web of Science][Medline]

Spinelli S., Frenken L., Bourgeois D., de Ron L., Bos W., Verrips T., Anguille C., Cambillau C., Tegoni M. Nat. Struct. Biol. (1996) 3:752–757.[CrossRef][Web of Science][Medline]

Virnekas B., Ge L., Pluckthun A., Schneider K.C., Wellnhofer G., Moroney S.E. Nucleic Acids Res. (1994) 22:5600–5607.[Free Full Text]

Received September 30, 2008; revised September 30, 2008; accepted October 6, 2008.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
22/3/217    most recent
gzn059v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Dudgeon, K.
Right arrow Articles by Christ, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dudgeon, K.
Right arrow Articles by Christ, D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?