`
`OPEN ACCESS
`
`
`International Journal of
`
`Molecular Sciences
`
`ISSN 1422-0067
`www.mdpi.com/journal/ijms
`
`Article
`
`Computational Docking of Antibody-Antigen Complexes,
`Opportunities and Pitfalls Illustrated by Influenza
`Hemagglutinin
`
`Mattia Pedotti †, Luca Simonelli †, Elsa Livoti and Luca Varani *
`
`Institute for Research in Biomedicine, via Vela 6, 6500 Bellinzona, Switzerland;
`
`E-Mails: mattia.pedotti@irb.unisi.ch (M.P.); luca.simonelli@irb.unisi.ch (L.S.);
`
`elsa.livoti@irb.unisi.ch (E.L.)
`
`† These authors contributed equally to this work.
`
`* Author to whom correspondence should be addressed; E-Mail: luca.varani@irb.unisi.ch;
`
`Tel.: +41-91-820-0321; Fax: +41-91-820-0302.
`
`Received: 2 November 2010; in revised form: 22 December 2010 / Accepted: 4 January 2011 /
`
`Published: 5 January 2011
`
`
`
`Abstract: Antibodies play an increasingly important role in both basic research and the
`
`pharmaceutical industry. Since their efficiency depends, in ultimate analysis, on their
`
`atomic interactions with an antigen, studying such interactions is important to understand
`
`how they function and, in the long run, to design new molecules with desired properties.
`
`Computational docking, the process of predicting the conformation of a complex from its
`
`separated components, is emerging as a fast and affordable technique for the structural
`
`characterization of antibody-antigen complexes. In this manuscript, we first describe the
`
`different computational strategies for the modeling of antibodies and docking of their
`
`complexes, and then predict the binding of two antibodies to the stalk region of influenza
`
`hemagglutinin, an important pharmaceutical target. The purpose is two-fold: on a general
`
`note, we want to illustrate the advantages and pitfalls of computational docking with a
`
`practical example, using different approaches and comparing the results to known
`
`experimental structures. On a more specific note, we want to assess if docking can be
`
`successful in characterizing the binding to the same influenza epitope of other antibodies
`
`with unknown structure, which has practical relevance for pharmaceutical and biological
`
`research. The paper clearly shows that some of the computational docking predictions can
`
`be very accurate, but the algorithm often fails to discriminate them from inaccurate
`
`Lassen - Exhibit 1035, p. 1
`
`
`
`Int. J. Mol. Sci. 2011, 12
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`227
`
`
`
`solutions. It is of paramount importance, therefore, to use rapidly obtained experimental
`
`data to validate the computational results.
`
`Keywords: antibody modeling; computational docking;
`
`influenza; hemagglutinin;
`
`antibody-antigen complexes
`
`
`
`1. Introduction
`
`Individuals that recover from the attack of a pathogen have antibodies (Abs) capable of detecting
`
`and neutralizing the same pathogen in a future encounter, usually conferring life-long protection from
`
`it. Detection and neutralization are initiated by the binding of these antibodies to antigens, often
`
`surface proteins, through specific atomic interactions between the antibody and the region of the
`
`antigen (Ag) that it recognizes (epitope). A better understanding of these interactions is expected to
`
`accelerate vaccine development, since most current vaccines are based on the generation of
`
`neutralizing antibody responses. If we understand the structural rules governing Ab-Ag interactions in
`
`a given virus, for instance, then we have the molecular basis to attempt to design and synthesize new
`
`epitopes to be used as vaccines, optimize the antibodies themselves for passive immunization or design
`
`new drugs mimicking the antibodies or their effect.
`
`In addition to pharmaceutical development, antibodies play an increasingly relevant role in basic
`
`research and industrial processes, where they are starting to be used as recognition elements sensitive
`
`to the presence of a given antigen. Designing and synthesizing new antibodies with desired properties
`
`would, therefore, have a profound impact, but we are very far away from being able to do that. Despite
`
`antibodies having been known and characterized for several decades [1,2], in fact, we still know
`
`remarkably little about their interactions. Given an antibody structure, for instance, we cannot even
`
`predict whether it can bind a protein, nucleic acid or sugar, let alone the specific antigen or
`
`conformational epitope that it recognizes. The study of Ab-Ag complexes should further our
`
`understanding of the general principles of recognition and, in the long run, gives us the basis for the
`
`successful design of new molecules or the rational optimization of existing ones.
`
`The best way to study atomic interaction is to obtain the three-dimensional structure of
`
`antibody-antigen complexes. Traditionally, this is achieved by experimental techniques like X-ray
`
`crystallography, an often long and laborious process with high failure rate. Thanks to advances in
`
`algorithms and processing power, however, we can now use computational techniques for the
`
`structural characterization of intermolecular complexes. Computational docking—the process of
`
`predicting the conformation of a complex starting from its separated components—provides a fast and
`
`inexpensive route to obtain structures, including those which are not suitable for experimental
`
`determination. Although computational docking is still in its infancy and marred by several limitations,
`
`there is no doubt that it will become more and more accurate, relevant and widespread in the
`
`coming years.
`
`Here we first illustrate the application of computational docking to the study of antibody-antigen
`
`interactions, and then highlight the strengths and weaknesses of the approach by predicting the binding
`
`of two different antibodies to hemagglutinin, the surface protein of influenza virus and an important
`
`
`
`Lassen - Exhibit 1035, p. 2
`
`
`
`Int. J. Mol. Sci. 2011, 12
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`228
`
`
`
`pharmaceutical target. Being able to accurately predict those structures, for which X-ray information is
`
`available, would strengthen our belief that computational techniques can be used to characterize the
`
`binding of new antibodies against the same epitope.
`
`1.1. Computational Docking
`
`Computational docking, a relatively new and constantly evolving technique, is the process of
`
`predicting the structure of a multi-molecular complex from the structures of its separated components.
`
`Its progress has been monitored since 2002 by the ―Critical Assessment of PRediction of Interactions‖
`
`project (CAPRI) [3], a comparative evaluation of protein-protein docking algorithms on a set of known
`
`targets. Here we focus on docking of antibodies to protein antigens, which presents specific challenges
`
`but also has peculiar features exploitable to ease the calculations.
`
`In a typical docking protocol, the structures of the antigen and antibody are separated by
`
`approximately 25 Å and subsequently brought together by the chosen algorithm. The first necessary
`
`step, therefore, is obtaining the structures of the isolated antigen and antibody. The starting structure
`
`may be defined as follows:
`
`(i) ―Bound‖, if it originates from an experimental structure of the complex that needs to be docked.
`
`This is interesting when developing docking procedures but it is generally not biologically
`
`attractive, because computational docking is unlikely to add relevant information if an
`
`experimental structure is already available.
`
`(ii) ―Unbound‖, if it originates from an experimental structure of the molecule not bound to the
`
`partner that needs to be docked, i.e., either free or bound to a different partner. This is the most
`
`common scenario for antigens, especially since the number of available protein structures is
`
`increasing thanks to several structural genomics efforts. Structures of free antibodies, instead, are
`
`usually not available, nor they would be particularly useful since Abs are known to drastically
`
`change conformation upon binding [4].
`
`(iii) ―Modeled‖, if it has been predicted by homology modeling and/or other computational techniques
`
`like ab initio predictions or molecular dynamics. A thorough description of homology modeling
`
`for protein antigens is beyond the scope of this manuscript. Suffice to say that the results are
`
`remarkably accurate if the target protein has sequence similarity to a protein with known structure
`
`and that even ab initio predictions are starting to produce accurate results, albeit much less than
`
`homology modeling [5–7]. Antibody structures can be predicted with remarkable accuracy and
`
`precision as well; the process is relatively different from standard protein modeling and is covered
`
`in the next sections.
`
`1.2. Antibody Structure, Implications for Modeling
`
`Antibodies are large (~150 kDa), y-shaped molecules containing a so-called Fc region (Fragment,
`
`Crystallizable, it binds to various cell receptors and mediates a response of the immune system) and
`
`two Fab regions (Fragment, Antigen Binding). The latter are composed by one heavy and one light
`
`chain, each with a constant and a variable domain called FV (Figure 1). The FV is the only domain
`responsible for antigen binding and, therefore, the only one that needs to be considered for docking. It
`
`
`
`Lassen - Exhibit 1035, p. 3
`
`
`
`Int. J. Mol. Sci. 2011, 12
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`229
`
`
`
`is further subdivided in a framework region, highly conserved in both sequence and conformation, and
`
`six highly variable CDR loops (Complementarity Determining Region), three from each chain and
`
`often referred to as L1, L2, L3, H1, H2, and H3.
`
`Figure 1. Schematic (a) and cartoon (b) representation of a full antibody structure.
`
`Antigens bind to the tip of the VH and VL domains.
`
`
`
`Despite their high sequence variability, five of the six loops (all except H3) can assume just a small
`
`repertoire of main-chain conformations, called ―canonical structures‖ [5–7]. These conformations are
`
`determined by the length of the loops and by the presence of key residues at specific positions in the
`
`antibody sequence. The specific pattern of residues that determines each canonical structure forms a
`
`signature that can be recognized in the sequence of an antibody of unknown structure, allowing
`
`successful prediction of the canonical structure itself with high accuracy [8,9]. Uncertainties arise in
`
`the relatively rare cases when a loop is particularly long and/or does not follow canonical structures.
`
`The H3 loop does not appear to adopt canonical structures, instead, and predicting its conformation
`
`requires more sophisticated and less accurate approaches.
`
`The framework regions can also be reliably predicted since known structures with high sequence
`
`identity are often available. Due to the presence of conserved residues at the interface between the light
`
`and heavy chain, the relative geometry of these domains is also well preserved [10]. Correct
`
`assembling of the heavy and light chain is nonetheless critical for the accurate orientation of the
`
`antigen binding interface and errors may arise in the modeling.
`
`It is important to note that the rules and templates used for modeling are based on structures of
`
`antibodies bound to their antigen and are therefore accurate in the context of the bound conformation
`
`of an antibody.
`
`1.3. Antibody Modeling Based on Canonical Structures, the PIGS Server
`
`PIGS (Prediction of ImmunoGlobulin Structure [11]) is a web-based server for the automatic
`
`prediction of antibody structure [12] based on the canonical structure method [13]. The Web Antibody
`
`Modelling server, WAM [14], utilizes the same approach but offers less features and is generally less
`
`convenient to utilize.
`
`
`
`Lassen - Exhibit 1035, p. 4
`
`
`
`Int. J. Mol. Sci. 2011, 12
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`230
`
`
`
`In the canonical structure method, the sequence of each variable domain (VL and VH) of the
`antibody of unknown structure (target) is independently aligned with the corresponding variable
`
`domain sequences of all the immunoglobulins of known structure. For this step, standard database
`
`searching (e.g., BLAST) [15], and multiple sequence alignment (e.g., Clustalw) [16] programs can be
`
`used, but it is important to verify that residues at key structural positions are correctly aligned. The
`
`backbone structure of the framework is then modeled using the known structures with highest
`
`sequence identity as template. The rationale for this is that, in general, the higher the residue identity in
`
`the core of two proteins the more similar the conformation in this region [8] and, hence, the higher the
`
`quality of the model. Similarly, the conformation of the CDR loops is predicted using known templates
`
`with the same canonical loop conformation and high sequence identity. Different combinations of
`
`templates can be used as illustrated below.
`
`(i) Best heavy and light chains. Use the chains with highest sequence identity as templates. Since
`
`they come from different antibodies, the two chains need to be packed together by a least-squares
`
`fit of the residues conserved at the interface. This may introduce errors in the relative orientation
`
`of the two chains, with adverse consequences for the accurate modeling of the antigen binding site.
`
`(ii) Same canonical structures. Use a template whose CDR loops have the same canonical structures
`
`as the target even if a template with higher sequence identity exists for one or both chains. If
`
`framework and loops are taken from different templates, then the loops need to be grafted in,
`
`possibly introducing errors: the residues adjacent to the loop are superimposed to the framework
`
`by a weighted least-square fit of the main chain.
`(iii) Same antibody. Use the same antibody as template for both heavy and light chain, even if
`
`templates with higher sequence identity exist. This does not require optimization of the relative
`
`orientation of the two chains and thus avoids the errors illustrated earlier.
`(iv) Same antibody and canonical structures. The template is an antibody with the same canonical
`
`structures as the target and it is used to model both framework and the CDR loops. This option
`
`does not require optimization of framework orientation nor loop grafting and may offer more
`
`accurate results even if templates with higher sequence identity are available for one of the chains.
`
`The approach tends to fail, however, if the identity is too low.
`
`The conformation of five of the six CDR loops can be modeled as described but no canonical
`
`structure is known for the H3 loop. However, the so-called ―torso‖ region, i.e., the H3 residues closer
`
`to the framework, can still be predicted by similarity to antibodies sharing the same torso
`
`conformation [17–19]. The ―head‖ region of H3, instead, follows rules of standard protein hairpins and
`
`can be predicted by similarity to protein loops (not just antibodies) with high sequence identity, but the
`
`result is usually less accurate than for other CDR loops.
`
`The subsequent step consists in the modeling of the side chains conformations. At sites where the
`
`parent structure and the model have the same amino acid the conformation of the parent structure is
`
`retained. Otherwise, the side chain conformation is copied from antibodies with high sequence
`
`similarity or imported from standard rotamer libraries [20]. Finally, the model is refined by a few
`
`cycles of energy minimization to improve the stereochemistry, especially in those regions where
`
`segments of structures coming from different immunoglobulins have been joined, but not to
`
`significantly refine the models.
`
`
`
`Lassen - Exhibit 1035, p. 5
`
`
`
`Int. J. Mol. Sci. 2011, 12
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`231
`
`
`
`1.4. Antibody Modeling by Rosetta Antibody
`
`Rosetta Antibody [21] is a homology modeling program to predict antibody FV structures. It uses a
`simple energy function to simultaneously optimize the CDR loop backbone dihedral angles, the
`
`relative orientation of the light and heavy chains and the side chain conformations. The program can be
`
`downloaded and run on local computers or modeling requests can be submitted to a web
`
`server [22,23]. Rosetta Antibody first identifies the antibody templates with highest sequence identity
`
`for each framework and CDR loops; the loop templates are then grafted onto the framework and
`
`the full FV is assembled. This crude model is used as input for a second stage: a multi-start,
`Monte-Carlo-plus-minimization algorithm that generates two thousand candidate structures. H3 loop
`
`conformations are generated by assembling small peptide fragments [24] and sidechains are finally
`
`optimized via rotamer packing and energy minimization [25]. The CDR backbone torsion angles and
`
`relative orientation of light and heavy framework are also perturbed and minimized with a
`
`pseudo-energy function that includes van der Waals energy, orientation-dependent hydrogen
`
`bonding [26],
`
`implicit Gaussian salvation [27], side chain rotamer propensities [28] and a
`
`low-weighted distance-dependent dielectric electrostatic energy [29]. In the end, a scoring function is
`
`used to discriminate the 10 best antibody models that are offered as standard result.
`
`1.5. Other Procedures for Antibody Modeling
`
`Methods based on canonical structures are generally very effective but somehow limited by the lack
`
`of structural templates for a few loop conformations. Other methods model the CDR loops using
`
`templates selected by sequence identity (to other Abs or proteins in general) rather than by the
`
`presence of key residues as in the canonical structures method [30–32]. Alternative approaches have
`
`been used to model CDR loops with ab initio methods based on physicochemical principles [33–38],
`
`which have the advantage of not requiring any template and can thus be applied even if no suitable
`
`canonical structure is found. Their major limitation is that, due to our poor comprehension of the
`
`physicochemical principles governing protein structures, the pseudo-energy functions used to evaluate
`
`the different conformations often fail to distinguish a correct prediction. Another limitation is that
`
`ab initio methods tend to have higher computational costs than similarity-based approaches. As our
`
`understanding of protein structure and energy function increases, the combination of the canonical
`
`structure procedure with other more sophisticated computational approaches may offer improvements.
`
`1.6. The Docking Calculation
`
`Having chosen or generated the starting structures for antibody and antigen, the molecules are then
`
`brought together by the preferred algorithm. Computational docking must face two problems [39]:
`
`(1) Finding the correct solution, which is usually achieved by changing the relative position of the
`
`partners and repeating the calculation thousands of times; (2) discriminating the correct solution from
`
`the inaccurate ones by use of a so-called ―scoring function‖. Simply put, the scoring function rewards
`
`positive interaction between the docking partners (e.g., the formation of a hydrogen bond) and
`
`penalizes negative interactions (e.g., steric clashes). The assumption is that the correct biological
`
`structure is energetically favored and has, therefore, the lowest energy. Scoring functions, also referred
`
`
`
`Lassen - Exhibit 1035, p. 6
`
`
`
`Int. J. Mol. Sci. 2011, 12
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`232
`
`
`
`to as pseudo-energy, try to simulate this energy by accounting for biophysical considerations such as
`
`hydrophobic and electrostatic interactions, salt bridges, hydrogen bonds, etc, but also statistical and
`
`empirical considerations such as the degree of conserved residues at the interface.
`
`When searching for the correct binding orientation, the two (or more) molecules are allowed to
`
`move and the score is assessed after each step. Minimization protocols only retain conformations with
`
`a lower energy (better score) than the previous; other protocols (e.g., Monte-Carlo) may retain
`
`conformations with higher energy in an attempt to overcome local energy minima that do not
`
`correspond to the global minimum. The movement is stopped after a predefined number of steps or
`
`when the score does not improve further. The conformational parameters changed between each step
`
`vary in different docking algorithms, which may be divided in three general classes as described
`
`below: (i) only the relative position of the docking partner is changed; (ii) the relative position and
`
`the sidechain conformations are changed; (iii) the backbone conformation is altered in addition
`
`to the above.
`
`In the simplest case, the conformation of the starting structures is not altered at all during the
`
`docking process and
`
`the scoring function only needs
`
`to account for
`
`the
`
`intermolecular
`
`interactions [40,41]. This is called ―rigid body docking‖ and exploits the fact that biological interfaces
`
`have highly complementary shapes [42–45]. Needless to say, the approach works best if the starting
`
`structures are identical to the bound conformation; although this is not too common in biological
`
`complexes, good results can nonetheless be obtained in several cases. Various research groups have
`
`used this approach [46–52]; amongst them the program ZDock [53] has achieved good results in the
`
`CAPRI experiment [54–57].
`
`RosettaDock [58,59] has a first rigid body phase in which sidechains are removed, but in a second
`
`phase they are re-introduced and their orientation is optimized [60,61]. Since the sidechain
`
`conformation is dictated mainly by a limited number of allowed torsion angles, the task can be
`
`completed with reasonable success and limited computational requirements [51].
`
`Accurately simulating the backbone movements that often happen upon formation of biological
`
`complexes remains a daunting task for docking, which has a very high failure rate when molecules
`
`undergo significant conformational changes upon binding. The degrees of freedom available to protein
`
`backbone, especially in loop regions, make it extremely difficult to sample and effectively score the
`
`sheer amount of possible stable conformations. Among the programs that incorporate backbone
`
`flexibility in the docking [62–64], HADDOCK [65,66] uses a rigid body phase followed by sidechain
`
`optimization to select the best scoring decoys, and then simulates backbone flexibility on a selected
`
`number of decoys (200 with the default options) in a final stage. It is not practical to run the final
`
`stage for all the thousands of initial decoys because of the high computational requirements, although
`
`this problem will become less significant with increase in computing power. Likewise, a recent
`
`update to RosettaDock adds the option of backbone minimization to the standard protocol with
`
`moderate success [58].
`
`It is not clear which approach should work best when docking antibody-antigen complexes. It is
`
`conceivable that in vivo antibodies adapt to and are selected against existing antigen conformations,
`
`thus it might be tempting to believe that antigens should not experience drastic changes upon antibody
`
`binding. Rigid body docking might be best in this case, but first of all it is doubtful that proteins are
`
`not subjected to any conformational motion in solution, not even at the sidechain level, and
`
`
`
`Lassen - Exhibit 1035, p. 7
`
`
`
`Int. J. Mol. Sci. 2011, 12
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`233
`
`
`
`furthermore, there are examples in which antibodies provoke relatively large allosteric effects on the
`
`antigen [67]. The issue is slightly different for the antibody, instead: since antibody modeling uses
`
`bound conformations as templates, the conformational rearrangements experienced by the antibody
`
`upon binding can be ignored. It should be noted, however, that the canonical structures used for
`
`antibody modeling describe the backbone but not the sidechain conformations, which are probably best
`
`explored during the docking run. In conclusion, if one believes that the antibody model is accurate and
`
`that antigen binding loops are relatively rigid, then it should not be necessary to sample antibody
`
`backbone flexibility in the docking run. This assumption appears reasonable for the five CDR loops
`
`following canonical structural rules but it might fail for models of the H3 loop, which may be slightly
`
`inaccurate and/or might indeed be flexible in the biological context. Conversely, docking methods that
`
`vary the CDR loops’ conformation might introduce deviations from the canonical structure and
`
`decrease the accuracy.
`
`Although it is impossible to draw general rules, using rigid body approaches for the backbone but
`
`sampling different sidechain conformations might be a reasonable compromise. It might also prove
`
`useful to allow backbone movement for the H3 loop (and others when they do not follow canonical
`
`structures) while allowing only sidechain optimization of the remaining antigen binding loops. This
`
`behavior can also be approximated by generating multiple antibody models, presumably differing
`
`mainly in the H3 conformation, and using all of them as starting structures to be docked without
`
`backbone optimization, either as an ensemble [68] or serially. Besides requiring more computational
`
`time, this approach exacerbates the problem of providing a reliable scoring function: Errors are
`
`generated both by the inability to correctly assess intermolecular interactions and also by
`
`intramolecular differences amongst the various starting conformations.
`
`1.7. Exploiting the Peculiarities of Antibodies to Simplify the Docking Search
`
`Antibodies have a number of features that can be exploited to improve, speed up and simplify the
`
`docking search, much like the existence of canonical structures simplifies CDR loop modeling. The
`
`recently introduced SNUGDOCK [69,70], for instance, is geared towards antibodies and builds upon
`
`the RosettaDock protocol by adding simultaneous optimization of the antibody-antigen position, CDR
`
`loops conformation and heavy and light chain relative position.
`
`More generally, since we know that Abs interact with antigens through their antigen binding loops,
`
`there is no need to search for possible intermolecular contacts in the rest of the molecule. Typically,
`
`the antibody is initially positioned with its CDR loops facing the antigen and it is not allowed to
`
`deviate from this general orientation for the entire docking process. This not only increases the
`
`calculation speed but also generates much fewer possible models of the bound complex (decoys),
`
`easing the burden of scoring and analyzing the results. Constraints can also be introduced to reward the
`
`CDR residues for being at the interface, penalize them if they are not or penalize contacts between the
`
`rest of the antibody and the antigen. However, particular care must be exercised when introducing
`
`constraints or bonuses affecting the final score, since it is relatively easy to force an inaccurate solution
`
`or discard a valid, albeit slightly inaccurate one. For example, residues close to the CDR loops but not
`
`formally belonging to them can be at relatively close distance to the antigen, therefore rejecting any
`
`solution involving proximity of non CDR residues to the antigen would be inappropriate. Conversely,
`
`
`
`Lassen - Exhibit 1035, p. 8
`
`
`
`Int. J. Mol. Sci. 2011, 12
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`234
`
`
`
`in the known experimental structures not every CDR residues interact with the antigen and forcing
`
`them to do so during docking would be a mistake.
`
`2. Results and Discussion
`
`Although an extensive benchmarking of modeling approaches goes beyond the scope of this
`
`manuscript, we used computational docking to predict the structures of two antibody-antigen
`
`complexes recently determined by X-ray crystallography, with the aim of illustrating the potentials,
`
`pitfalls and opportunities of antibody modeling and docking.
`
`Most of the antibodies capable of neutralizing influenza virus bind to the highly variable ―globular
`
`head‖ region of hemagglutinin, which covers the viral surface. Due to this variability, however, their
`
`efficacy is limited to few viral strains and to the narrow timeframe before the virus changes its
`
`sequence to prevent antibody binding (anti-flu seasonal vaccines are usually changed every four years
`
`for this reason) [71–74]. Two independent research groups have recently described antibodies against
`
`the highly conserved ―stalk‖ region of hemagglutinin [75,76]; remarkably, X-ray structures show that
`
`they bind to an almost identical epitope utilizing very similar intermolecular contacts, for instance
`
`between aromatic residues of the Ab and a conserved hydrophobic patch on the antigen. Antibodies
`
`against the stalk have potentially broad reactivity, since the region is conserved in different viral
`
`strains, and the virus is not likely to develop resistance against them because of its inability to mutate
`
`that part of the molecule. As a consequence, the stalk is pharmaceutically attractive and generating
`
`more antibodies targeted against this very region is a worthwhile effort: any computational strategy
`
`capable of rapidly and reliably characterizing the binding properties of such new antibodies would be a
`
`valuable research tool.
`
`2.1. Modeling Antibodies against Influenza Virus Hemagglutinin
`
`In this work, we used PIGS and the Rosetta Antibody server to predict the structure of antibody F10
`
`(PDB code 3FKU) and CR6261 (PDB code 3GBM) [75,76], results are summarized in Table 1 and
`
`cartoon representations of antibodies are shown in Figure 2. The ―same antibody and canonical
`
`structure‖ approach (see description in Section 1.3) of the PIGS server is expected to offer the best
`
`results, but no viable template with high sequence identity is available. When the ―same antibody‖
`
`method is chosen, PIGS returns accurate results: the RMSD to the experimental structure is about 1 Å
`
`for CR6261, either for the whole antibody or individual loops, and 1.3 Å for F10. Choosing the
`
`individual chains with highest identity as templates (―best heavy and light chains‖ approach described
`
`above) brings the RMSD of CR6261 to 1.7 Å, still accurate but somehow less precise than before.
`
`Curiously, the most problematic loop is H2 and not H3 as it usually happens. Finally, the ―same
`
`canonical structures‖ approach yields the worst model, with RMSD of 2.1 Å (2.4 Å in the least
`
`accurate loops). In the case of F10, the last two approaches return models with unacceptable steric
`
`clashes and highly unusual features that are consequently discarded, while choosing templates with
`
`lower sequence identity gives results worse than the ―same antibody‖ option and not further analyzed.
`
`Even if no benchmarking is available, the authors’ recommendation to use the ―same antibody‖
`
`approach is in agreement with our general experience and with these particular antibodies.
`
`
`
`Lassen - Exhibit 1035, p. 9
`
`
`
`Int. J. Mol. Sci. 2011, 12
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`235
`
`
`
`In contrast to PIGS, Rosetta Antibody does not offer a choice of different modeling methods but,
`
`starting from the same templates, returns 10 different models for each target antibody. We refer to
`
`them as PIGS or Rosetta1-10 according to the modeling program used and to their relative score,
`
`i.e., Rosetta1 is the best scoring model, Rosetta2 the second best scoring and so forth. The best models
`
`for CR6261 (e.g., Rosetta2 and Rosetta5 in Table 1) are as accurate as those predicted by PIGS, with
`
`RMSD slightly higher but not significantly so; the worst model has RMSD of 1.9 Å (2.5 Å for the H3
`
`loop), still an accurate result. Similar considerations are true for F10, with RMSD of 1.7 Å (2.0 Å for
`
`H3) for the best model and 2.6 Å (3.4 Å for H3) for the worst one.
`
`There is no way to assess which of the 10 Rosetta models is best in a blind experiment, when the
`
`antibody structure is not known. The model with the best score, thus preferred by the algorithm, is not
`
`the most accurate, neither in this example nor in our general experience but, in a worst-case scenario,
`
`even selecting the worst generated model would be acceptable. The issue is not overly important as far
`
`as docking is concerned, as it will be shown later, and the ensemble of different models might actually
`
`represent a conformational fl