throbber
Pfizer v. Genentech
`IPR201(cid:26)-01488(cid:3)
`Genentech Exhibit 202(cid:25)
`
`

`

`
`
`Our listing of sequences will be kept up to date. investigators are invited to send additional
`sequence data when accepted for publication. Send two copies of the manuscript together with a
`letter of acceptance from a journal to:
`Dr. E.A. Kabat
`National institutes of Health
`Building 8, Room 126
`9000 Rockville Pike
`Bethesda, Maryland 20892
`
`It would be extremely helpful if you can send us your sequence data on magnetic tapes or floppy
`diskettes or a clean copy of the sequences. The file formats should be such that they can be read
`by a generic word processor.
`
`When published, three reprints should be provided.
`
`If any published sequences have been overlooked or if any errors are found, please bring them to
`our attention.
`
` .tilt w
`
`at<h€jég>m>wnmxfl<Wa.
`
`”wow
`
`Page 2
`Page 2
`
`

`

`vii
`
`INTRODUCTION
`
`Our earlier “Variable Regions of Immunoglobulin Chains” (1), the second edition “Sequences
`of immunoglobulin Chains" (2) and the third edition “Sequences of Proteins of Immunological interest”
`(3) have been further expanded herein to include amino acid and nucleotide sequences of precursors,
`variable regions, constant regions, J-Chain. flgwmicroglobuiins, antigens of the major histocompatibility
`complex (HLA, H-2, la, DR) as well as of Thy—1 , complement, T—lymphocyte receptors for antigens, other
`T—ceil antigens ofthe immunoglobulin superfamily, interieukins and various other proteins related to
`immune functions. The identification and sequencing of clones obtained using recombinant DNA
`techniques has yielded nucleotide sequences of signal, variable, and constant regions of immunoglo-
`bulins (4,5), and these nucleotide sequences have been translated into amino acid sequences. instances
`of the latter have been included in the tables of amino acid sequences and are indicated by an apostrophe
`followed by CL afterthe name of the clone. We have continued to use the PROPHET Computer System
`of the Division of Research Resources, National institutes of Health (6,7) to tabulate the sequences.
`
`in compiling the data we have tried to be as up—to—date as possible and have included only
`sequences which have been published or which have been accepted for publication. Residues which
`have not been definitely determined have been excluded. it should be remembered that sequences
`are often published in review articles without detailed documentary evidence. These have often been
`revised. We have listed such revisions in the notes in many instances; others can readily be found by
`comparison with sequences in previous editions.
`
`Since the preparation of camera-ready copy for printing the pages is carried out in sequence
`from page 1 in batches, the amino acid sequences were set several months before the nucleotide
`sequences. We have continued to include new nucleotide sequences up to the point at'which camera-
`ready copy for them had to be set, but translated amino acid sequences were not able to be included.
`Thus many nucleotide sequences appear without translation. When antibody activities were known,
`they have been listed at the end of the nucleotide sequences and are included in the index.
`
`When doubts arise as to the validity of any residue in a sequence, the original reference
`should be examined to ascertain whether definitive evidence for the sequence has been provided. We
`have sent the amino acid and nucleotide sequences as stored in the computer to the original authors
`for verification. if so verified, this is denoted by “(checked by author)" at the end of each reference.
`Except for the earliest sequences, the date on which the checked sequence was returned to us is given.
`Whenever possible, nucieotide sequences from GenBank (8) have been used. Programs for converting
`a GenBank sequence to the codon format of our tables have been developed. The correctness of the
`table sequence has been verified by converting back into the linear form and comparing with GenBank.
`When this has been done the sequence is listed as “(from GenBank)”. if the sequences were entered
`by us from the literature and then checked with GenBank, this is indicated by "(checked with GenBank)”.
`We have entered many nucleotide sequences which were not available from GenBank. In general, we
`have not included stretches of sequence such as enhancers, switch regions and introns for which no
`codification of the nucleotide sequences is as yet agreed upon with respect to function, etc. Much
`information about such stretches may be found in references 9, 10.
`
`it is also possible, by examining the numbers of sequences at the end of each table and the
`summary tables, to evaluate the probability that a given amino acid at a given position (may not be
`correct. This is most readily done forthe framework residues of the V—region and for the C-region; in the
`compiementarity-determining regions this is more difficult because of the high variability.
`
`'
`
`Amino Acid Sequences
`
`The first column in each table gives the residue number. Except for complement, T-ceil surface
`antigens and miscellaneous proteins, the second column is a tabulation of invariant residues. Since
`exceptions to invariance are found, the frequency, if less than 1.0 and greater than or equal to 0.95,
`is indicated alongside the residue listed as invariant; when only a single sequence is available, this
`is not given. Each sequence is tabulated in each subsequent column. Dashes (———) indicate that no
`amino acid is present at that position and that the sequence continues. in all instances residues
`
`Page 3
`Page 3
`
`

`

`viii
`
`considered uncertain by the authors have not been included in the table. in some instances the symbol
`# is used to indicate that several amino acid residues were found in one position, and these residues
`are listed in the notes. The four columns at the end of each table give:
`
`1.
`2.
`3.
`4.
`
`the number of residues sequenced at that position,
`the number of different amino acids found at that position,
`the number oftimes the most common amino acid occurred and that amino acid in parentheses, and
`the variability.
`
`Variability is calculated (11) as:
`
`Variability =
`
`Number of different amino acids occurring
`at a given posrtion
`Frequency of the most common amino acid
`at that position
`
`An invariant position would have a variability of one; if 20 amino acids occurred with equal
`frequency, the variability would be 20 divided by 0.05 equals 400. if, for example, four different amino
`acids Ser, Asp, Pro, and Thr occurred at a given position, and of 100 sequences available at that position,
`Ser occurred 80 times, the variability would be 4/08 = 5. When any of the amino acid residues
`sequenced were not identified completely and are listed as Glx (or Asx), two values, separated by a
`colon, are given in the last three columns. The first value in each of these columns is calculated assuming
`that only one ofthe two possibilities, e.g., Glu or Gln (or Asp or Asn) occurred, while the second considers
`that both were present and maximizes variability. in the variability plots, the horizontal bars indicate
`the two values.
`
`When two or more amino acids are most common and occur with equal frequency, they are
`tabulated as a note, and the symbol + is used in the next to last column. If no sequence data have been
`reported for any position, there are no entries in the last four columns. Variability is not calculated for
`insertions or ifonly a single sequence is known. When the translated sequence of a clone corresponds
`to a previously listed sequence of a plasmacytoma from which it was prepared, only one sequence
`is listed so that the variability computations are not affected, and a note is included.
`
`If a given sequence is associated with any antibody activity, this is indicated by an asterisk alongside
`the protein heading, and the antibody specificities are given in a separate list with binding constants
`if available. The notes list the a-allotypes for the rabbit heavy chain V-region and the b-allotypes for
`the constant domain of the rabbit kappa light chain. A key reference to the sequence is given; generally
`the most recent reference since it is usually the most nearly complete, but often several references
`are included, especially when revisions of a sequence have been made. Notes are now of two types;
`general notes about a table indicated by the symbol #, and specific notes indicated by the sequence
`number.
`
`Signal Sequences
`
`The signal (precursor) amino acid sequences of immunoglobulin chains are listed in three tables:
`one for kappa light chains, one for lambda light chains, and one for heavy chains. They were obtained
`either by direct sequencing of signal proteins (12-14) or bytranslating nucleotide sequences from DNA
`clones. Signal segments range from 17-29 amino acid residues in length and are thus numbered from
`~29 to —1. Genomic DNA clones contain introns of varying length that interrupt the coding sequence
`of the precursor within the codon for position —4, and in rare cases for position -6. Thus, the L—gene
`encodes the leader peptide to position —4 and the 5’ end of the V-gene codes for positions -4 to ‘1.
`
`The signal amino acid sequences of the T—cell receptors for antigens, [32-microglobulins,
`major histocompatibility complex proteins, and complement components are listed in separate tables.
`
`By conformational energy calculations, the core VK hydrophobic Leu-Leu-Leu-Trp—Val-Leu-Leu—
`Leu (MOPCB21, MOPCES) exists in an alpha helical conformation, terminated by chain reversal confor-
`mations in the four C-terminal residues Trp—Val-Pro—Gly; the four amino terminal residues are compatible
`with the alpha heiix (15).
`
`Variable Region Sequences
`
`The variable regions (16) of immunoglobulins have been shown to contain hypervariable segments
`in their light (11 ,17-23) and heavy (2224—27) chains, of which certain residues have been affinity labeled
`(28-41). Three hypervariable segments of light chain were delineated from a statistical examination
`Page 4
`Page 4
`
`
`

`

`of sequences of human Vx, human VA, and mouse Vx light chains aligned for maximum homology
`(11 ,22). These and the three corresponding segments of the heavy chains (22,26,27) were hypothesized
`(11,22) to be the complementarity-determining regions or segments (CDR) containing the residues
`which make contact with various antigenic determinants, and this has been verified by X—ray diffraction
`studies at high resolution (42-67). The rest of the V—region constitutes the framework (11 22,6668).
`It is convenient to identify the framework segments (FR1, FR2, FR3, and FR4) and the complementarity-
`determining segments (CDFli, CDR2, and CDRS) with the three CDRs separating the four FRs. The
`residue numbers for these segments are as follows:
`
`Light Chain
`
`Heavy Chain
`
`Segment
`
`FFtt
`
`CDFtt
`
`FR2
`
`CDR2
`
`‘
`
`1-23 (with an occasional
`residue at O, and a
`deletion at 10 in V) chains)
`
`24-34 (with possible
`insertions numbered
`as 27A,B,C,D,E,F)
`
`35-49
`
`50—56
`
`130 (with an occasional
`residue at O)
`
`31—35 (with possible
`insertions numbered
`as 35A,B)
`
`36-49
`
`50-65 (with possible
`insertions numbered
`as 52A,B,C)a
`
`66—94 (with possible
`insertions numbered
`as 82A,B,C)
`
`95-102 (with possible
`insertions numbered as
`100A,B,C,D,E,F,G,H,|,J,K)
`
`103-113
`
`FRS
`
`57-88
`
`CDFi3
`
`FR4
`
`89-97 (with possible
`insertions numbered as
`95A,B,C,D,E,F)
`
`98-107 (with a possible
`insertion numbered
`as 106A)
`
`‘5 in the rabbit, Mage et al. (69) consider position 65 in VH to be in FRS, since it is allotype related.
`
`in the tables of V-regions, the FR and CDR are separated by horizontal lines for convenience
`in reading. One mouse kappa light chain, MP011, has an extra segment of 12 amino acid residues
`between position 1 and the signal sequence (70). Several chains have internal deletions.
`
`in the tables, the V—genes for the light chains code to amino acid position 95, and the J~rninigenes
`from position 97to 107for lambda and 108 for kappa light chains. Position 96 is usually the site of V~J
`joining by recombination and may be coded partly by the V~gene and partly by the J—minigene. Because
`the site of V—J recombination could occur at different positions within a cotton, different amino acid
`residues may resuit at this position. We have changed the location ofthe inserted residues from 97A-
`F (2) to 95A-F, since it makes for better alignment by confining chains of different lengths to the V-gene
`region. in V,C chains, J1 and J2 were used 5 to 10 times more frequently than J4 and J5 (71 ).
`
`The V—genes for the heavy chains code up to amino acid position 94 and are followed by the D-
`and J—minigenes. Because of the extensive variation in the lengths of D—minigenes, the exact boundary
`between D and J is not always located at the same amino acid position. In addition, the lengths of the
`J encoded amino acid sequences vary by a few amino acid residues. Moreover, the process of D-J
`joining appears to invoive insertions of extra nucleotides between V and D and between D and J, termed
`the N region (72—76) and correlates with the appearance of terminal deoxytransferase in B cells (75).
`The original numbering system for the heavy chains has therefore been retained. Wysocki eta]. (76)
`have provided some evidence suggesting a non-random origin for the VH-DH junction, perhaps a
`minigene, rather than random addition of the N nucleotides.
`
`It has become evident that a critical understanding of the architecture of antibody combining sites
`and the genetics of the generation of diversity and of antibody complementarity will depend to a great
`extent on the evaluation of a large number of sequences ofthe variable regions and especially of the
`complementarity-determining segments of light and heaw chains of immunoglobulins of different spe-
`cies. Ability to locate residues in the site making contact with antigenic determinants (77) and to predict
`(67,7862) the structures of antibody combining sites will depend heavily upon such sequences.
`
`Figures 1 and 2 are stereoviews of the a-carbon skeletons of the four Fv regions for which high
`resolution X-ray structures have been determined, NEWM (44), KOL (62), MCPCBOS (47, 48, 63), and
`J539 (64). The residues in the CDRs are shown as solid circles. ln Fig. 1 the combining site is at the
`
`Page 5
`Page 5
`
`

`

`MCPC603
`
`
`
`FIG. 1. Stereodrawings of the a-carbon skeletons of four Fv regions studied crystallographically. Top to bottom:
`NEWM(43,44,49,59), KOL (62), MCP0603(47, 48, 53, 55, 63), J539(64). Coordinates for NEWM, KOL from the
`Protein Data Bank (Bernstein et al. 1977, J. MOI. Biol. 112:532-544); for MCPCBOS and J539 courtesy of David
`R. Davies. The stereodrawings of Figures 1,2,3 and 4 were prepared by Dr. Eduardo Padlan.
`VL is on the left, VH on the right. The first and last residues of each chain as well as several other residues are
`shown as open circles for reference; residues of the CDT-ts are shown as solid circles. The Fv’s are aligned by
`least-squares superposition using the program ALIGN (G.H. Cohen, NIH); the stereoplots were prepared using
`the program PLUTO (S. Motherwell, Cambridge, England). The view shown is with the combining site at the top.
`With a stereo viewer it is possible to see two adjacent models at the same time, so that a comparison may be made
`in three dimensions.
`
`Page 6
`Page 6
`
`

`

`
`
`Page 7
`Page 7
`
`

`

`
`
`xii
`
`REI
`
`LOC
`
`RHE-1
`
`
`RHE-2
`
`FIG. 3. Stereodrawings ot the a-carbon skeletons of four Bence Jones VL dimers studied crystallographically.
`Top to bottom: MCG(42,46,50), REI (45, 51, 57), LOC(61), RHE(60). The bottom View is of RHE with its twofold
`axis toward the top of the page. The left VL is oriented like the VL of the Fv’s in Fig. 1 . The view shown is like that
`of Fig. 1. Coordinates for MCG, REl, RHE from the Protein Data Bank; for LOC, courtesy of Dr. Marianne Schiffer.
`
`Page 8
`Page 8
`
`

`

`
`xiii
`
`MCG
`
`REI
`
`LOC
`
`RHE-1
`
`RHE-2
`
`FIG. 4. Same as Fig. 3. but looking into the combining site. Models perpendicular to those of Fig 3. The bottom
`View of RHE is Iooking into its twofold axis.
`
`Page 9
`Page 9
`
`

`

`
`
`xiv
`
`top; the View of Fig. 2 is perpendicular to Fig. 1 and is looking into the combining site. The different
`orientations of the loops containing the complementarity-determining regions provide some insight
`into how specificity of various sites might differ (67,82). It the amino acid side chains were included,
`the differences would become much more detailed.
`
`Figures 3 and 4 are stereoviews of the fourVL dimers, Bence Jones proteins, for which high reso-
`lution X-ray structures are available, MCG (42,46,50), REI (45, 51, 57), LOC (61) and RHE (60). The
`VL chains each contribute four p—strands to the VL—VL or VH-VL interaction whereas the VH chains each
`provide five (65). Thus, although in a Bence Jones VL dimer one VL assumes the position of VH (50),
`nevertheless the absence of one B—strand in each VL may make the sites of VL dimers less specific
`than those of the Fab fragments. This is supported by the finding that the VL dimer of MOS binds a
`wide variety of ligands whereas no ligand which binds has yet been found for the MCG protein (83).
`
`A recent high resolution x-ray crystallographic study (84) of a crystalline complex of lysozyme
`with a monoclonal anti~lysozyme shows that contact between lysozyme and antibody occur on a rather
`flat surface with the interactions largely due to protruberances and depressions formed by the amino
`acid side chains producing a tightly packed region of interaction. The lysozyme determinant involves
`two noncontiguous stretches, residues 18 to 27 and 116 to 119 of its polypeptide chain. All six CDRs
`of the antibody and two residues outside the CDRs but adjacent to the CDRs, Tyr 49 in VL and Thr
`30in VH, make contact with the lysozyme. Ten of the 17 contacting residues are in VH. Four of the 10
`contacting VH residues and three of the seven contacting residues in VL are in the corresponding
`CDR3s. Table 1 lists the residues on the anti—lysozyme and on lysozyme which are in contact. These
`findings, if and to the extent applicable to anti-carbohydrate sites, with respect to interactions of side
`chains on essentially flat surfaces could have substantial implications for our understanding of these
`antigen-antibody interactions. However, the J539 site would appear to be some type of groove
`complementary to a tetrasaccharide. Unfortunately thus farthe crystal form has not allowed the ligand
`to enter the site (64). Figure 5 is a stereoview of the lysozyme-antilysozyme carbon skeleton showing
`the region of interaction.
`
`
`
`Bence Jones dimer LOC (61) has a convex (male) binding site quite different from the usual Bence
`Jones dimers MCG and REL if such a male type combining site were to be found for an Fab, the
`possibility would have to be considered that a reciprocal type of antigen-antibody interaction might occur
`in which the side chains of the CDRs would fit into a groove or depression on the surface of an antigen.
`The possibility has been noted that interactions might occur with the CDFis of the two faces of the
`projecting convex site (61). RHE also has a quite distinct type of binding site based on a unique VL-
`VL interaction. The basis for such differences is not understood but could contribute a new parameter
`to site complementarity and diversity.
`
`The sequence data may be used to make rough screens of a new sequence for homology with
`the V-region. lithe sequence to be compared is aligned with the large V- region summary tables, one
`can ascertain whether any homology exists. if homology involves the less frequently occurring residues,
`they can be found in the individual tables and homology evaluated.
`
`The variable region a—group allotypes and allotype a~negative rabbit VH chains have been
`correlated with certain amino acids in FRt and FR3 as follows (69):
`
`Allotype
`
`Amino Acid Position
`
`FRt
`
`5
`
`8
`
`to
`
`12
`
`13
`
`VHat
`
`glu
`
`gly
`
`ARG val
`
`THFi
`
`16
`
`thr
`pro
`gly
`
`17
`
`pro
`gly
`ser
`
`65
`
`gly
`
`67
`
`7O
`
`phe
`
`ser
`
`71
`
`lys
`
`FRS
`
`74
`
`thr
`
`75
`
`Ha
`
`76
`
`[—]
`
`84
`
`85
`
`87
`
`THFi GLU thr
`
`VH a2
`
`LYS GLU gly
`
`PHE lys
`
`ASP THR
`
`SER SER THR ARG ASN GLU asn ala
`GLN thr
`ser
`GLY ala
`ala
`ala
`
`glu
`VH 33b
`VHatoo glu
`
`VH a—
`
`glu
`val
`
`gly
`gly
`
`gly
`
`ASP val
`gly
`val
`
`gly
`
`val
`
`lys
`gin
`
`gin
`
`ala
`ala
`
`gly
`glu
`thr
`
`ser
`ser
`
`gly
`ser
`
`gly
`
`gly
`
`phe
`thr
`
`phe
`
`ser
`ser
`
`ser
`
`lys
`lys
`
`ser
`
`thr
`ser
`
`ala
`
`[—i
`[-1
`
`[—1
`[-1
`
`ala
`'2
`
`gin
`
`asn ala
`
`ala
`?
`
`ala
`
`thr
`MET
`
`thr
`
`3 Square brackets indicate gaps to maximize homology. Allotype related residues are in capitals.
`
`b In some a3-like genes, codons for amino acids 75 and 76 were found (85).
`
`Page 10
`Page 10
`
`

`

`XV
`
`in Figs. 1 and 2, the location of the allotypic regions may clearly be seen to be on the outside of
`VH away from the combining site. Residues 13 and 65 of VH are numbered and will facilitate location
`of the VHa allotypes. The few cDNA sequences (69) available have provided no evidence as yet that
`germ line sequences encoding latent allotypes may exist in some rabbits. Antisera to rabbit VHa
`allotypes crossreact with human lgG (86), various other species of lgM and lgG, and with the Galapagos
`shark 7S immunoglobulin and correlate with the N~terminal amino acid sequence (87).
`
`There are substantial species differences between the human, rat and rabbit Cx allotypes.
`The amino acid sequences of rabbit C,C allotypic determinants K—1, b4, b5 and b9 differ at 47 of 106
`positions, the differences occuring in clusters; the K-2 bas isotype differs at three additional positions
`(88) whereas the human CK allotypes differ by two positions (89) and the rat R1-1a and R1-1b differ at
`11 positions (90).
`
`TABLE 1
`
`Antibody Residues involved in Contact with Lysozyme
`
`[
`Antibody residues
`Lysozyme residues in contact
`
`
`Light Chain
`CDR1
`
`FFl2
`CDFt2
`CDR3
`
`Heavy Chain
`FFl1
`CDR1
`
`CDR2
`
`His
`Tyr
`
`30
`32
`
`49
`Tyr
`50
`Tyr
`Phe 91
`Trp
`92
`Ser 93
`
`Thr
`Gly
`Tyr
`
`30
`31
`32
`
`52
`Trp
`53
`Gly
`Asp 54
`
`Leu 129
`Leu 25, Gln 121, He 124
`
`Gly 22
`Asp 18, Asn 19, Leu 25
`Gln 121
`Gln 121, Me 124
`Gln 121
`
`Lys 116, Gly 117
`Lys 116, Gly 117
`Lys 116, Gly 117
`
`Gly117, Thr 118, Asp 119
`Gly 117
`Gly 117
`
`
`
`
`
`CDR3
`
`)
`Arg 21, Gly 22, Tyr 23
`99 (96
`Arg
`97)
`Gly 22, Tyr 23, Ser 24, Asn 27
`Asp 100 (
`Thr 118, Asp 119, Val 12o, Gln 121
`Tyr 101 (98)
`Asn19,Gly22
`99)
`
`
`Sequence positions are numbered as in this book except for VH CDR3, where the numbers are
`given in parentheses; the others are sequential. (From (84). Amit, Mariuzza, Phillips, and Poljak (1986)
`Science 233:747—753; courtesy of Dr. Roberto Poljak and Science. Copyright 1986 by the AAAS.)
`
`
`
`FIG. 5. Stereo diagram of the Ca skeleton of the complex. Fab is shown (upper right) with the heavy and light
`chains with thick and thin bonds, respectively. The lysozyme active site is the cleft containing the label HEL.
`Antibody-antigen interactions are most numerous between lysozyme and the heavy chain CDR loops. (From (84).
`Amit, Mariuzza, Phillips, and Poljak (1986) Science 233:747-753; courtesy of Dr. Roberto Poljak and Science.
`Copyright 1986 by the AAAS.)
`
`Page 11
`Page 1 1
`
`

`

`
`
`It has proven extremely useful, except for mouse V,c chains, to order the VL and VH sequences
`into sets (68) such that all chains with identical FRI are listed together, the set with the most members
`being listed first. Chains differing in sequence from this set by a single residue are then listed in order
`of substitution, beginning at residue 22 for light chains and residue 30 for heavy chains and proceeding
`in decreasing position number to residue 1. These are then followed by chains with two amino acid
`differences, again listing in the same decreasing order, and followed by chains with three amino acid
`substitutions, etc. Amino acid residues differing from the major FR1 sequence are given in lower case
`letters, so that one can readily see the pattern of substitution. In this ordering procedure, missing residues
`are treated as potentially different from the main sequence. If residues are missing at position 23 in
`the light chain and position 22 in the heavy chain, they are assumed to be Cys to preserve the essential
`V-domain structure. Finally, sequences which are incomplete in FRi are given. Within a given FR1
`set, identical FR2 sets are also listed together.
`
`The human V2C rearranged and germ-line genes of all four subgroups have been sequenced
`(91-95). Human VKIV has but a single germ—line gene (92, 93); thus somatic mutation must play a
`dominant role in the utilization of this gene. Vxll genes are characterized by a much longer intron between
`the signal and V-region (94) than the other subgroups. Unlike the mouse the human in,Vxll, and VKlll
`genes are not separated in the genome, but a large section of the VK locus has been duplicated; both
`sections existing as two non-allelic clusters, containing eight and six genes of all three subgroups (91 ,95).
`All genes are in the 5’93’ orientation. These findings have necessitated reclassification of some
`incomplete sequences (RPMl-6410’CL).
`
`The tables of mouse Vxlight chains have been rearranged (96,97). In previous editions(1-3), mouse
`VX light chains were listed in one table with the length from residue 1 to Trp 35 specified. They have
`now been separated into eight tables. The first six tables vary from 41 to 34 residues by the different
`lengths of CDR1 (residues 27A—F); the sixth also lacks residue 28 and the seventh is also missing residue
`22 in FRt. If residues 1 through 35 have not been determined completely, the sequences are listed
`in the eighth table, unless they show good homology to more complete sequences in one of the other
`tables. In each table, the group number is given below the name of the chain. When residues 1 through
`35 have not been determined, the eariier group designation based on residues 1-23 (96) is given in
`square brackets [ ]. However, for each table the same principles have been used in ordering the chains.
`In all instances, residues 1-35 for the largest group are given in capitals. Variations from this sequence
`are in lower case, beginning at residue 34 and proceeding toward residue 1 as in the other tables.
`
`The mouse VH sequences have been revised to take into account their division into families by
`Dildrop (98) based on amino acid sequences and of Brodeur and Riblet (100) based on nucleotide
`sequences of completely sequenced V-regions. We have however retained the earlier classification
`of VH chains into subgroups but have subdivided the subgroups to list the families as follows:
`
`Antibody specificities found
`
`Dildrop (98)
`Dildrop et al (99)
`
`FamilyBrodeur and Riblet
`(100, 101)
`Winter at al (102)
`
`Subgroup IA
`B
`
`Ars, DNP, HEL, DIG, poly—GA
`2—PHEOX, HEL
`
`Subgroup II A
`
`a(1»3)a(1»6)DEX, RNA, Ars, ansepEx, DIG,
`HEL, ldAcSB, GAT
`
`B
`
`C
`
`NP, GAT, DIG, PC, |dAc38, 2-PHEOX
`
`GAT, GA, H-2K—k
`
`Subgroup III A
`
`PC, DNP, HEL, DNA
`
`a
`
`C
`D
`
`{i(1»6)GAL, a(1—>6)DEX, NAclVlAN, 5(2»5)FRU, GAT,
`STR‘AUS-DGICNAC)
`
`(3(2—>I)FRU, STRvA, SRBC, GAT
`DIG. H~2K-l<, SRBC
`
`Subgroup v A
`B
`
`Ans cni +, ARS car
`ARS, AFiS CRI +
`
`'
`
`Miscellaneous
`
`ARS, DNA, HEL, H-2K~k, GAT, DIG, 2-PHEOX, CEA
`
`3
`2
`8
`
`9
`
`i
`
`7
`
`4
`
`6
`i
`
`‘l
`
`5
`
`VH36~60
`VHQS2
`vHseoe,
`MV31
`
`desss
`VMUJ,
`VGAM3-8
`
`VHJ606
`
`VHSiO7
`
`VHJ558,
`VHX24
`
`VHJ606
`J558
`
`J558
`
`VH7183
`
`For newly sequenced VH regions, the nucleotide or amino acid sequence homologies have been
`compared with the previously classified sequences.
`
`Page 12
`Page 12
`
`

`

`xvii
`
`It is evident that antibody specificities for a given antigen fall into several of these subgroups and
`that many framework residues which are characteristic of the subgroups are sufficiently different to
`indicate that different germ-line genes may give rise to antibodies of a given specificity (103).
`
`The classification given is in better accord with the amino acids (98) than with the nucleotides
`(100), since the latter used probes whereas the former was based on complete sequences. Two rabbit
`germ—line a—negative VH clones showed the greatest nucleotide sequence homology, 70.4-79.40/0 to
`VH X24, VH 7183, VH d606, and VH S107 (100) and considerably less homology to the other families.
`
`Each table of mouse V,C and VH chains is followed by a list of strains other than BALB/c.
`
`The members of identical FR and identical CDR sets are given in the notes. Members of individual
`FR1 sets may be associated with different FR2 sets, etc. (68), suggesting independent assortment of
`FR sets.
`
`A sequence identical to the FR2 sequence of the light chain of McP0603 has been found in two
`human Vxl, one human V7C lV, 31 mouse V,C (14 NZB and 17 BALB/c), one each mouse VXI and V,c
`VI, and 15 rabbit V,C sequences, and thus has been preserved for about 80 million years (104). it and
`the corresponding loop ofthe heavy chain are seen at the bottom center of each of the four stereo figures
`(Fig. 1); L40 and H41 are numbered to facilitate its location. Despite its preservation, there are 12 other
`FR2 sets in the mouse and 8 in the rabbit with sequence variation which may involve 13 of the 15 posi-
`tions, only Trp 35 and Gin 39 being invariant. The loop is in a relatively open position so that substitutions
`are readily permitted (104,63). The evidence of assortment of FR segments (68) suggested the
`hypothesis that the V-region was coded for by sets of minigenes for the FR and CDR segments and
`that these minigenes were assembled somatically during embryogenesis. FR2 of VH also shows
`substantial preservation, one set having six mouse and eight rabbit chains of identical amino acid
`sequence. It is extraordinary that one human VHlil genomic clone, VH26 (105), and a rabbit cDNA clone
`(106), were identical in nucleotide sequence of the codons for amino acids 36 through 47, differences
`being seen in codons for amino acids 48 and 49.
`
`Since only FR sets were used in demonstrating the assortment (68), it would be independent of
`and would be seen whether or not any CDR residues assorted with any FR. The early cloning studies
`(4,5) showed that the genes coding for residues comprising FR1 almost through CDR3 of mouse VA and
`V,C light chains were assembled in twelve-day-old mouse embryo DNA, and each was followed by an
`intervening sequence; in one VAi clone, two residues of CDR3 were included with FR4 (107). In an adult
`V) myeloma, the genes coding for the entire V—region were assembled (108). Thus, the minigene coding
`for FR4 plus the last two residues of CDR3 (107), termed the J segment (108), had been joined to the
`rest of the V-region between the twelfth day of embryonic life and the adult, and thus was added somat-
`ically by recombination. Milstein (17), in his original description of human Vx groups, had pointed out that
`subgroup associated residues extended only through residue 94 and that it seemed as if frequent cross-
`ing over occurred beyond residue 94. Weigert etal. (109), in studying the V,C 21 group in NZB myelomas,
`assorted the last two or three residues of CDR3 together with FR4 and suggested that this would
`contribute to the generation of diversity. With rabbit light chains, it was possible to assort the individual
`FR and CDR segments considering FR4, plus the last two residues of CDR3, as a J-minigene (110).
`
`The subsequent demonstration and sequencing of five J nucleotide segments from BALB/c, each
`separated by an intervening sequence in the mouse genome, supported the assortment data (68) and
`established J as a minigene. Four of the J segments encoded amino acid residues 96-108, but J3 ap-
`peared to be a pseudogene(111,112). In mouse VH four J-(1 13-115), in human VH six J-(1 16), in rat V,C
`six J-(1 17,1 18), in human V,C five J-(119), and in mouse V)L four J—minigenes (120,121 ) were recognized; .
`JA4 may not be functional. in rabbits of b4 and b5 allotypes, there are five V9c J—minigenes(122, 123);
`only J2 being functional. There appear to be two or more J,C minigenes that can be expressed with the
`b9 allotype and three potentially functional st are found associated with the K2 isotype (124, 125).
`
`Unlike the findings for light chains, the mouse genomic VH and JH segments did not encode
`complete V-regions (113,114). Segments coding for five and for 14 amino acids, most of CDR3, were
`missing. Subsequently, genomic DNAs capable of encoding three, four, and six amino acids were
`located, using as a probe an incompletely rearranged D-J segment including the flanking signal
`sequences (126-128). Some of these mouse D—minigenes coded for as many as six amino acids of
`CDR3, matches of 9 to 18 nucleotides having been found.
`
`Four human D-minigenes have been sequenced (129), but their presumed coding regions did
`not correspond to any CDR3. However, one VHlll human genomic clone (105) contained fourteen
`nucleotides in CDR2 which completely matched the D2 minigene coding segment (130). Another human
`VHll genomic clone (131) matched at 13 of these 14 nucleotides, the first of t

This document is available on Docket Alarm but you must sign up to view it.


Or .

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge
throbber

Still Working On It

This document is taking longer than usual to download. This can happen if we need to contact the court directly to obtain the document and their servers are running slowly.

Give it another minute or two to complete, and then try the refresh button.

throbber

A few More Minutes ... Still Working

It can take up to 5 minutes for us to download a document if the court servers are running slowly.

Thank you for your continued patience.

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.





Document Unreadable or Corrupt

Refresh this Document
Go to the Docket

We are unable to display this document.

Refresh this Document
Go to the Docket