`
`
`
` IN THE UNITED STATES DISTRICT COURT
`
`FOR THE DISTRICT OF DELAWARE
`
`GUARDANT HEALTH, INC.,
`
`
`
`Plaintiff,
`
`
`
`v.
`
`FOUNDATION MEDICINE, INC.,
`
`Defendant.
`
`
`
`
`
`C.A. No. 20-cv-1580-LPS
`
`JURY TRIAL DEMANDED
`
`FILED UNDER SEAL
`
`
`
`
`
`DECLARATION OF GREGORY COOPER, PH.D.
`
`
`
`
`
`
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 2 of 94 PageID #: 2702
`
`
`1.
`
`I, Gregory Cooper, Ph.D., have been retained by Weil, Gotshal & Manges
`
`LLP, counsel for the Plaintiff Guardant Health, Inc. (“Guardant”). I understand that Plaintiff
`
`has filed suit against defendant Foundation Medicine, Inc. (“FMI”) alleging infringement of
`
`United States Patent Number 10,704,086 (the ’086 Patent) and United States Patent Number
`
`10,704,085 (the ’085 patent)1.
`
`I.
`
`BACKGROUND AND QUALIFICATIONS, PREVIOUS TESTIMONY, AND
`COMPENSATION
`
`
`A.
`
`
`
`Background and Qualifications
`
`2.
`
`My qualifications are set forth in my C.V. in Appendix A. I am currently a
`
`Faculty Investigator at the HudsonAlpha Institute for Biotechnology (“HudsonAlpha”) in
`
`Huntsville, Alabama. I have held this position since 2010. HudsonAlpha, established in 2008,
`
`is an independent, non-profit institution dedicated to improving human health and welfare
`
`through biological research, genomic medicine, education, and economic development. Since
`
`coming to HudsonAlpha, I have set up an independent human genomics research lab. My lab
`
`works on projects related to the genetics of human disease, with a particular emphasis on
`
`research and clinical applications of genome sequencing for diagnosing children with
`
`developmental delays and intellectual disabilities. We also study the genetics and
`
`biochemistry of gene expression regulation, develop new computational approaches to
`
`analysis of human genomes and related datasets, and pursue other applications of genomic
`
`technologies to understand human biology.
`
`
`
`3.
`
`I am also concurrently an Associate Professor Adjunct in the Department of
`
`
`1 United States Patent Number 10,704,086 (the ’086 Patent) and United States Patent Number 10,704,085 (the ’085
`patent) are attached as Ex. 1 and Ex. 2 to this Declaration.
`
`
`
`2
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 3 of 94 PageID #: 2703
`
`
`Genetics at the University of Alabama at Birmingham (“UAB”). I have held a UAB adjunct
`
`faculty position since 2010.
`
`4.
`
`I earned a Bachelor of Arts in Microbiology and Bachelor of Science in
`
`Mathematics and Statistics, summa cum laude, from Miami University (Oxford, OH) in 2001.
`
`As an undergraduate student, I performed molecular biology and genetic research in research
`
`laboratories at Miami University, the University of Illinois at Urbana-Champaign, and the
`
`University of Cincinnati.
`
`5.
`
`I earned a Ph.D. in Genetics from Stanford University in 2006 under the
`
`mentorship of Professor Arend Sidow, Ph.D. My thesis work was mainly comprised of
`
`computational biology work, with a focus on evolutionary comparisons of mammalian
`
`genome assemblies to better annotate functional elements in human genomes and assess
`
`whether mutations in those elements might contribute to disease. While there, I worked and/or
`
`received training in areas including: theoretical and applied human genetics, molecular
`
`evolution,
`
`functional genomics, developmental biology, bioinformatics, genomic
`
`technologies, and analysis of datasets produced by high throughput experimental platforms,
`
`especially DNA sequencers.
`
`6.
`
`Beginning in 2006, I was a Senior Fellow in the Department of Genome
`
`Sciences at the University of Washington working with Professors Evan Eichler, Ph.D., and
`
`Deborah Nickerson, Ph.D. While there, I conducted my postdoctoral research. I was primarily
`
`interested in applying genomic technologies to better understand the genetic basis for human
`
`disease. One series of projects centered on analytical and algorithmic developments to identify
`
`large deletion and duplication events, also known as “copy-number variants” (“CNVs”), from
`
`microarray data generated to perform Single Nucleotide Polymorphism (“SNP”) genotyping.
`
`
`
`3
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 4 of 94 PageID #: 2704
`
`
`This process involved extensive processing, normalization, and analysis of Illumina SNP
`
`array data. I was further a key developer of a customized CNV genotyping assay using
`
`Illumina’s BeadXpress platform. A second series of projects was aimed at better
`
`understanding the genetic basis for a variety of complex traits. These included genetic
`
`analyses of transcript levels, for example, and also a genome-wide association study of patient
`
`response to the commonly prescribed anticoagulant warfarin. Finally, another series of
`
`projects focused on analysis of high-throughput DNA-sequencing data, in particular the
`
`locations of genetic variants, from individuals with Mendelian diseases. I completed my
`
`postdoctoral studies in 2009.
`
`7.
`
`From 2009 to 2010, I was Acting Assistant Professor in the Department of
`
`Genome Sciences at the University of Washington.
`
`8.
`
`I am an author or contributor to more than 90 publications in the areas of
`
`human genetics and genomics, including functional and evolutionary annotations of
`
`mammalian genomes, DNA sequencing and DNA analysis, and the genetics of human
`
`diseases, especially, but not exclusively, intellectual disabilities and other developmental
`
`issues. As of November 2020, these publications have been collectively cited more than
`
`33,000 times, according to Google Scholar estimates.
`
`9.
`
`Among the awards and honors I have earned are: Harrison Scholar, Miami
`
`University (Oxford, OH) (1997-2001), Goldwater Scholar (1999-2001), Howard Hughes
`
`Medical Institute Doctoral Fellowship (2001-2006), and a Merck, Jane Coffin Childs
`
`Memorial Fund Postdoctoral Fellowship (2007-2010).
`
`10.
`
`I am a Principal Investigator (“PI”) on two current human genomic research
`
`projects funded by the National Institutes of Health (“NIH”) and a PI or co-Investigator on
`
`
`
`4
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 5 of 94 PageID #: 2705
`
`
`several additional projects funded by non-NIH sources, including both governmental and
`
`philanthropic sources. I have also been a PI or co-Investigator on multiple additional grants
`
`related to human genetics and genomics, from NIH and elsewhere, that have been completed.
`
`11.
`
`In general, I have knowledge and experience with both the general scientific
`
`contexts relevant to these litigation proceedings and specific details of the technologies
`
`employed. For example, I have extensive experience with analyzing massively parallel
`
`sequence datasets from human samples, including targeted sequencing, whole-genome
`
`sequencing, functional genomics in which loci with particular properties or sequence elements
`
`are enriched through various experimental means, transcriptional genomics, and other related
`
`types of experiments. Of particular relevance to these proceedings, I have extensive
`
`experience with computational pipelines to detect genetic variants (“mutations”) within
`
`collections of sequence reads.
`
`12.
`
`I have previously served as an expert witness in three patent infringement
`
`proceedings. First, I worked on behalf of Illumina in a lawsuit against Ariosa (now owned by
`
`Roche), case number 3:12-cv-05501-SI (N.D. Ca.), which centered on non-invasive prenatal
`
`diagnostic technologies based on cell-free DNA testing. I was deposed in that proceeding and
`
`also testified at trial. Additionally, I have previously worked as an expert on Guardant’s
`
`behalf in two lawsuits directly related to this one, including infringement cases against
`
`Personal Genome Diagnostics (PGDX) and Foundation Medicine (FMI), case numbers 17-
`
`cv-1623-LPS-CJB (D. Del.) and 17-cv-1616-LPS-CJB (D. Del.) I was deposed in both of
`
`those cases; the former has been settled and a trial date for the latter has been delayed to an
`
`as-yet undetermined date.
`
`13.
`
`Based upon my experience, qualifications and expertise, I am qualified as an
`
`
`
`5
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 6 of 94 PageID #: 2706
`
`
`expert in the fields of DNA sequencing and nucleic-acid based assays and the technologies
`
`relating to the ’086 Patent. Furthermore, I believe that I am qualified to opine as to the
`
`presence and extent of potential patent infringement within this scientific and technological
`
`area.
`
`B.
`
`Compensation
`
`14.
`
`I am being compensated for my time at a rate of
`
` per hour. My
`
`compensation is not dependent in any way upon the outcome of this proceeding.
`
`15.
`
`Beyond having general and specific expertise from my educational
`
`background and work, I have also performed a careful and extensive review of documents
`
`and evidence specific to this litigation, including, but not necessarily limited to:
`
`
`
`
`
`the ’086 Patent and the ’085 Patent
`
`relevant scientific publications, including those published by FMI employees and
`
`other scientists not associated with the parties of this litigation; and
`
` publicly available information on Foundation’s websites.
`
`16.
`
`A full list of the documents, products, and information I have reviewed is listed
`
`in Appendix B. To the extent not identified in Appendix B, I have also considered each of the
`
`documents cited in this report.
`
`17.
`
`Based on my review of the foregoing materials, it is my opinion that FMI
`
`infringes claim 1 of the ’085 Patent and claim 1 of the ’086 Patent.
`
`II.
`
`LEGAL STANDARDS
`
`18.
`
`I am not a patent attorney nor have I independently researched the law on
`
`infringement. However, I have been informed by counsel of the various legal standards that
`
`apply to the pertinent technical issues, and I have applied those standards in arriving at my
`
`conclusions expressed in this Report.
`
`
`
`6
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 7 of 94 PageID #: 2707
`
`
`19.
`
`I have been informed that the “specification” of a patent includes the written
`
`description of one or more preferred embodiments of the invention, any drawings or figures,
`
`and one or more claims that point out and distinctly claim the subject matter of the invention.
`
`It is the claims that define the scope of the patent. A separate invention is defined by each
`
`claim.
`
`20.
`
`I am informed that a dependent claim incorporates each and every element of
`
`the claim from which it depends.
`
`21. My understanding of is that an accused method infringes a claim if it performs
`
`all the elements of that claim, regardless of whether or not there are other additional steps in
`
`the accused method.
`
`22. While literal infringement requires each step to be literally performed in the
`
`accused method, infringement under the doctrine of equivalents can be found when every
`
`claim element that is not literally present is nonetheless equivalently performed.
`
`23.
`
`I am informed that, under the doctrine of equivalents, even if an accused
`
`method does not literally perform one or more limitations of a claim of a patent, the accused
`
`method may still infringe that claim if the differences between the accused method and the
`
`limitation of the claimed method are insubstantial.
`
`24.
`
`I am further informed that one test for the applicability of the doctrine of
`
`equivalents is that the accused method performs substantially the same function in
`
`substantially the same way to achieve substantially the same result as the claimed method.
`
`25.
`
`I have been informed that the burden of proof for proving patent infringement
`
`is by a preponderance of the evidence, which means that the accused device is more likely to
`
`infringe than not to infringe a claim of the patents-in-suit.
`
`
`
`7
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 8 of 94 PageID #: 2708
`
`
`III.
`
`SUMMARY OF THE TECHNOLOGY
`A.
`
`DNA
`
`26.
`
`DNA is the molecule that stores genetic information in living organisms,
`
`including in cancer-causing tumor cells. DNA is a chain of four basic building blocks, known
`
`as nucleotides. The four different kinds of nucleotides are Thymine (T), Cytosine (C),
`
`Adenine (A), and Guanine (G). The order of these nucleotides encodes genetic information in
`
`living organisms. See Recombinant DNA: Genes and Genomes – A Short Course, 3rd Ed by
`
`James D. Watson, et al.
`
`27.
`
`The human genetic code contains ~3 billion nucleotides spread out over 23
`
`chromosomes. A complete sequence of these nucleotides is referred to as a “human genome”.
`
`There is, on average, an ~99.9% similarity between any given stretch of DNA sequence in
`
`one person versus another; that is to say, that, there is a “point substitution” approximately 1
`
`in every 1,000 nucleotides of genomic sequence (e.g., Ex. 3 [A map of human genome
`
`variation from population-scale sequencing]). Other types of variation, such as insertions and
`
`deletions (“indels”), copy-number variants (“CNVs”), and rearrangements, can also be
`
`observed between any two individuals. Further, humans are “diploid”, which is to say that
`
`they inherit one set of chromosomes from their mother and one from their father. The
`
`similarity between these individual genomic copies, sometimes referred to as “haploid
`
`genomes”, also tends to be ~99.9% in any given stretch of sequence. Additionally, as for
`
`differences between two people, other types of variation, such as indels, CNVs, and
`
`rearrangements, can also exist between the two haploid genomes within one person. In
`
`general, variations in only a small fraction of the 3 billion nucleotides in the human genome
`
`– as stated previously, for example, point substitutions tend to occur at only ~0.1% of all
`
`positions -- are responsible for wide variations among human beings.
`
`
`
`8
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 9 of 94 PageID #: 2709
`
`
`Fig. 12
`
`
`
`28.
`
`Another aspect of human genetics important to these proceedings is genetic
`
`variation among cells in a person. In particular, nearly all cells, with a few exceptions such as
`
`red blood cells and gametes, are diploid and contain nearly the same two haploid genomes,
`
`one that was inherited from a person’s mother and one from their father. However, differences
`
`can also arise within a given cell or subset of cells inside a given person. This phenomenon is
`
`sometimes termed “somatic” variation. In some cases, a genetic variant arises in a cell that
`
`can alter the rate at which it grows, behaves, or divides. The “daughter” cells that emerge arise
`
`when that cell divides will inherit that variant. If that cell divides too rapidly, or causes other
`
`cells to divide too rapidly, it can lead to cancer.
`
`29.
`
`It is also important to note that DNA can also be found outside of cells in the
`
`
`2 From Guardant Health, Inc.’s Technology Tutorial from Case Nos. 1:17-cv-01616 (D.I. 52) and 1:17-cv-01623 (D.I.
`58).
`
`
`
`9
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 10 of 94 PageID #: 2710
`
`
`body. When cells die, for example, they may burst open and release their contents into the
`
`surrounding “extracellular” space nearby; the genomic DNA present in these cells thus
`
`becomes “cell-free DNA”. While much of this cell-free DNA is quickly degraded, it can
`
`sometimes remain stable and intact and enter the bloodstream, a basic phenomenon which
`
`was discovered many decades ago (Mandel and Metais 1948). As technologies matured,
`
`scientists and clinicians began to recognize that analyses of cell-free DNA in people may
`
`provide clues about their health. For example, in 1997 Dennis Lo and colleagues showed that
`
`fetal DNA could be detected in plasma, which is a cell-free component of blood, of pregnant
`
`women (Ex. 5 [Presence of fetal DNA in maternal plasma and serum]). Similarly, it was also
`
`found that cell-free DNA levels in the plasma could be altered by cancer (e.g., by Leon in
`
`1977) and that cell-free DNA that was specific to tumor cells could also be detected (e.g., by
`
`Stroun in 1989).
`
`B.
`
`Sequencing
`
`30.
`
`One can make a determination of the sequence of an individual’s DNA using
`
`machines known as sequencers. Specifically, below is an Illumina Next-Generation
`
`Sequencing (NGS) machine, which is widely used by both academic researchers and
`
`diagnostics companies for “high-throughput DNA sequencing.”
`
`31.
`
`The Illumina sequencing platform can simultaneously sequence millions of
`
`fragments of DNA at the same time, and is hence commonly referred to as a “massively
`
`parallel” sequencing platform.
`
`
`
`10
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 11 of 94 PageID #: 2711
`
`
`Fig. 33
`
`
`
`32.
`
`There are four basic types of genetic mutations that can lead to cancer, shown
`
`in the four figures presented below.
`
`
`
`Fig. 44
`
`
`3 From Guardant Health, Inc.’s Technology Tutorial from Case Nos. 1:17-cv-01616 (D.I. 52) and 1:17-cv-01623 (D.I.
`58).
`
` 4
`
` From Guardant Health, Inc.’s Technology Tutorial from Case Nos. 1:17-cv-01616 (D.I. 52) and 1:17-cv-01623 (D.I.
`58).
`
`
`
`
`11
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 12 of 94 PageID #: 2712
`
`
`33.
`
`First, in the upper left, there are single nucleotide variants, where one
`
`nucleotide is replaced with another. In the figure, the typical nucleotide is an A, and is replaced
`
`with a C.
`
`34.
`
`Second, in the upper right, there are insertions & deletions, also known as
`
`“indels”, where a nucleotide or group of nucleotides is added or removed, respectively. In the
`
`figure, the normal sequence consist of as stretch that reads T-C-A-T, but the A has been
`
`deleted such that the mutated sequence reads T-C-T.
`
`35.
`
`Third, in the lower left, there are gene fusions, where a DNA rearrangement
`
`occurs and leads to genes being combined together or otherwise rearranged so as to lead to a
`
`change in how the gene functions. In the figure, the blue and orange regions of the
`
`chromosome on the left are re arranged and appear instead in the chromosome on the right.
`
`36.
`
`Finally, in the lower right, there are copy number variations (CNVs), also
`
`known as gene amplifications or deletions, where whole genes or even entire chromosomes
`
`are duplicated or deleted within a cell. In the figure, two examples are shown. First, there is a
`
`CNV in which a single gene is either deleted or duplicated. As an example, for the blue gene
`
`labeled C, there can be one less copy or one extra copy.
`
`37.
`
`I note that, in general, the terms “genetic variant”, “variant”, “variation”,
`
`“genetic variation”, “mutation”, “allele”, “genetic aberration”, “genetic alteration”, and
`
`related terms, while not strictly identical in meaning, are similar terms and often used
`
`interchangeably and generally to refer to a change in DNA between two individuals, between
`
`two copies of genetic information within an individual, or among cells within an individual.
`
`Many, in fact most, “mutations” are harmless with little to no effect on a cell, tissue, or
`
`organism, while some can have large effects and lead to disease, like cancer. Unless otherwise
`
`
`
`12
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 13 of 94 PageID #: 2713
`
`
`specified, and as a reflection of their interchangeable use amongst the various documents and
`
`sources used for this report, such terms are also used interchangeably and generically below.
`
`IV.
`
`BACKGROUND OF THE INVENTION
`A.
`
`Tissue Biopsies
`
`38. Most cancer-diagnosis techniques require the use of invasive biopsies in which
`
`one uses a needle or surgical instrument to physically extract a piece of the tumor, as shown
`
`in the figure below. Tissue biopsies, however, carry many disadvantages. First, tissue biopsies
`
`require invasive procedures that have a chance of medical complications. Indeed, it is by no
`
`means straightforward to sample a tumor that is, for instance, present in a patient’s brain or
`
`lung. Second, tissue biopsies are costly, due to the extensive imaging and surgical work that
`
`is typically required to obtain one. Third, tissue biopsies require the presence of a known
`
`tumor, limiting their use only to patients with more advanced stage cancers. Fourth, tissue
`
`biopsies often sample only a small site in a tumor. This means that biopsy samples do not
`
`accurately represent the entire tumor or set of tumors in a patient and may miss clinically
`
`relevant biomarkers due to tumor heterogeneity. This could lead to a biopsy missing key
`
`mutations.
`
`
`
`Fig. 55
`
`
`5 From Guardant Health, Inc.’s Technology Tutorial from Case Nos. 1:17-cv-01616 (D.I. 52) and 1:17-cv-01623 (D.I.
`58).
`
`
`
`13
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 14 of 94 PageID #: 2714
`
`
`39.
`
`Cellular DNA is normally locked inside a cell within its nucleus; sometimes,
`
`however, bits of DNA are released into the bloodstream. For example, when a cell dies, its
`
`chromosomes are spilled into the blood stream, and are then quickly digested by the body into
`
`small fragments consisting of, on average, about 170 nucleotides in length. This is known to
`
`happen regularly with tumor cells, as shown in the figure below, albeit sometimes with larger
`
`DNA fragments. Guardant’s technology analyzes this type of DNA, also known as “cell-free
`
`DNA” or “cfDNA.” cfDNA that originates from cancer cells is known as “circulating tumor
`
`DNA” or “ctDNA.”
`
`
`
`Fig. 66
`
`B.
`
`Liquid Biopsies And The Approach Of The Patents-in-Suit
`
`40.
`
`In this section, I provide an overview of “liquid biopsies” and the approach in
`
`the Patents-in-Suit. To be clear, this section is intended to give a general overview of the
`
`patented approach, and does not capture all of the specific details recited in the various claims
`
`of the patent-in-suit.
`
`41.
`
`The ’086 and ’085 Patents are continuations of a common parent and share a
`
`common specification. The common specification describes a series of innovations in the
`
`field of “liquid biopsies.” The field of liquid biopsies is one in which cancer causing mutations
`
`
`6 From Guardant Health, Inc.’s Technology Tutorial from Case Nos. 1:17-cv-01616 (D.I. 52) and 1:17-cv-01623 (D.I.
`58).
`
`
`
`14
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 15 of 94 PageID #: 2715
`
`
`are detected using cfDNA, extracted from a simple, non-invasive blood draw. Using cfDNA
`
`overcomes many of the challenges of tissue biopsies. A simple blood draw avoids an invasive
`
`biopsy, reduces cost, and increase patient safety. Further, because cfDNA is released by most
`
`tumor cells and only stays in a person’s body for a limited time, this approach is able to
`
`analyze an up-to-date spectrum of mutations across the entire collection of tumors in a cancer
`
`patient.
`
`42.
`
`One primary problem with using cfDNA to find mutations is that cfDNA is
`
`present in very small quantities. And the percentage of cfDNA that contain DNA originating
`
`from tumors is even lower. The percentage is often so low that the mutations are
`
`indistinguishable from errors, or noise, that result from the typical DNA preparation and
`
`sequencing process. Thus, detecting mutations associated with cancer becomes a problem of
`
`finding a needle in a haystack.
`
`43.
`
`The figure below is from Guardant’s publication describing their technology
`
`for detecting cancer-causing mutations in cfDNA, which is known in the industry as “digital
`
`sequencing.” In the figure below, true mutations are highlighted in red, and deviations from
`
`the normal human sequence that are not real, but are nothing more than standard sequencing
`
`errors, are shown by black dots. Normal human genetic variation is present in much higher
`
`quantities, shown in green.
`
`44.
`
`As the figure below shows, the prevalence of true mutations that cause cancer
`
`is so low that they are indistinguishable from noise.
`
`
`
`15
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 16 of 94 PageID #: 2716
`
`
`Fig. 77
`
`
`
`45.
`
`As the common specification of the Patents describes, false positive mutations
`
`come largely from two sources. See, e.g., Ex. 2 [’085 patent] at 31:30-37 (“Noise can be
`
`introduced through errors in copying and/or reading a polynucleotide. For example, in a
`
`sequencing process a single polynucleotide can first be subject to amplification. Amplification
`
`can introduce errors, so that a subset of the amplified polynucleotides may contain, at a
`
`particular locus, a base that is not the same as the original base at that locus. Furthermore, in
`
`the reading process a base at any particular locus may be read incorrectly.”). To detect
`
`sequences, DNA must first be copied. Any copying process that is not 100% accurate will
`
`introduce false mutations. Additionally, modern sequencing techniques may have an
`
`additional error rate of 0.1 to 10%, depending upon the particular sequencing technology and
`
`application. These factors generate the vast majority of noise described above.
`
`46.
`
`As the Asserted Patents explain, Guardant’s approach processes amplified
`
`copies of an original cfDNA molecule so as to be analyzed as a group, or family of related
`
`
`7 Ex. 5 [R.B. Lanman, et al., Analytical and clinical validation of a digital sequencing panel], p. 6, Fig. 2.
`
`
`
`16
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 17 of 94 PageID #: 2717
`
`
`sequences. By analyzing families of sequences, a consensus sequence can be generated that
`
`substantially eliminates errors that may have arisen in any single sequence analysis.
`
`47.
`
`Guardant’s methods involve tagging cfDNA molecules prior to amplification.
`
`This allows treatment of all similarly tagged molecules as members of the same family,
`
`containing the same information. In this way, any noise that may be introduced to one
`
`particular observation will not affect the total output of the whole family.
`
`48.
`
`Applying these innovative techniques reduces errors to lower than one in a
`
`million base pairs, allowing true mutations, even ones that are extremely rare, to be
`
`distinguished. This is shown in the figure below, again from Guardant’s publication. Using
`
`Guardant’s digital sequencing technology, the noise, which is shown as the black dots, largely
`
`vanishes. The true mutations, which are shown in red, are easily identified.
`
`Fig. 88
`
`
`
`49.
`
`The process begins by keeping track of cfDNA molecules obtained from a
`
`patient. Guardant does this by attaching molecular identifiers, also known as barcodes, to both
`
`ends of the cfDNA molecule. The figure below is from another of Guardant’s publications.
`
`
`8 Ex. 5 [R.B. Lanman, et al., Analytical and clinical validation of a digital sequencing
`panel], p. 6, Fig. 2A.
`
`
`
`17
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 18 of 94 PageID #: 2718
`
`
`On the left, there is cell free DNA. The red and green colored fragments are the tumor DNA.
`
`Barcodes are represented as colored segments at the ends of each piece of DNA. The cfDNA
`
`fragments with their barcodes are copied (amplified) to increase the overall signal. Because
`
`each copy of a tagged molecule will have the same barcodes, the barcodes can be used to help
`
`identify the DNA sequence information that originated from a particular tagged cell free DNA
`
`molecule.
`
`Fig. 99
`
`
`
`
`
`50.
`
`A challenge arises from the fact that there are often billions of fragments of
`
`cfDNA fragments in each sample that must be identified. Tagging each fragment with a
`
`different barcode typically involves the generating random sequence barcodes that may, for
`
`example, vary considerably from batch-to-batch, leading to sequences that interfere with one
`
`another or with cfDNA fragments, pose difficulties to the sequencers, and otherwise reduce
`
`consistency, efficiency, and/or accuracy of cfDNA testing.
`
`51.
`
`Guardant’s inventors recognized that uniquely tagging each cfDNA fragment
`
`is not necessary. Rather, the number of tags need only meet a sufficient threshold, depending
`
`
`9 From J.I. Odegaard, et al., Validation of a plasma-based comprehensive cancer genotyping assay utilizing orthogonal
`tissue- and plasma-based methodologies at 3542.
`
`
`
`18
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 19 of 94 PageID #: 2719
`
`
`on the amount of DNA that is being analyzed.
`
`52.
`
`Once cfDNA is tagged, it is amplified, or copied using known techniques. The
`
`copied DNA is sequenced, generating sequence reads.
`
`53.
`
`Guardant’s proprietary algorithms compare each single sequence read to a
`
`reference genome, which is simply a digital representation of a previously determined
`
`sequence for a representative haploid human genome. This process is commonly referred to
`
`as mapping or aligning, and allows one to determine where a sequence is from in the genome.
`
`54.
`
`The mapping process provides additional context to determine whether
`
`sequence reads are copies of the same parent. This is especially important when not every
`
`cfDNA is uniquely tagged.
`
`
`
`
`
`
`
`
`
`
`
`19
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 20 of 94 PageID #: 2720
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`10 From Guardant Health, Inc.’s Technology Tutorial from Case Nos. 1:17-cv-01616 (D.I. 52) and 1:17-cv-01623 (D.I.
`58).
`
`
`
`20
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 21 of 94 PageID #: 2721
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`11 Based on Guardant Health, Inc.’s Technology Tutorial from Case Nos. 1:17-cv-01616 (D.I. 52) and 1:17-cv-01623
`(D.I. 58).
`
`12 From Guardant Health, Inc.’s Technology Tutorial from Case Nos. 1:17-cv-01616 (D.I. 52) and 1:17-cv-01623 (D.I.
`58).
`
`
`
`21
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 22 of 94 PageID #: 2722
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`V.
`
`THE ’086 PATENT
`
`62.
`
`63.
`
`The ’086 patent claims an exemplary embodiment of Guardant’s processes.
`
`Claim 1 of the ’086 Patent recites:
`
`1. A method for detecting a presence or absence of one or more somatic genetic
`variants in cell-free deoxyribonucleic acid (cfDNA) molecules from a bodily fluid sample
`of a subject, comprising:
`
`(a) non-uniquely tagging a plurality of cfDNA molecules from a population of
`cfDNA molecules obtained from the bodily fluid sample with molecular barcodes from a
`set of molecular barcodes to produce non-uniquely tagged parent polynucleotides,
`
`wherein the non-uniquely tagging comprises ligating molecular barcodes from the
`set of molecular barcodes to both ends of a cfDNA molecule from the plurality of cfDNA
`molecules using more than a 10x molar excess of molecular barcodes relative to the
`
`
`
`22
`
`
`
`Case 1:20-cv-01580-LPS Document 14 Filed 12/18/20 Page 23 of 94 PageID #: 2723
`
`
`population of cfDNA molecules,
`
`wherein the cfDNA molecules that map to a mappable base position of a reference
`sequence are tagged with a number of diffrent molecular barcodes ranging from at least 2
`and fewer than a number of cfDNA molecules that map to the mappable base position, and
`
`wherein at least 20% of the cfDNA molecules from the population of cfDNA
`molecules are attached to molecular barcodes;
`
`(b) amplifying a plurality of the non-uniquely tagged parent polynucleotides to
`produce progeny polynucleotides with associated molecular barcodes;
`
`(c) sequencing a plurality of the progeny polynucleotides to produce sequencing
`reads of the progeny polynucleotides with associated molecular barcodes;
`
`(d) mapping a plurality of the sequencing reads to the reference sequence to
`generate mapped sequencing reads;
`
`(e) grouping a plurality of the mapped sequencing reads into a plurality of families
`based on sequence information from the molecular barcodes and at least (1) a start base
`position of a given mapped sequencing read from among the mapped sequencing reads at
`which the given mapped sequencing read is determined to start mapping to the reference
`sequence and/or (2) a stop base position of the given mapped sequencing read at which the
`given mapped sequencing read is determined to stop