`UNITED STATES PATENT AND TRADEMARK OFFICE
`_________________________
`BEFORE THE PATENT TRIAL AND APPEAL BOARD
`_________________________
`
`
`ILLUMINA INC.,
`Petitioner,
`v.
`MOLECULAR LOOP BIOSCIENCES, INC.,
`Patent Owner.
`_________________________
`
`Case No. IPR2024-00964
`Patent No. 11,041,852
`_________________________
`
`
`
`DECLARATION OF PAUL T. SPELLMAN, PH.D.
`
`
`
`
`
`
`
`TABLE OF CONTENTS
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`Page
`
`I.
`
`INTRODUCTION ........................................................................................... 1
`
`A. Qualifications and Experience .............................................................. 1
`
`B. Materials Considered ............................................................................. 5
`
`II.
`
`LEVEL OF ORDINARY SKILL IN THE ART ............................................. 6
`
`III. SUMMARY OF OPINIONS ........................................................................... 7
`
`IV. LEGAL STANDARDS .................................................................................12
`
`V.
`
`BACKGROUND AND STATE OF THE ART ............................................15
`
`A.
`
`First Generation Sequencing ...............................................................15
`
`B. Next Generation Sequencing Techniques Available as of
`
`December 2010 ....................................................................................18
`
`1.
`
`2.
`
`3.
`
`4.
`
`5.
`
`Overview of Sequencing Workflow .........................................19
`
`Basic Principles Behind Illumina and Roche 454
`
`Sequencing ................................................................................21
`
`Immobilization of Nucleic Acids of Interest ............................25
`
`Amplification of Nucleic Acids of Interest ...............................26
`
`Generation of Sequence Reads and Sequences of Nucleic
`
`Acids of Interest ........................................................................29
`
`6.
`
`Use of Sequence Identifiers in Multiplex Sequencing .............30
`
`i
`
`
`
`
`
`7.
`
`Resources available to a POSA before December 2010 ...........39
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`VI. SUMMARY OF THE PRIOR ART ..............................................................40
`
`A.
`
`Parameswaran (EX1004) ...................................................................40
`
`B. Gloor (EX1005) ..................................................................................46
`
`C.
`
`Bentley (EX1006) ................................................................................51
`
`VII. THE ’852 PATENT .......................................................................................53
`
`A. Overview .............................................................................................53
`
`B.
`
`The Claims of the ’852 Patent .............................................................54
`
`C. Disclosures of the ’852 Patent .............................................................56
`
`D.
`
`E.
`
`Prosecution History of the ’852 Patent ...............................................62
`
`Claim Construction..............................................................................64
`
`VIII. CHALLENGED CLAIMS ARE UNPATENTABLE OVER THE
`
`PRIOR ART ...................................................................................................65
`
`A. Ground 1: Claims 1-8 Would Have Been Obvious Over
`
`Parameswaran and Gloor ...................................................................65
`
`1.
`
`Claim 1 Would Have Been Obvious Over Parameswaran
`
`and Gloor ..................................................................................65
`
`2.
`
`A POSA Would Have Been Motivated to Combine
`
`Parameswaran and Gloor .........................................................81
`
`ii
`
`
`
`
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`3.
`
`A POSA Would Have Had a Reasonable Expectation of
`
`Success in Combining Parameswaran and Gloor ....................87
`
`4.
`
`Dependent Claims 2-8 Would Have Been Obvious Over
`
`the Combination of Parameswaran and Gloor ........................92
`
`B. Ground 2: Claims 1-8 Would Have Been Obvious Over
`
`Parameswaran and Bentley ...............................................................101
`
`1.
`
`Claim 1 Would Have Been Obvious Over Parameswaran
`
`and Bentley ..............................................................................102
`
`2.
`
`A POSA Would Have Been Motivated to Combine
`
`Parameswaran and Bentley ....................................................106
`
`3.
`
`A POSA Would Have Had a Reasonable Expectation of
`
`Success in Combining Parameswaran and Bentley ...............106
`
`4.
`
`Dependent Claims 2-8 Would Have Been Obvious Over
`
`Parameswaran and Bentley ....................................................109
`
`C. No Unexpected Results or Other Evidence of Nonobviousness .......112
`
`IX. CONCLUSION ............................................................................................112
`
`iii
`
`
`
`
`I.
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`INTRODUCTION
`I, Dr. Paul T. Spellman, have been retained as an independent expert in
`1.
`
`the field of genetics and DNA sequencing. I submit this declaration on behalf of
`
`Petitioner Illumina, Inc., in the above-captioned inter partes review (“IPR”).
`
`2.
`
`I am being compensated for my time in connection with this IPR at my
`
`standard hourly consulting rate of $500/hour. I do not have any personal or financial
`
`stake or interest in the outcome of this proceeding and my compensation is in no
`
`way contingent on the nature of my analysis or the outcome of this IPR or any other
`
`proceeding.
`
`3.
`
`I am over 21 years of age and, if I am called upon to do so, I would be
`
`competent to testify as to the matters set forth herein.
`
`A. Qualifications and Experience
`I believe that I am qualified to serve as a technical expert in this matter
`4.
`
`based upon my qualifications, discussed in detail below. A copy of my curriculum
`
`vitae is attached as Appendix A to this declaration.
`
`1
`
`
`
`5.
`
`I am a Professor of Medicine in the Division of Hematology and
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`
`
`Oncology as well as in the Department of Human Genetics at the University of
`
`California Los Angeles (“UCLA”) David Geffen School of Medicine. My research
`
`focuses on bioinformatics and genome sequencing, including developing improved
`
`sequencing techniques and applying genomic and computational technologies to
`
`improve human health. My research encompasses all phases of genomic research
`
`and sequencing, from technology and method development to application of
`
`technologies to answer critical questions in cancer biology, to population studies to
`
`understand the impact of genetic variation of disease, and to implementation trials
`
`that directly impact health. I have more than 25 years of experience in nucleic acid
`
`sequencing and have first-hand experience in designing multiplex sequencing
`
`assays.
`
`6.
`
`I graduated from the Massachusetts Institute of Technology in 1995
`
`with a Bachelor of Science in Biology. I then earned my Ph.D. in Genetics from the
`
`Stanford University School of Medicine in 2000.
`
`2
`
`
`
`7.
`
`Following my Ph.D., I conducted post-doctoral research from 2000 to
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`
`
`2003 at the University of California Berkley Department of Molecular and Cellular
`
`Biology, studying gene regulation and genomics. From 2003 to 2011, I was a
`
`Scientist and Staff Scientist at the Lawrence Berkeley Lab, Life Science Division.
`
`In 2011, I also served as the Special Assistant to the Deputy Director of the National
`
`Cancer Institute.
`
`8.
`
`In 2011, I joined the Oregon Health & Science University (“OHSU”)
`
`School of Medicine first serving as an Associate Professor, then Professor with
`
`tenure, and then the Penny and Phil Knight Endowed Professor in Cancer Research
`
`Innovation. There, I studied the use of population genetics to help determine who is
`
`at risk for cancer, how to computationally analyze genomic data to identify early
`
`changes in cancers, and how to accurately screen different populations for the
`
`disease. My research also included using genetic and genomic approaches to
`
`understand the processes by which cancer develops, monitor disease, and identify
`
`therapeutic strategies.
`
`3
`
`
`
`9.
`
`I spent 12 years at the OHSU School of Medicine before joining the
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`
`
`faculty of UCLA in 2023. During that time at OHSU, I held multiple leadership roles
`
`for programs relating to genomic sequencing, including serving as the Program
`
`Leader of the Quantitative Oncology Program, the Co-Director of the Cancer Early
`
`Detection Advanced Research (“CEDAR”) Center in the OHSU Knight Cancer
`
`Institute, and the Interim Director for the Program in Computational Biology.
`
`10. To date, I have authored about 200 publications, including 175 articles,
`
`3 book chapters, 5 letters to the editor, 12 literary reviews, and 4 abstracts, many of
`
`which relate to genomic research or apply genomic sequencing.
`
`11. Since 2002, I have served on 25 grant review committees, 6 conference
`
`planning committees, and over 30 project advisory, institutional, and editorial
`
`committees. I am also active in related professional societies, including having
`
`served as Treasurer and Board member for the Microarray Gene Expression Data
`
`Society and Data Coordination and as a member of the Management Working Group
`
`and Scientific Planning Committee for the International Cancer Genomics
`
`Consortium.
`
`12.
`
`I have served as a regular reviewer for journals relating to genome
`
`sequencing and biotechnology, including Bioinformatics, Nucleic Acids Research,
`
`Nature Communications, PLoS ONE, Cell Reports, Genome Research, and Genes,
`
`among others.
`
`4
`
`
`
`13. Since 2004, I have been awarded dozens of grants, fellowships, and
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`
`
`other sources of funding for my work, including for studying topics such as
`
`developing cost effective sequence-based technologies, studying genes relating to
`
`breast and ovarian cancer, and analyzing genomic data. Currently, I have research
`
`funding for a systematic analysis of genetic and gene regulation information in
`
`clinical cohorts as part of the Genome Data Analysis Network and a clinical trial
`
`implementing genetic health screening for hereditary breast and ovarian cancer and
`
`Lynch syndromes.
`
`14. Over the years, my work has been highlighted by news media outlets,
`
`including by the Salem Statesman Journal. I have given over 80 guest lectures on
`
`my work, and I have taught over 50 courses, workshops, and seminars on topics
`
`including genetics, genome
`
`sequencing, biotechnology, and
`
`sequencing
`
`technologies. I have also served as an advisor to over 20 post-doctoral fellows and
`
`pre-doctoral graduate students.
`
`15. As high-throughput sequencing has come to dominate the technological
`
`basis of my career, I have been well aware of, and my lab routinely uses, unique and
`
`non-unique dual-index sequencing methods.
`
`B. Materials Considered
`16. My analysis in this declaration is based on my education, personal
`
`knowledge, and professional and academic experience in the areas of genetics and
`
`5
`
`
`
`
`DNA sequencing, including multiplex sequencing and dual-indexing methods. My
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`analysis is particularly focused on the state of the art in December of 2010.
`
`17.
`
`In forming my opinions, I have considered the ’852 patent and other
`
`exhibits cited in my declaration, including those listed in Appendix B. I reserve the
`
`right to rely on documents cited in the appendices to this declaration, and to
`
`supplement my opinions in view of new materials and information that becomes
`
`available to me during this proceeding.
`
`II. LEVEL OF ORDINARY SKILL IN THE ART
`I understand that an assessment of a patent’s claims should be
`18.
`
`undertaken from the perspective of a person of ordinary skill in the art (“POSA”) as
`
`of the “effective filing date” of the patent claims. In performing my analysis, I have
`
`been asked to assume that the “effective filing date” of the ’852 patent is December
`
`23, 2010.
`
`19.
`
`I understand that the level of ordinary skill in the art is determined by
`
`looking at; (1) the type of problems encountered in the art; (2) the prior-art solutions
`
`to those problems; (3) the rapidity with which innovations are made; (4) the
`
`sophistication of the technology; and (5) the educational level of active workers in
`
`the field.
`
`20.
`
`In my opinion, a POSA would have had (i) a Ph.D. in molecular
`
`biology, genetics, bioinformatics, or a related field and at least two years of
`
`6
`
`
`
`
`experience in high-throughput sequencing technologies or (ii) a Master’s degree in
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`one of the same fields with at least four years of experience in high-throughput
`
`sequencing technologies. As explained above in Section I.A, I had at least these
`
`qualifications by December 2010 and am qualified based on my education and
`
`experience to provide an opinion as to what a POSA would have known and
`
`concluded as of December 2010. Supra § I.A.
`
`III. SUMMARY OF OPINIONS
`I understand that this declaration accompanies a Petition for IPR
`21.
`
`involving U.S. Patent No. 11,041,852 (“the ’852 patent”) (EX1001). I understand
`
`that
`
`the earliest possible “effective filing date” for
`
`the ’852 patent is
`
`December 23, 2010. In performing my analysis, I have been asked to assume that
`
`the “effective filing date” of the ’852 patent is December 23, 2010.
`
`22. The claims of the ’852 patent are directed to a “method for reducing
`
`template cross-over error introduced during sequencing workflow.” As discussed
`
`below, the sequencing workflow typically involved several steps, including
`
`extracting template nucleic acids from samples, preparing those template nucleic
`
`acids for sequencing, and sequencing the products. It was often advantageous to use
`
`polymerase chain reaction (PCR) or other amplification methods to replicate the
`
`template during the sequencing workflow to increase the number of molecules
`
`available for sequencing. In the case of multiplex sequencing where multiple
`
`7
`
`
`
`
`samples were sequenced simultaneously, sequencing products were combined
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`(“pooled”) at various steps during the sequencing workflow to gain efficiencies and
`
`reduce costs, including for PCR or other forms of DNA amplification.
`
`23. Numerous different types of errors, however, could occur during the
`
`amplification steps. One prominent type of error was where two different template
`
`nucleic acids create a hybrid product during amplification (usually through
`
`incomplete extension and mispriming at a subsequent step), resulting in new
`
`molecules, the sequences of which were not present in the initial mixture of samples
`
`prior to amplification. These incorrect molecules were commonly referred to as
`
`“chimeric” molecules because they appear as the fusion of two pre-existing
`
`molecules. These chimeric molecules could occur between templates derived from
`
`a single sample or templates from two different samples.
`
`24. The ’852 patent purports to reduce sequencing errors derived from
`
`chimeric molecules resulting from the fusion of two templates from two samples,
`
`which the ’852 Patent calls “crossover” errors. Specifically, the claims of the
`
`’852 patent recite incorporating a distinct pair of identifier sequences (also referred
`
`to as “barcodes” or “indexes”) to template nucleic acids from different samples,
`
`amplifying the templates on a surface of a flow cell, and discarding any sequence
`
`that contains a pair of identifier sequences that was not originally assigned to any of
`
`8
`
`
`
`
`the template nucleic acids—including chimeric sequences that arose through
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`“crossover” error.
`
`25. Based on my knowledge and experience and my review of the
`
`information described in this declaration, it is my opinion that claims 1-8 of the
`
`’852 patent would have been obvious to a person of ordinary skill in the art
`
`(“POSA”).
`
`26. As of December 2010, it was well known that chimeric sequences,
`
`including those resulting from cross-over errors, were problematic and could lead
`
`to, among other problems, misassignment of sequences. By then, it was also well
`
`known that using two identifier sequences per sample’s template nucleic acids (i.e.,
`
`“dual-indexing”) would be helpful to identify and discard sequences arising from
`
`cross-over errors. For example, Parameswaran et al., Nucleic Acids Research
`
`(2007) 35:e130 (“Parameswaran”; EX1004) disclosed multiplex sequencing
`
`methods where each sample’s template nucleic acids were combined with a pair of
`
`identifier sequences. In Parameswaran, every identifier sequence was unique to
`
`template nucleic acids from a single sample—a method commonly referred to as
`
`“unique” dual-indexing. Using unique dual-indexing, Parameswaran identified
`
`erroneous sequences containing incorrect pairs of identifiers, i.e., pairs of identifier
`
`sequences that had not been assigned to any sample. Parameswaran explains that
`
`these sequences are a “false-discovery,” constituting “misassignment” of sequences.
`
`9
`
`
`
`27.
`
`It was also known that dual-indexing methods could be performed
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`
`
`using any major next-generation sequencing platform,
`
`including Illumina
`
`sequencers where template amplification occurs on a surface of a flow cell. While
`
`performing the sequencing experiments using a Roche 454 sequencer (where
`
`amplification does not occur on a surface of a flow cell), Parameswaran predicted
`
`that its methods will be “compatible with other sequencing platforms,” including
`
`Illumina sequencers. EX1004 (Parameswaran), 8, 1.
`
`28. A few years
`
`later, Gloor et al., Cornell University Library
`
`arXiv:1007.5075v1 (2010) (“Gloor”; EX1005) empirically demonstrated that dual-
`
`indexing methods are platform agnostic and can be done using Illumina sequencing
`
`platforms. Gloor also disclosed multiplex sequencing methods where templates
`
`from each sample were tagged with a pair of identifier sequences. While Gloor
`
`repeated individual identifier sequences across different samples, Gloor used a
`
`distinct pair of forward and reverse identifier sequences per sample—a method
`
`commonly referred to as “non-unique” dual-indexing. Using non-unique dual-
`
`indexing, Gloor successfully identified erroneous sequences, including those
`
`recognized as caused by cross-over errors.
`
`29.
`
`In addition, Bentley et al., Nature (2008) 456:53-59 (“Bentley”;
`
`EX1006) disclosed detailed guidance for how to perform sequencing using Illumina
`
`10
`
`
`
`
`sequencers, allowing researchers in the field of sequencing to perform dual-indexing
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`methods, like those disclosed in Parameswaran, using Illumina sequencers.
`
`30. Thus, each and every element of the claims of the ’852 patent is found
`
`in each of the following combinations: (1) Parameswaran and Gloor and
`
`(2) Parameswaran and Bentley. As discussed below, a POSA would have been
`
`motivated to combine the prior art with a reasonable expectation of success. Indeed,
`
`to improve the accuracy of assigning sequences to samples, a POSA would have
`
`been motivated to use Parameswaran’s unique dual-indexing method, which
`
`identified chimeric sequences. A POSA would have been motivated to apply
`
`Parameswaran’s method to known next generation sequencing platforms including
`
`Illumina sequencing, which was known to be cheaper and more accurate than the
`
`platform used in Parameswaran. A POSA would have had a reasonable expectation
`
`of success in doing so because Bentley provided detailed guidance for Illumina
`
`sequencing and Gloor already demonstrated that Illumina sequencing was
`
`compatible with dual-indexing methods, while not disclosing any non-obvious
`
`technical hurdle that needed to be overcome. Thus, claims 1-8 of the ’852 patent
`
`would have been obvious to a POSA as of December 23, 2010. I am currently not
`
`aware of any objective evidence of non-obviousness for the claims of the ’852
`
`patent. I reserve the right to consider and comment on any such evidence after I sign
`
`this declaration.
`
`11
`
`
`
`
`IV. LEGAL STANDARDS
`31. The opinions I express in this declaration involve the application of my
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`technical knowledge and experience in evaluating certain prior art with respect to
`
`the ’852 patent. In preparing this declaration, certain patent law concepts have been
`
`explained to me by counsel, including the legal standard for interpreting claims as
`
`well as those for assessing obviousness of a patent claim.
`
`32.
`
`I have been informed that, in IPR proceedings such as this one, the party
`
`challenging the patent bears the burden of proving unpatentability by a
`
`preponderance of the evidence. I understand that a preponderance of the evidence
`
`means “more likely than not.”
`
`33.
`
`I understand that patentability must be analyzed from the perspective
`
`of a POSA in the same field as the challenged patent as of the “effective filing date”
`
`of the claims. I understand that a POSA is a hypothetical individual presumed to
`
`know the relevant art as of the effective filing date of the claims.
`
`34.
`
`I understand that the Patent Trial and Appeal Board interprets claims
`
`based on their ordinary meaning as understood by a POSA at the time of the effective
`
`filing date in view of the claim language, patent specification, and prosecution
`
`history. I also understand that the specification may reveal a special definition given
`
`to a claim term by an inventor that differs from its ordinary meaning as understood
`
`by a POSA. In that case, I understand that the inventor’s definition governs.
`
`12
`
`
`
`35.
`
`I understand that a patent claim is invalid if it would have been obvious
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`
`
`to a person of ordinary skill in the art as of the effective filing date. I understand that
`
`assessing obviousness entails considering: (1) the scope and content of the prior art,
`
`(2) the differences between the prior art and the claimed invention, (3) the level of
`
`ordinary skill in the art, and (4) any secondary considerations of non-obviousness.
`
`36.
`
`I understand that a claim may be obvious based on a combination of
`
`multiple prior-art references, as well as based on the knowledge and skill of a POSA
`
`as of the effective filing date. I also understand that to combine prior-art references,
`
`there must have been a motivation that would have prompted a POSA to do so. In
`
`other words, obviousness requires a motivation to combine the features of the prior
`
`art. I further understand that a POSA must have reasonably expected that the
`
`combination would work. That is, obviousness requires a reasonable expectation of
`
`success in combining the prior art to achieve the claimed subject matter. I also
`
`understand that, while obviousness requires a reasonable expectation of success, it
`
`does not require absolute predictability of success in achieving the claimed subject
`
`matter.
`
`37.
`
`I understand that a motivation to combine prior-art references may arise
`
`from a variety of sources, including, among other things, scientific literature, a need
`
`or unsolved problem in the field, or market demand.
`
`13
`
`
`
`38.
`
`I also understand that a claim may be obvious if a POSA would have
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`
`
`found it obvious to try combining a finite number of predictable solutions known in
`
`the art. For example, if one technique had been used to improve a method of using a
`
`device, and a POSA would recognize that the same technique was one of a limited
`
`number of solutions and would improve similar devices in the same way, a POSA
`
`may have found it obvious to try that same technique on similar devices.
`
`39.
`
`I understand
`
`that, when present, evidence of “secondary
`
`considerations” must be considered along with the other factual evidence relevant to
`
`obviousness. Such secondary considerations could include unexpected results,
`
`commercial success of products or processes using the invention, long-felt but unmet
`
`need for the invention, failure of others to make the invention, industry acceptance
`
`of the invention, or copying of the invention by others. I understand that, in order
`
`for such evidence to support nonobviousness, it must have a “nexus” to the features
`
`of the invention as claimed.
`
`40.
`
`I have been informed that a dependent claim is a patent claim that refers
`
`back to another patent claim. I have been informed that a dependent claim includes
`
`all of the limitations of the claim to which it refers.
`
`14
`
`
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`
`V. BACKGROUND AND STATE OF THE ART
`First Generation Sequencing
`A.
`41. Deoxyribonucleic acid (“DNA”) carries genetic information. DNA
`
`consists of two “reverse complementary” strands made of deoxynucleotides where
`
`nucleotides are linked between their phosphate and deoxyribose portions. In DNA,
`
`there are four types of nucleotides, which respectively contain adenine (A),
`
`guanine (G), thymine (T), and cytosine (C) bases. The two strands of a DNA
`
`molecule are held together by hydrogen bond formation between complementary
`
`base pairs. EX1031 (Garland), 3. Specifically, guanine (G) is complementary to and
`
`pairs with cytosine (C) and adenine (A) is complementary to and pairs with
`
`thymine (T). EX1031 (Garland), 3. Each nucleotide has a 5’ phosphate (“5’” or
`
`“5’-P”) end and a 3’ hydroxide (“3’” or “3’-OH”) end that links to the 5’ end of the
`
`next nucleotide. During extension of a given DNA by enzymatic or chemical means,
`
`nucleotides are added to the 3’ end of the existing chain through the catalyzed
`
`reaction of a deoxynucleotide triphosphate (dNTP) being used to add the nucleotide
`
`releasing a diphosphate in the reaction. EX1031 (Garland), 38.
`
`42. Knowing the sequence of a given DNA can be important for any
`
`number of reasons, including disease diagnosis, identifying disease-causing
`
`mutations, and confirming genetic identity. For the last five decades, many DNA-
`
`15
`
`
`
`
`sequencing techniques have been developed, most of which are based on the
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`complementarity of the two strands in DNA.
`
`43.
`
`In 1977, Dr. Frederick Sanger published his method of DNA
`
`sequencing, now known as “Sanger-based sequencing,” which uses a technique
`
`known as non-reversible termination to determine the order of nucleotides in a strand
`
`of DNA. EX1009 (Sanger 1977). The traditional Sanger-based sequencing workflow
`
`began by amplifying a sequence of interest into millions of copies through in vivo
`
`cloning. EX1012 (Shendure), 2. Genomic DNA of a target sequence was
`
`fragmented, cloned into plasmid vectors, and then transformed to E. coli for
`
`amplification. EX1012 (Shendure), 2. For each sequencing reaction, a single
`
`bacterial colony was picked and plasmid DNA is isolated. EX1012 (Shendure), 2.
`
`44. Following amplification and isolation, reverse strand synthesis was
`
`performed on these copies using a known priming sequence upstream of the region
`
`to be sequenced. EX1010 (Kircher), 2-3; EX1011 (Metzker), 1-2; EX1012
`
`(Shendure), 1. Reverse strand synthesis used four pools of polymerization reactions,
`
`each with a mixture of dNTPs (i.e., dATP, dGTP, dCTP, and dTTP), plus one
`
`chemically labeled dideoxynucleoside triphosphate, a dNTP missing a hydroxyl
`
`group at the ’3-end (i.e., ddATP, ddGTP, ddCTP, or ddTTP). EX1012 (Shendure),
`
`1; EX1010 (Kircher), 2. In other words, four reactions were performed (consisting
`
`of dNTPs plus ddATP, ddGTP, ddCTP, or ddTTP), wherein polymerization of the
`
`16
`
`
`
`
`nucleotides occurred by incorporation of dNTPs. EX1012 (Shendure), 1.
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`Polymerization was
`
`stochastically
`
`terminated wherever
`
`the
`
`labeled
`
`dideoxynucleotides happened to be incorporated. EX1010 (Kircher), 2-3; EX1011
`
`(Metzker), 1-2; EX1012 (Shendure), 1. Incorporation of a dideoxynucleotide into
`
`the growing strand terminated DNA polymerization because dideoxynucleotides
`
`lack the 3’-OH group necessary for the phosphodiester bond formation that would
`
`have allowed polymerization to continue. EX1010 (Kircher), 2-3; EX1011
`
`(Metzker), 1-2; EX1012 (Shendure), 1. The dNTPs/ddNTP mixture thus causes
`
`random, non-reversible termination of the extension reaction, creating different
`
`copies of molecules extended to different lengths. EX1010 (Kircher), 2-3; EX1011
`
`(Metzker), 1-2; EX1012 (Shendure), 1-2.
`
`45. The resulting molecules were sorted by molecular weight via
`
`electrophoresis (corresponding to the point where a labeled dideoxynucleotide
`
`terminated polymerization), and signals generated from labels attached to the
`
`terminating dideoxynucleotides were detected to determine which nucleotides were
`
`added. EX1010 (Kircher), 2-3; EX1011 (Metzker), 1-2; EX1012 (Shendure), 1-2.
`
`46. Although significant improvements were made to the technique after
`
`1977, Sanger-based sequencing remained time-consuming and expensive compared
`
`to later-developed techniques (“next generation sequencing”), described below.
`
`EX1010 (Kircher), 2. Among other disadvantages, Sanger-based sequencing
`
`17
`
`
`
`
`required isolating every molecule to be sequenced separately and thus was limited
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`to sequencing only one target sequence in each reaction.
`
`47. Consequently, Sanger-based sequencing was ill suited for sequencing
`
`thousands of target sequences in parallel. For example, Sanger-based sequencing
`
`was used in the first project of the Cancer Genome Atlas. See EX1035 (TCGA), 6.
`
`But it required massive amplification of DNA from more than one hundred
`
`specimens and then further amplification of thousands of individual targets to be
`
`sequenced in hundreds of samples. See EX1035 (TCGA), 1, 6.
`
`B. Next Generation Sequencing Techniques Available as of December
`2010
`48. Starting in the mid-2000s, “next-generation sequencing” techniques
`
`became commercially available, which greatly increased efficiency compared to
`
`traditional Sanger-based sequencing. See EX1022 (Li), Abstract. Among other
`
`advantages, “next generation sequencing” techniques facilitated first parallel
`
`sequencing (sequencing millions of molecules under one set of reaction conditions)
`
`and later “multiplex sequencing”—that is, the simultaneous sequencing of templates
`
`from multiple samples during a single run. See infra § V.B.2; EX1023 (Choi), 1
`
`(reporting “massively parallel DNA sequencing” using next-generation sequencing).
`
`49. As of December 2010, Illumina and Roche 454 were two of the major
`
`“next-generation” sequencing platforms known
`
`in
`
`the art. See EX1004
`
`18
`
`
`
`
`(Parameswaran), 1. Here, I provide a brief overview of the basic principles and the
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`sequencing workflow behind Illumina and Roche 454 sequencing.
`
`1. Overview of Sequencing Workflow
`50. Shortly after launching, “next-generation sequencing” techniques such
`
`as Roche 454 and Illumina “outperform[ed] the older Sanger-sequencing
`
`technologies by a factor of 100-1,000” and simultaneously “reduc[ed] the cost of
`
`sequencing.” EX1010 (Kircher), 1, 3, 5, 11. While the sequencing biochemistries
`
`used for Roche 454 and Illumina platforms differ, “their work flows are conceptually
`
`similar.” EX1012 (Shendure), 2.
`
`19
`
`
`
`
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`
`
`Figure 1. General Sequencing Workflow
`(Revised from EX1012 (Shendure), 2 (Figure 1(b)))
`51. As shown in Figure 1, both platforms start with sample preparation.
`
`This involved extracting DNA from a sample (or extracting RNA and generating
`
`cDNA). EX1012 (Shendure), 2 (Figure 1b). Sample preparation also involved
`
`20
`
`
`
`
`attaching sequencer-specific adapters, which were used to immobilize copies of the
`
`IPR2024-00964
`U.S. Patent No. 11,041,852
`
`DNA on a solid substrate during sequencing. Both platforms commonly used PCR
`
`to amplify the adapter-bound DNA molecules prior to sequencing. EX1012
`
`(Shendure), 2 (Figure 1b); EX1010 (Kircher), 4. While Roche 454 and Illumina
`
`platforms used their own platform-specific adapters, they shared many principles of
`
`operation and conf