`(57) Abstract: Methods, devices and systems are provided herein for surfaces for de novo polynucleotide synthesis that provide for
`increased polynucleotide yield. Surfaces described herein comprise a texture that increases surface area and provide for increased
`polynucleotide yield compared to non-textured surfaces. In addition, the patterned placement of nucleoside coupling reagent spanning such
`surfaces provides for improved synthesis yield, representation, and a reduction in contamination on the surface between different
`polynucleotide species.
`This application claims the benefit of U.S. Provisional Application No. 62/370,548,
`filed August 3, 2016, which application is incorporated herein by reference in its entirety.
`All publications, patents, and patent applications mentioned in this specification are
`herein incorporated by reference to the same extent as if each individual publication, patent, or
`patent application was specifically and individually indicated to be incorporated by reference.
`Highly efficient chemical gene synthesis with high fidelity and low cost has a central
`role in biotechnology and medicine, and in basic biomedical research. De novo gene synthesis is a
`powerful tool for basic biological research and biotechnology applications. While various methods
`are known for the synthesis of relatively short fragments in a small scale, these techniques suffer
`from limitations in scalability, automation, speed, accuracy, and cost. There is a need for devices for simple,
`reproducible, scalable, less error-prone and cost-effective methods that guarantee successful
`synthesis of desired genes and are amenable to automation.
`Provided herein is a device for polynucleotide synthesis, the device comprising: a solid
`support comprising a surface; a plurality of loci on the surface, wherein each of the loci comprises:
`an inner region, wherein the inner region comprises a plurality of recesses or protrusions; and an
`outer region that comprises a plurality of first molecules, wherein the outer region spans and
`extends beyond the inner region, and wherein each of the first molecules binds to the surface and
`comprises a reactive group capable of binding to a nucleoside. Further provided herein is a device
`wherein the plurality of loci are arranged in clusters. Further provided herein is a device wherein
`each cluster comprises 50 to 500 loci. Further provided herein is a device wherein each cluster comprises
`about 121 loci. Further provided herein is a device wherein the outer region has a diameter of up to
`100 um. Further provided herein is a device wherein the outer region has a diameter of about 60
`um. Further provided herein is a device wherein the inner region has a diameter of about 55 um. Further
`provided herein is a device wherein the inner region has a diameter 80% to 95% shorter than the
`diameter of the outer region. Further provided herein is a device wherein the inner region has a
`diameter 2 um to 20 um shorter than the diameter of the outer region. Further provided herein is a
`device wherein the inner region has a diameter about 5 um shorter than the diameter of the outer
`region. Further provided herein is a device wherein each of the recesses or protrusions has an etch
`depth of 100 nm to 1000 nm. Further provided herein is a device wherein each of the recesses or
`protrusions has an etch depth of 200 nm to 500 nm. Further provided herein is a device wherein
`each of the recesses or protrusions has a width of 100 to 500 nm. Further provided herein is a
`device wherein each of the recesses or protrusions has a width of 300 to 330 nm. Further provided
`herein is a device wherein each of the recesses or protrusions has a pitch length of about 2 to 3
`times a width of the recesses or protrusions. Further provided herein is a device wherein each of
`the recesses or protrusions has a depth of about 60% to 125% of a pitch length. Further provided
`herein is a device wherein each of the recesses or protrusions has a pitch of up to 1 um. Further provided
`herein is a device wherein the solid support has a tensile strength of 1 MPa to 300 MPa. Further
`provided herein is a device wherein the solid support has a tensile strength of 1 MPa to 10 MPa.
`Further provided herein is a device wherein the solid support has a stiffness of 1 GPa to 500 GPa.
`Further provided herein is a device wherein the solid support has a stiffness of 1 GPa to 10 GPa.
`Further provided herein is a device wherein the solid support comprises nylon, nitrocellulose, or
`polypropylene. Further provided herein is a device wherein the solid support comprises silicon,
`silicon dioxide, silicon nitride, polytetrafluoroethylene, polypropylene, polystyrene, polycarbonate,
`gold, or platinum. Further provided herein is a device wherein each of the first molecules is a
`silane. Further provided herein is a device wherein the silane is an aminosilane. Further provided
`herein is a device wherein each of the first molecules is N-(3-triethoxysilylpropyl)-4-
`hydroxybutyramide (HAPS), 11-acetoxyundecyltriethoxysilane, n-decyltriethoxysilane, (3-
`aminopropyl)trimethoxysilane, (3-aminopropyl)triethoxysilane, 3-
`glycidoxypropyltrimethoxysilane, 3-iodo-propyltrimethoxysilane, or octylchlorosilane. Further
`provided herein is a device comprising a plurality of second molecules, wherein the plurality of second
`molecules is located on the surface in a region surrounding the outer region of each of the loci, and
`wherein each second molecule binds to the surface and lacks a reactive group capable of binding to
`the nucleoside. Further provided herein is a device wherein the second molecule is a fluorosilane.
`Further provided herein is a device wherein the fluorosilane is (tridecafluoro-1,1,2,2-
`tetrahydrooctyl)trichlorosilane, perfluorooctyltrichlorosilane, perfluorooctyltriethoxysilane, or
`Provided herein is a method for polynucleotide synthesis, comprising:
`providing predetermined sequences for polynucleotides; providing the device of any one of claims
`1 to 27; and synthesizing the polynucleotides. Further provided herein is a method wherein the
`polynucleotides comprise at least 30,000 non-identical polynucleotides. Further provided herein
`is a method wherein the at least 30,000 non-identical polynucleotides encode for at least 750 genes.
`Further provided herein is a method wherein the at least 30,000 non-identical polynucleotides
`have an aggregate error rate of less than 1 in 1000 bases compared to the predetermined sequences
`for polynucleotides. Further provided herein is a method wherein the at least 30,000 non-identical
`polynucleotides have an aggregate error rate of less than 1 in 1500 bases compared to the
`predetermined sequences for the polynucleotides. Further provided herein is a method wherein at
`least 80% of at least 30,000 non-identical polynucleotides have no errors compared to the
`predetermined sequences for the polynucleotides. Further provided herein is a method wherein at
`least 89% of at least 30,000 non-identical polynucleotides have no errors compared to the
`predetermined sequences for the polynucleotides.
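The two quality measures recited above, an aggregate error rate per base and the fraction of polynucleotides with no errors, can be illustrated with a minimal Python sketch. This is not the measurement method of the disclosure. The function names and the toy data are hypothetical, and the sketch assumes every synthesized read has the same length as its predetermined sequence, so only substitutions are counted; insertions and deletions would require an alignment step that is omitted here.

```python
def aggregate_error_rate(predetermined, synthesized):
    """Mismatched bases divided by total bases, pooled over all sequence pairs.

    Assumption (not from the source): reads are compared position by position,
    so each pair must have equal length and only substitutions are detected.
    """
    if len(predetermined) != len(synthesized):
        raise ValueError("need one synthesized read per predetermined sequence")
    total_bases = 0
    total_errors = 0
    for ref, obs in zip(predetermined, synthesized):
        if len(ref) != len(obs):
            raise ValueError("sketch assumes equal-length sequences (no indels)")
        total_bases += len(ref)
        total_errors += sum(1 for a, b in zip(ref, obs) if a != b)
    if total_bases == 0:
        raise ValueError("no bases to compare")
    return total_errors / total_bases


def fraction_error_free(predetermined, synthesized):
    """Fraction of synthesized reads identical to their predetermined sequence."""
    if not predetermined or len(predetermined) != len(synthesized):
        raise ValueError("need matching, non-empty lists of sequences")
    perfect = sum(1 for ref, obs in zip(predetermined, synthesized) if ref == obs)
    return perfect / len(predetermined)


# Toy data: four 1000-base designs, one read carrying a single substitution.
designs = ["ACGT" * 250] * 4
reads = list(designs)
reads[0] = "T" + designs[0][1:]  # first base A -> T

rate = aggregate_error_rate(designs, reads)      # 1 error in 4000 bases
perfect = fraction_error_free(designs, reads)    # 3 of 4 reads are error-free
print(rate < 1 / 1000, perfect)
```

In this toy case the pooled rate is 1 error in 4000 bases, which is below the 1 in 1000 threshold, while only 75% of the reads are error-free, which would fall short of the 80% figure. The two measures are therefore distinct and are recited separately.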
`Provided herein is a method for gene synthesis, comprising: providing predetermined
`sequences for polynucleotides; providing the device of any one of claims 1 to 27; synthesizing the
`polynucleotides; and assembling the polynucleotides to form a plurality of genes. Further provided
`herein is a method further comprising releasing the polynucleotides prior to step (d).
`Provided herein is a system for polynucleotide synthesis, the system comprising: a
`material deposition device comprising a plurality of reagents for polynucleotide synthesis and a
`plurality of nozzles for depositing the plurality of reagents for polynucleotide synthesis; a computer
`for controlling the release of the plurality of reagents for polynucleotide synthesis from the plurality
`of nozzles; and the device of any one of claims 1 to 27 for synthesis of polynucleotides.
`The present disclosure provides systems, methods, and devices for rapid parallel synthesis of
`polynucleotide libraries with low error rates. The oligonucleotide synthesis steps described herein
`are “de novo,” meaning that oligonucleotides are built one monomer at a time to form a polymer.
`During de novo synthesis of polynucleotides, the crowding of single stranded polynucleotides
`extending from a surface results in an increase in error rates. To reduce the frequency of crowding-
`related errors, methods are provided herein to reduce the density of nucleoside-coupling agent
`bound to specific regions of the surface. At the same time, to compensate for the reduced density of
`polynucleotides extending from a surface, methods are disclosed herein to increase surface area so
`as to increase the yield of synthesized polynucleotides.
`Throughout this disclosure, numerical features are presented in a range format. It should
`be understood that the description in range format is merely for convenience and brevity and should
`not be construed as an inflexible limitation on the scope of any embodiments. Accordingly, the
`description of a range should be considered to have specifically disclosed all the possible subranges
`as well as individual numerical values within that range to the tenth of the unit of the lower limit
`unless the context clearly dictates otherwise. For example, description of a range such as from 1 to
`6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4,
`from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual values within that range,
`for example, 1.1, 2, 2.3, 5, and 5.9. This applies regardless of the breadth of the range. The upper
`and lower limits of these intervening ranges may independently be included in the smaller ranges,
`and are also encompassed within the invention, subject to any specifically excluded limit in the
`stated range. Where the stated range includes one or both of the limits, ranges excluding either or
`both of those included limits are also included in the invention, unless the context clearly dictates otherwise.
`The terminology used herein is for the purpose of describing particular embodiments
`only and is not intended to be limiting of any embodiment. As used herein, the singular forms “a,”
`“an” and “the” are intended to include the plural forms as well, unless the context clearly indicates
`otherwise. It will be further understood that the terms “comprises” and/or “comprising,” when used
`in this specification, specify the presence of stated features, integers, steps, operations, elements,
`and/or components, but do not preclude the presence or addition of one or more other features,
`integers, steps, operations, elements, components, and/or groups thereof. As used herein, the term
`“and/or” includes any and all combinations of one or more of the associated listed items.
`Unless specifically stated or obvious from context, as used herein, the term “about” in
`reference to a number or range of numbers is understood to mean the stated number and numbers
`+/- 10% thereof, or 10% below the lower listed limit and 10% above the higher listed limit for the
`values listed for a range.
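As a worked illustration of this definition, the following minimal Python sketch computes the interval that "about" covers for a single value or for a range. The function name `about_interval` and its `tolerance` parameter are hypothetical and introduced only for this example; the 10% figure is the one stated above.

```python
def about_interval(low, high=None, tolerance=0.10):
    """Return (minimum, maximum) covered by 'about' under the +/- 10% reading.

    For a single number, pass only `low`. For a range, the lower listed limit
    is reduced by 10% of itself and the higher listed limit is increased by
    10% of itself.
    """
    if high is None:
        high = low
    if low > high:
        raise ValueError("lower limit must not exceed upper limit")
    # abs() keeps the interval widening outward even for negative values.
    return (low - abs(low) * tolerance, high + abs(high) * tolerance)


# "about 60 um" (a single value) and "about 2 to 3 times" (a range)
print(about_interval(60))
print(about_interval(2, 3))
```

Under this reading, "about 60 um" spans roughly 54 um to 66 um, and "about 2 to 3 times" spans roughly 1.8 to 3.3 times.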
`As used herein, the terms “preselected sequence”, “predefined sequence” or
`“predetermined sequence” are used interchangeably. The terms mean that the sequence of the
`polymer is known and chosen before synthesis or assembly of the polymer. In particular, various
`aspects of the invention are described herein primarily with regard to the preparation of nucleic
`acids molecules, the sequence of the oligonucleotide or polynucleotide being known and chosen
`before the synthesis or assembly of the nucleic acid molecules.
`Provided herein are methods and compositions for production of synthetic (i.e. de novo
`synthesized or chemically synthesized) polynucleotides. The terms oligonucleotide, oligo, and
`polynucleotide are defined to be synonymous throughout. Libraries of synthesized polynucleotides
`described herein may comprise a plurality of polynucleotides collectively encoding for one or more
`genes or gene fragments. In some instances, the polynucleotide library comprises coding or non-
`coding sequences. In some instances, the polynucleotide library encodes for a plurality of cDNA
`sequences. Reference gene sequences from which the cDNA sequences are based may contain
`introns, whereas cDNA sequences exclude introns. Polynucleotides described herein may encode
`for genes or gene fragments from an organism. Exemplary organisms include, without limitation,
`prokaryotes (e.g., bacteria) and eukaryotes (e.g., mice, rabbits, humans, and non-human
`In some instances, the polynucleotide library comprises one or more polynucleotides,
`each of the one or more polynucleotides encoding sequences for multiple exons. Each
`polynucleotide within a library described herein may encode a different sequence, i.e., non-
`identical sequence. In some instances, each polynucleotide within a library described herein
`comprises at least one portion that is complementary to a sequence of another polynucleotide within
`the library. Polynucleotide sequences described herein may, unless stated otherwise, comprise
`DNA or RNA.
`Provided herein are methods and compositions for production of synthetic (i.e. de novo
`synthesized) genes. Libraries comprising synthetic genes may be constructed by a variety of
`methods described in further detail elsewhere herein, such as PCA, non-PCA gene assembly
`methods or hierarchical gene assembly, combining (“stitching”) two or more double-stranded
`polynucleotides to produce larger DNA units (i.e., a chassis). Libraries of large constructs may
`involve polynucleotides that are at least 1, 1.5, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, 70,
`80, 90, 100, 125, 150, 175, 200, 250, 300, 400, 500 kb long or longer. The large constructs can be
`bounded by an independently selected upper limit of about 5000, 10000, 20000 or 50000 base
`pairs. The synthesis of any number of polypeptide-segment encoding nucleotide sequences,
`including sequences encoding non-ribosomal peptides (NRPs), sequences encoding non-ribosomal
`peptide-synthetase (NRPS) modules and synthetic variants, polypeptide segments of other modular
`proteins, such as antibodies, polypeptide segments from other protein families, including non-
`coding DNA or RNA, such as regulatory sequences, e.g., promoters, transcription factors, enhancers,
`siRNA, shRNA, RNAi, miRNA, small nucleolar RNA derived from microRNA, or any functional
`or structural DNA or RNA unit of interest. The following are non-limiting examples of
`polynucleotides: coding or non-coding regions of a gene or gene fragment, intergenic DNA, loci
`(locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA,
`ribosomal RNA, short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA
`(miRNA), small nucleolar RNA, ribozymes, complementary DNA (cDNA), which is a DNA
`representation of mRNA, usually obtained by reverse transcription of messenger RNA (mRNA) or
`by amplification; DNA molecules produced synthetically or by amplification, genomic DNA,
`recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any
`sequence, isolated RNA of any sequence, nucleic acid probes, and primers. cDNA encoding for a
`gene or gene fragment referred to herein may comprise at least one region encoding for exon
`sequence(s) without an intervening intron sequence found in the corresponding genomic sequence.
`Alternatively, the corresponding genomic sequence to a cDNA may lack intron sequence in the first place.
`Unless otherwise stated, water contact angles mentioned herein correspond to
`measurements that would be taken on uncurved, smooth, or planar equivalents of the surfaces in
`Clusters and Loci
`Provided herein is a device comprising a surface, wherein the surface is modified to
`support polynucleotide synthesis at predetermined locations and with a resulting low error rate, a
`low dropout rate, a high yield, and a high oligo representation. In some embodiments, the surface
`comprises a plurality of loci, wherein each locus comprises a plurality of first molecules deposited
`on the locus, wherein the first molecule binds to the surface and comprises a reactive group capable
`of binding to a nucleoside.
`The terms “locus” and “loci,” as used herein, refer to a single discrete active region, and
`to a plurality of discrete active regions on the surface of the device, respectively, wherein the
`plurality of first molecules are deposited on said locus, and wherein the first molecule binds to the
`surface and comprises a reactive group capable of binding to a nucleoside. In some embodiments,
`the plurality of first molecules comprises one molecule or a mixture of molecules, each of which binds to the surface
`and comprises a reactive group capable of binding to a nucleoside.
`Referring to FIGS. 1A to 1C, an exemplary device 100 provided herein comprises a
`surface 101, wherein the surface 101 comprises a plurality of loci 110, wherein each locus 110
`comprises a plurality of first molecules 120, wherein the plurality of first molecules 120 comprise a
`high-energy molecule, and wherein the first molecule binds to the surface 101 and comprises a
`reactive group capable of binding to a nucleoside, to synthesize a single sequence polynucleotide.
`In this arrangement, the plurality of the first molecules 120 deposited on each locus 110 exhibit a
`higher surface energy than the surface 101 of the device, and the variation in the surface
`energy facilitates localization of droplets of a fluid onto the loci 110. In some embodiments,
`localization of droplets onto the loci 110 is altered by adjusting the pattern and geometry of the loci
`110. In some instances, the high-energy molecules 120 on one locus 110 are capable of binding to
`the surface and comprise a reactive group capable of binding to a certain nucleoside to support the
`synthesis of a certain population of polynucleotides having a certain sequence, wherein the first
`molecules on another locus 110 are capable of binding to the surface and comprise a reactive group
`capable of binding to a different nucleoside to support the synthesis of a different population of
`polynucleotides having a different sequence.
`In some instances, the surface 101 of the device 100 comprises a plurality of loci 110,
`wherein the plurality of loci 110 are arranged into a plurality of clusters 140, wherein each cluster
`140 comprises a plurality of loci 110. Referring to FIGS. 1A to 1C, the surface 101 of the device
`100 comprises a rectilinear array of 16 columns and 16 rows of cluste