Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 1 of 48 PageID #:
`5863
`
`Exhibit 2
`
`
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 2 of 48 PageID #:
`5864
`
`
`
`IN THE UNITED STATES DISTRICT COURT
`FOR THE EASTERN DISTRICT OF TEXAS
`SHERMAN DIVISION
`
`R2 SOLUTIONS LLC,
`
`
`Plaintiff,
`
`v.
`
`
`
`
`
`
`DATABRICKS, INC.,
`
`
`Civil Action No. 4:23-cv-01147-ALM
`
` JURY TRIAL DEMANDED
`
`Defendant.
`
`DEFENDANT’S INITIAL INVALIDITY CONTENTIONS
`
`I.
`
`INTRODUCTION
`
`Pursuant to the Court’s Scheduling Order (Dkt. 28) and P.R. 3-3, defendant Databricks,
`
`Inc. (“Defendant” or “Databricks”) provides the following initial invalidity contentions to plaintiff
`
`R2 Solutions, LLC (“Plaintiff” or “R2”) regarding the currently asserted claims of U.S. Patent No.
`
`8,190,610 (the “’610 patent” or the “Asserted Patent”).
`
`Plaintiff has asserted claims 1-5, and 17-21 of the ’610 patent against Defendant in this
`
`case (the “Asserted Claims”). See Plaintiff’s June 21, 2024 Infringement Contentions (“Cont.”) at
`
`2. Defendant provides these invalidity contentions for the above-identified Asserted Claims. To
`
`the extent Plaintiff later attempts to assert additional claims, Defendant reserves the right to amend
`
`its invalidity contentions and contend that any additional claims are also invalid. Defendant’s
`
`invalidity contentions are not an admission of validity as to any non-asserted claims of the Asserted
`
`Patent.
`
`Based on its investigation and knowledge developed to date, and pursuant to P.R. 3-3,
`
`Defendant: (a) identifies each currently known item of prior art that anticipates and/or renders
`
`1
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 3 of 48 PageID #:
`5865
`
`
`
`obvious the Asserted Claims; (b) states whether each such item of prior art anticipates the Asserted
`
`Claims or renders the Asserted Claims obvious (alone or in combination); (c) provides claim charts
`
`identifying where specifically in each item of prior art each element of the Asserted Claims is
`
`found; (d) identifies grounds of invalidity of the Asserted Claims under 35 U.S.C. § 112 based on
`
`indefiniteness, enablement, and/or written description; and (e) identifies claims that are directed
`
`to ineligible subject matter under 35 U.S.C. § 101. Contemporaneously with these invalidity
`
`contentions, Defendant produces the prior art referenced herein and documentation sufficient to
`
`show the operation of the functionalities accused in R2’s infringement contentions, as required
`
`under P.R. 3-4.
`
`Defendant’s invalidity contentions are based on information reasonably available at this
`
`time with respect to the Asserted Claims, and are necessarily preliminary and may require
`
`subsequent amendment, modification, and/or supplementation. Defendant reserves the right to
`
`amend, modify, and/or supplement these invalidity contentions based on, among other things,
`
`amendments, modifications or supplements
`
`to R2’s
`
`infringement contentions, further
`
`investigation, third-party discovery, fact or expert discovery and/or evaluation of the scope and
`
`content of the prior art (including, for example, the prior art from any other cases in which R2
`
`asserts the Asserted Patent), disclosure of the parties’ claim constructions, an order construing the
`
`Asserted Claims, new developments in the case or any inter partes review proceedings, or any
`
`other basis contemplated by the Federal Rules of Civil Procedure, the Court’s Local Rules, and
`
`any other applicable order entered by the Court.
`
`Moreover, fact discovery is ongoing, and Defendant has not obtained deposition testimony
`
`from any of the named inventors of the Asserted Patent or any third party, including, without
`
`limitation, deposition of any third party identified in these invalidity contentions. Defendant has
`
`2
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 4 of 48 PageID #:
`5866
`
`
`
`served third-party subpoenas to Yahoo, Microsoft, and the Apache Software Foundation and is
`
`awaiting discovery in response to its requests. Defendant may subpoena additional third parties,
`
`including Google, for information. Defendant expects further discovery will reveal additional
`
`prior art, including related disclosures and corresponding evidence for many of the prior art
`
`references identified herein. As such, Defendant has not yet completed its investigation, discovery
`
`or analysis of matters relating to the validity or enforceability of the Asserted Claims, including,
`
`without limitation, invalidity due to on-sale statutory bars, public use statutory bars, improper
`
`inventorship, or unenforceability due to inequitable conduct. The disclosures herein are not and
`
`should not be construed as a statement that no other persons have discoverable information, that
`
`no other documents, data compilations, and/or tangible things exist that Defendant may use to
`
`support its claims or defenses, or that no other legal theories or factual bases will be pursued.
`
`Accordingly, Defendant reserves the right to amend, modify and/or supplement these invalidity
`
`contentions as additional information is discovered, identified or otherwise appreciated, including
`
`testimony about the scope and content of the prior art and the claimed inventions.
`
`These initial invalidity contentions are based on the Defendant’s present understanding of
`
`R2’s infringement contentions, served on June 21, 2024. Nothing in these initial invalidity
`
`contentions should be regarded as conceding that R2’s infringement contentions are legally or
`
`factually adequate or as necessarily reflecting the proper interpretation of the claims or an
`
`interpretation of the claims that Defendant agrees with or proposes. Defendant disputes R2’s
`
`apparent claim interpretations and will identify claim constructions for specific claim terms under
`
`the scheduling order governing this case. To the extent additional information regarding R2’s
`
`infringement contentions becomes available, Defendant anticipates that it will provide
`
`corresponding invalidity contentions correlating R2’s interpretation of the claims with the prior art
`
`3
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 5 of 48 PageID #:
`5867
`
`
`
`and Defendant may thus amend its invalidity contentions accordingly as applicable to the claims
`
`asserted by R2.
`
`Nothing in these invalidity contentions shall be treated as an admission that Defendant’s
`
`accused products meet any limitation of the Asserted Claims. Defendant denies that it infringes
`
`any claim of the Asserted Patent. To the extent that any prior art reference identified by Defendant
`
`contains a claim element that is the same as or similar to an element in an accused product,
`
`inclusion of that reference in Defendant’s invalidity contentions shall not be deemed a waiver of
`
`any claim construction or non-infringement position. Any use of these invalidity contentions to
`
`support any allegation of infringement would be misleading, false, and wrong as a matter of law
`
`and fact.
`
`Unless otherwise specified, the invalidity contentions set forth herein are based on the
`
`alleged priority date of the ’610 patent asserted by R2 in its infringement contentions. To the
`
`extent R2 asserts entitlement to an earlier priority date for prior art purposes, Defendant reserves
`
`the right to amend these contentions. Further, nothing in these contentions constitutes an
`
`admission concerning the priority dates, conception date, or reduction to practice of the Asserted
`
`Claims.
`
`II.
`
`IDENTIFICATION OF PRIOR ART
`
`The concepts disclosed and claimed in the Asserted Patent are not new, and had been
`
`disclosed, and actively practiced by others prior to the claimed invention date. The prior art
`
`includes various documents, products, patents, and inventions that separately and together render
`
`the Asserted Claims invalid. In addition, as described in more detail below, claims of the Asserted
`
`Patent are invalid under 35 U.S.C. §§ 101 and 112.
`
`Defendant asserts that the prior art listed in Exhibit A, individually or in combination,
`
`invalidate the Asserted Claims. Defendant identifies patents, publications, and the products and
`
`4
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 6 of 48 PageID #:
`5868
`
`
`
`systems it describes as prior art under 35 U.S.C. §§ 102 (a), (b), (e), (g) and § 103. The Asserted
`
`Patent has an earliest filing or priority date before March 16, 2013 and is therefore not subject to
`
`the AIA rules relating to what qualifies as prior art. Defendant asserts that as of the date of these
`
`invalidity contentions, these products and systems were: (1) known or used in this country before
`
`the alleged invention of the claimed subject matter of the Asserted Claims; (2) in public use and/or
`
`on sale in this country more than one year before the filing date of the patent; (3)invented in this
`
`country by another who did not abandon, suppress, or conceal, before the alleged invention of the
`
`claimed subject matter of the asserted claim; (4) patented or described in a printed publication in
`
`this or a foreign country; before the alleged invention of the claimed subject matter of the Asserted
`
`Claims; and/or (5) patented or described in a printed publication in this or a foreign country more
`
`than one year prior to the filing date of the patent. These prior art products and systems and their
`
`associated patents and/or printed publications individually anticipate and/or collectively render
`
`obvious each of the Asserted Claims.
`
`These prior art references and products disclose each and every element of one or more of
`
`the Asserted Claims either explicitly, inherently, or via an obvious combination and may also be
`
`relied upon to show the state of the art in the relevant timeframe. The date these prior art items
`
`were offered for sale or publicly used or known, is at least as early as the date the related
`
`publications were published. Defendant anticipates that the actual dates, circumstances, and
`
`identities of individuals will be the subject of third-party discovery during this lawsuit. Defendant
`
`therefore reserves the right to modify, amend, or supplement these invalidity contentions if
`
`additional information becomes available during the course of discovery.
`
`5
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 7 of 48 PageID #:
`5869
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 8 of 48 PageID #:
`5870
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 9 of 48 PageID #:
`5871
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 10 of 48 PageID #:
`5872
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 11 of 48 PageID #:
`5873
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 12 of 48 PageID #:
`5874
`
`
`
`Applying Lucene to the
`Web
`Nutch, Open-Source Web
`Search
`Free Search: Lucene &
`Nutch
`Nutch: Open Source Web
`Search Software
`Scalable Computing with
`MapReduce
`Open Source Search
`Scaling Nutch
`Hadoop version source code
`and documentation, hadoop-
`0.4.0
`Hadoop version source code
`and documentation, hadoop-
`0.1.0
`Hadoop Map/Reduce
`
`200417
`
`200418
`
`200419
`
`200420
`
`200521
`
`200522
`200523
`200624
`
`200625
`
`200626
`
`Doug Cutting
`
`Cutting VIII
`
`Doug Cutting
`
`Cutting IX
`
`Doug Cutting
`
`Cutting X
`
`Doug Cutting
`
`Cutting XI
`
`Doug Cutting
`
`Cutting XII
`
`Doug Cutting
`Doug Cutting
`Doug Cutting /
`Apache Software
`Foundation
`Doug Cutting /
`Apache Software
`Foundation
`Owen O’Malley
`
`Cutting XIII
`Cutting XIV
`Hadoop I
`
`Hadoop II
`
`Hadoop III
`
`
`17 Presented May 7, 2004, The Server Side Java Symposium, Las Vegas, NV. Available at
`https://cwiki.apache.org/confluence/display/NUTCH/Presentation
`18 Presented May 22, 2004, WWW2004, New York, NY. Available at
`https://cwiki.apache.org/confluence/display/NUTCH/Presentation
`19 Presented June, 10 2004, Wizards of OS, Berlin. Available at
`https://cwiki.apache.org/confluence/display/NUTCH/Presentation Audio link available at:
`http://www.archive.org/audio/audio-details-db.php?collectionid=3 do t2 17h 2-
`Cutting_a&collection=conference_proceedings; video link available at:
`http://www.archive.org/details/3_do_t2_17h_2-Cutting
`20 Presented November 26, 2004, University of Pisa, Italy. Available at
`https://cwiki.apache.org/confluence/display/NUTCH/Presentation
`21 Presented August 3, 2005 at OSCON in Portland, OR. Available at
`https://cwiki.apache.org/confluence/display/NUTCH/Presentation
`22 Presented December 5, 2005, at the IBM Information Retrieval Seminar in Haifa, Israel.
`Available at https://cwiki.apache.org/confluence/display/NUTCH/Presentation
`23 Presented September 22, 2005, at the 5th International Web Archiving Workshop in Vienna,
`Austria. Available at
`24 Hadoop-0.4.0 was publicly available no later than June 28, 2006.
`25 Hadoop-0.1.0 was publicly available no later than April 1, 2006.
`26 Presented in July 2006. Available at
`https://cwiki.apache.org/confluence/display/HADOOP2/HadoopPresentations.
`
`11
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 13 of 48 PageID #:
`5875
`
`
`
`Hadoop Distributed File
`System: Architecture and
`Design
`Lucene-Hadoop
`
`Hadoop Overview
`
`Grep – Lucene-hadoop
`Wiki
`Lucene-Hadoop FAQ
`
`Hadoop MapReduce
`
`How Many Maps and
`Reduces
`Lucene-Hadoop DFS
`Requirements
`Lucene-Hadoop IO
`
`200527
`
`200628
`
`200629
`
`200630
`
`200631
`
`200632
`
`200633
`
`200634
`
`200635
`
`Dhruba Borthakur
`/ Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`
`Hadoop IV
`
`Hadoop V
`
`Hadoop VI
`
`Hadoop VII
`
`Hadoop VIII
`
`Hadoop IX
`
`Hadoop X
`
`Hadoop XI
`
`Hadoop XII
`
`
`27 Published in 2005. Available at: https://web.mit.edu/mriap/hadoop/hadoop-
`0.13.1/docs/hdfs_design.pdf
`28 Publicly available at least as of September 2006. Available at:
`https://web.archive.org/web/20060901172519/http:/wiki.apache.org/lucene-hadoop/
`29 Publicly available at least as of June 26, 2006. Available at:
`https://web.archive.org/web/20060626012900/http:/wiki.apache.org:80/lucene-
`hadoop/HadoopOverview
`30 Publicly available at least as of June 26, 2006. Available at:
`https://web.archive.org/web/20061124055719/http:/wiki.apache.org/lucene-hadoop/Grep
`31 Publicly available at least as of August 23, 2006. Available at:
`https://web.archive.org/web/20060823192520/http:/wiki.apache.org/lucene-hadoop/FAQ
`32 Publicly available at least as of September 1, 2006. Available at:
`https://web.archive.org/web/20060901172147/http:/wiki.apache.org/lucene-
`hadoop/HadoopMapReduce
`33 Publicly available at least as of August 23, 2006. Available at:
`https://web.archive.org/web/20060823193735/http://wiki.apache.org/lucene-
`hadoop/HowManyMapsAndReduces
`34 Publicly available at least as of September 1, 2006. Available at:
`https://web.archive.org/web/20060901050023/http://wiki.apache.org/lucene-
`hadoop/DFS_requirements
`35 Publicly available at least as of June 26, 2006. Available at:
`https://web.archive.org/web/20060626010843/http://wiki.apache.org:80/lucene-hadoop/io
`
`12
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 14 of 48 PageID #:
`5876
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 15 of 48 PageID #:
`5877
`
`
`
`prior
`Yahoo
`implementations
`MapReduce
`and
`functionality42
`prior
`Microsoft
`implementations
`MapReduce
`and
`functionality43
`The Hadoop System44
`
`art
`of
`similar
`
`art
`of
`similar
`
`The Nutch System45
`
`Prior to
`October 5,
`2006
`
`Prior to
`October 5,
`2006
`
`Prior to
`October 5,
`2006
`Prior to
`October 5,
`2006
`
`Yahoo
`
`The Yahoo
`MapReduce System
`
`Microsoft
`
`The Microsoft
`MapReduce System
`
`Apache Software
`Foundation
`
`Apache Software
`Foundation
`
`Hadoop System
`
`Nutch System
`
`
`
`3.
`
`IDENTIFICATION OF INVALIDITY DUE TO OBVIOUSNESS
`
`The tables below list prior art references that render one or more of the Asserted Claims
`
`invalid as obvious under 35 U.S.C. § 103, alone or in combination with the knowledge of a person
`
`having ordinary skill in the art and/or other prior art. The attached claim charts in Exhibit A
`
`demonstrate where each limitation of the Asserted Claims is found in certain of the references
`
`listed below, either expressly or inherently in the larger context of the passage, as understood by a
`
`person having ordinary skill in the art. The following references are prior art under at least 35
`
`U.S.C. §§ 102(a), (b), (e), or (g).
`
`
`42 Databricks is seeking discovery from Yahoo for non-public information concerning their
`implementations of MapReduce and similar functionality including related projects, services,
`or products.
`43 Databricks is seeking discovery from Microsoft for non-public information concerning their
`implementations of MapReduce and similar functionality including related projects, services,
`or products.
`44 The Hadoop System includes but is not limited to: Cutting I-XIV, Hadoop I-XII, including all
`prior art versions of the Hadoop source code. The Hadoop System is the subject of on-going
`third-party discovery.
`45 The Nutch System includes but is not limited to: Cutting I-XIV, including all prior art versions
`of the Nutch source code. The Nutch System is the subject of on-going third-party discovery.
`
`
`
`14
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 16 of 48 PageID #:
`5878
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 17 of 48 PageID #:
`5879
`
`
`
`Intranet Search With Nutch
`Applying Lucene to the
`Web
`Nutch, Open-Source Web
`Search
`Free Search: Lucene &
`Nutch
`Nutch: Open Source Web
`Search Software
`Scalable Computing with
`MapReduce
`Open Source Search
`Scaling Nutch
`Hadoop version source code
`and documentation, hadoop-
`0.4.0
`Hadoop version source code
`and documentation, hadoop-
`0.1.0
`Hadoop Map/Reduce
`Hadoop Distributed File
`System: Architecture and
`Design
`Lucene-Hadoop
`
`Hadoop Overview
`
`Grep – Lucene-hadoop
`Wiki
`Lucene-Hadoop FAQ
`
`Hadoop MapReduce
`
`How Many Maps and
`Reduces
`Lucene-Hadoop DFS
`Requirements
`
`Doug Cutting
`Doug Cutting
`
`Cutting VII
`Cutting VIII
`
`Doug Cutting
`
`Cutting IX
`
`Doug Cutting
`
`Cutting X
`
`Doug Cutting
`
`Cutting XI
`
`Doug Cutting
`
`Cutting XII
`
`Doug Cutting
`Doug Cutting
`Doug Cutting /
`Apache Software
`Foundation
`Doug Cutting /
`Apache Software
`Foundation
`Owen O’Malley
`Dhruba Borthakur
`/ Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`Apache Software
`Foundation
`
`Cutting XIII
`Cutting XIV
`Hadoop I
`
`Hadoop II
`
`Hadoop III
`Hadoop IV
`
`Hadoop V
`
`Hadoop VI
`
`Hadoop VII
`
`Hadoop VIII
`
`Hadoop IX
`
`Hadoop X
`
`Hadoop XI
`
`2004
`2004
`
`2004
`
`2004
`
`2004
`
`2005
`
`2005
`2005
`2006
`
`2006
`
`2006
`2005
`
`2006
`
`2006
`
`2006
`
`2006
`
`2006
`
`2006
`
`2006
`
`16
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 18 of 48 PageID #:
`5880
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 19 of 48 PageID #:
`5881
`
`
`
`d.
`
`Motivation for Combining Prior Art
`
`A person having ordinary skill in the art at the time of filing of the Asserted Patent would
`
`have understood the references listed above, alone or in combination, to contain explicit and/or
`
`implicit teaching, suggestion, and/or rationales to combine them for at least the following
`
`exemplary reasons.46
`
`The combinations of references provided in the accompanying prior art reference charts
`
`for each Asserted Claim are examples and are not intended to be exhaustive. Additional
`
`obviousness combinations of the references identified here are possible, and Defendant may rely
`
`on such combination(s) in this litigation. In particular, Defendant is unaware of the extent, if any,
`
`to which R2 may contend that limitations of the claims at issue are not disclosed in the prior art
`
`identified by Defendant as anticipatory, and the extent to which R2 will contend that elements not
`
`disclosed in the specification of the Asserted Patent would have been known to persons of skill in
`
`the art. Defendant is continuing its investigation of the large universe of prior art to identify
`
`potential prior art systems, publications related to those systems, and third-parties that may have
`
`information about those systems. Defendant reserves the right to supplement these contentions to
`
`identify other prior art and combinations that would have made such limitations obvious.
`
`A POSITA would have been motivated to combine the teachings of any of the references
`
`related to MapReduce—i.e., Pike, Dean I-III, McClosky, Nutch System, Hadoop System, GFS,
`
`and/or The Google MapReduce System, Yahoo MapReduce System, and/or Microsoft MapReduce
`
`
`46 In KSR Int’l Co. v. Teleflex Inc., 550 U.S. 398, 418 (2007), the Supreme Court held that prior
`art need not disclose the precise teachings of a patented invention to render it obvious,
`because a court “can take account of the inferences and creative steps that a person of
`ordinary skill in the art would employ.” Under KSR, an explanation for why a combination
`of prior art items renders a claim obvious may be found in the “interrelated teachings of
`multiple patents; the effects of demands known to the design community or present in the
`marketplace; and the background knowledge possessed by a person having ordinary skill in
`the art.” Id. at 418.
`
`18
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 20 of 48 PageID #:
`5882
`
`
`
`Systems (the “MapReduce References”). Each of the MapReduce references concerns the same
`
`subject matter—MapReduce—or various systems or products implementing it or similar parallel
`
`programming architectures. For example, each of those references disclose partitioning data
`
`groups into key value pairs, providing the partitions to mapping functions, and reducing the
`
`intermediate data based on a common key, all within a distributed system. See e.g., Pike at 2:14-
`
`39, 2:41-3:35; MapReduce in Nutch; Dean I at Fig. 1, 2:40-60; McClosky at 1-5. It would have
`
`been obvious to consider and combine the teachings of a publication, patent, patent application, or
`
`system that relates to a certain product or service with the features of that product or service as
`
`known and/or provided commercially. Similarly, where multiple publications discuss the same
`
`underlying product, service, standard, or project, it was obvious to combine the discussions and
`
`disclosures of the publications as they would be understood to describe features or potential
`
`features of the underlying subject matter. Further, where one publication discusses another
`
`publication or standard, it was obvious to consider and combine the teachings of each publication
`
`in combination with each other. Indeed, the inventors of the ’610 patent refer to teachings from
`
`such references interchangeably. See Map-Reduce-Merge: Simplified Relational Data Processing
`
`on Large Clusters at 1029 (classifying Google’s File System (GFS), Map-Reduce, Microsoft’s
`
`Dryad, Yahoo’s internal infrastructure and open-source Hadoop as comparable systems).
`
`Combining the teachings of these references would have involved nothing more than the use of
`
`known features to yield predictable results—mapping and reducing data groups using key-value
`
`pairs.
`
`Moreover, these prior art references or other evidence of prior art identify and address the
`
`same technical issues and suggest similar solutions to those issues. The prior art references,
`
`actions, knowledge, and/or prior inventions for the Asserted Patent are directed to the same or
`
`19
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 21 of 48 PageID #:
`5883
`
`
`
`similar fields and are directed to solving the same or similar problems such that one of ordinary
`
`skill in the art would have been motivated to consider the techniques and systems disclosed or
`
`involved in those items of prior art and to combine them to arrive at the alleged inventions in the
`
`Asserted Claims.
`
`Additionally it would have been obvious to combine any of MacLeod I-III, Matsumoto,
`
`Chowdhuri, DeWitt, Hellerstein, and Ozsu (the Relational Database References), as each of which
`
`describe performing processing operations on data in relational databases, merging or other
`
`functions on heterogeneous datasets and identifying and tracking data throughout processing, with
`
`the MapReduce References. Each of the Relational Database References concerns data in a
`
`distributed database, and explicitly refers to performing the same functions within that database
`
`system. See e.g., MacLeod I at 1:19-29 (“A relational database is a collection of related data that
`
`is organized in related two-dimensional tables of columns and rows wherein information can be
`
`derived by performing set operations on the tables, such as join, sort, merge, and so on. The data
`
`stored in a relational database is typically accessed by way of a user-defined query that is
`
`constructed in a query language such as Structured Query Language (“SQL”). A SQL query is
`
`non-procedural in that it specifies the objective or desired result of the query in a language
`
`meaningful to a user but does not define the steps to be performed, or the order of the steps in order
`
`to accomplish the query.”); Matsumoto at 1:13-14 (“FIG.22 and FIG. 23 are block diagrams
`
`showing a related art join processing system [i]n a relational database), 2:58-61 (“An object of this
`
`invention is to provide a join processing system and method, which includes high speed join
`
`processing in a relational database configured in a multiprocessor system.”); Chowdhuri at ¶ 108
`
`(“Data in a relational database is stored as a series of tables, also called relations. Typically resident
`
`on the server 230, each table itself comprises one or more “rows' or “records” (tuples) (e.g., row
`
`20
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 22 of 48 PageID #:
`5884
`
`
`
`255 as shown at FIG. 2). A typical database will contain many tables, each of which stores
`
`information about a particular type of entity. A table in a typical relational database may contain
`
`anywhere from a few rows to millions of rows.”), ¶ 109 (“Most relational databases implement a
`
`variant of the Structured Query Language (SQL), which is a language allowing users and
`
`administrators to create, manipulate, and access data stored in the database.”); DeWitt at Abstract
`
`(“This paper describes the design of the Gamma database machine and the techniques employed
`
`in its implementation. Gamma is a relational database machine currently operating on an Intel
`
`iPSC /2 hypercube with 32 processors and 32 disk drives. Gamma employs three key technical
`
`ideas which enable the architecture to be scaled to hundreds of processors.”), id. (“Second, novel
`
`parallel algorithms based on hashing are used to implement the complex relational operators such
`
`as join and aggregate functions.”)
`
`A POSITA would have been motivated to combine the MapReduce Reference with the
`
`Relational Database References. For example, Matsumoto and Chowdhuri disclose performing
`
`join operations on heterogeneous data having different schema. A POSITA also would have been
`
`motivated to map and reduce the heterogeneous data sources disclosed in Choudhuri and
`
`Matsumoto in the systems of the MapReduce References because those references teach and
`
`suggest performing similar MapReduce operations on their data / data tables. For example, the
`
`Nutch and Hadoop Systems disclose joining heterogeneous data sets using mapping and reducing
`
`functions. See Cutting II at 13 (describing how the Nutch System uses MapReduce to merge data
`
`from heterogeneous data files “ParseData,” “ParseText,” “Inlinks,” “CrawlDatum.”).
`
`Moreover, a POSITA would have recognized, for example, that Chowdhuri’s scanning and
`
`hashjoin iterators correspond to the MapReduce References’ map and reduce tasks, and thus
`
`Chowdhuri’s tables and processing techniques were compatible with the MapReduce References’
`
`21
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 23 of 48 PageID #:
`5885
`
`
`
`systems. Indeed, Chowdhuri’s scan, hashjoin, GroupBy, and sort iterators correspond to the ’610
`
`patent’s description of map and reduce functions. ’610 patent at 1:20-27 (“[t]he map function
`
`iterates” and “[t]he reduce operation … combines elements”), 5:19-39 (sorting), 2:64-66 (“a sort
`
`and group-by-key task”), 5:48-6:7 (group iterator), 6:34-7:16 (“hashtable” iterators), 10:1-7 (“the
`
`reducers include a sort, group-by-key and combine task”); see also Matsumoto at Abstract (“This
`
`invention provides a join processing system and method, which operates efficiently without a
`
`burden to a master processor, in a relational database on a multiprocessor.”); id. (“Each of the slave
`
`processors retrieves a first sub-table and transfers the table to the master processor, and the master
`
`processor creates a first main table. Each of the slave processors retrieves the second data
`
`selectively with reference to the first main table, and creates a second sub-table. The master
`
`processor merges the second sub-tables, and creates a second main table. Then, a join processing
`
`of the first main table and the second main table is performed.”); id. at 11:8-19 (“The command
`
`orders the system to retrieve employee names from the employee data and to select the sales
`
`amount and merchandise from the sales data. In this example, a condition that the employee's
`
`family names are Tanaka is given. Furthermore, when join conditions of the employee data and
`
`the sales data are matched according to a key called a join key, the employee data and the sales
`
`data are selected and joined. In FIG. 2, the join condition is that the data is selected when man
`
`numbers of the employee data and the man numbers of the sales data are matched. The man
`
`numbers are used as the join keys to join the data.”).
`
`A POSITA would have looked to the MapReduce References to map and merge data sets
`
`or tables from Chowdhuri and Matsumoto using the MapReduce References’ distributed system
`
`to, for example, leverage the MapReduce References’ distributed system and its fault tolerant task
`
`scheduling. E.g., Pike, 4:28-53, 6:37-54, 6:55-7:25, 11:57-13:3. This would have been nothing
`
`22
`
`

`

`Case 4:23-cv-01147-ALM Document 84-3 Filed 02/12/25 Page 24 of 48 PageID #:
`5886
`
`
`
`more than combining known elements in the prior art to achieve predictable results. The resulting
`
`combination would have processed Chowdhuri’s or Matsumoto’s tables using, for example,
`
`Chowdhuri’s scanning, hashjoin, GroupBy, and sort iterators as map and reduce tasks in the
`
`MapReduce References, while employing the MapReduce References’ fault-tolerant architecture
`
`to do so. Alternatively, the resulting combination would have processed Chowdhuri’s or
`
`Matsumoto’s tables using, for example the map and reduce operations described by the
`
`MapReduce References.
`
`A POSITA would have been motivated to combine, e.g., Chowdhuri’s partitioning with
`
`the MapReduce References large-scale data processing systems to take advantage of the
`
`MapReduce References parallel processing system and fault tolerant task scheduling. E.g., Pike,
`
`4:28-53, 6:55-7:25, 11:57-13:3. As the MapReduce architecture already includes partitioning
`
`input data sets, see e.g., Dean I at Fig. 2, the combination would have been nothing more than
`
`combining known elements in the prior art to yield the predictable result of efficiently processing
`
`Chowdhuri’s input tables by partitioning each table into a plurality of partitions and processing
`
`them in a distributed, fault-tolerant system. The resulting combination would have processed
`
`Chowdhuri’s or Matsumoto’s tables using, for example, Chowdhuri’s scanning, hashjoin,
`
`GroupBy, and sort iterators as map and reduce tasks in the MapReduce References, while
`
`employing the MapReduce References’ fault-tolerant architecture to do so. Alternatively, the
`
`resulting combination would have processed Chowdhuri’s or Mat

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.

We are unable to display this document.

PTO Denying Access

Refresh this Document
Go to the Docket