Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 1 of 79 PageID #:
`4614
`
`
`
`
`
`EXHIBIT 5
`
`
`
`
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 2 of 79 PageID #:
`4615
`
`Filed on behalf of: R2 Solutions LLC
`
`By: Carder W. Brooks
`Registration No. 75,456
`NELSON BUMGARDNER CONROY P.C.
`3131 W. 7th St., Suite 3000
`Fort Worth, TX 76107
`Telephone: (817) 806-3814
`Email: carder@nelbum.com
`
`
`
`UNITED STATES PATENT AND TRADEMARK OFFICE
`________________
`
`BEFORE THE PATENT TRIAL AND APPEAL BOARD
`________________
`
`Databricks, Inc.,
`
`Petitioner
`
`v.
`
`R2 Solutions LLC,
`
`Patent Owner.
`
`________________
`
`Case IPR2024-00659
`
`U.S. Patent 8,190,610
`________________
`
`
`PATENT OWNER’S PRELIMINARY RESPONSE
`
`
`
`
`
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 3 of 79 PageID #:
`4616
`
`TABLE OF CONTENTS
`
`INTRODUCTION ........................................................................................... 1
`
`BACKGROUND ............................................................................................. 2
`
`A. Overview of the ’610 Patent ........................................................................ 2
`
`1.
`
`Introduction to MapReduce ..................................................................... 2
`
`2. Shortcomings of Prior Technologies ....................................................... 6
`
`3.
`
`Improvement to MapReduce Described by ’610 Patent. ......................... 9
`
`4. The Claimed Technology of the ’610 Patent Improves MapReduce. ...13
`
`I.
`
`II.
`
`
`
`
`
`
`
`
`
`
`
`III. LEVEL OF ORDINARY SKILL ..................................................................15
`
`IV. CLAIM CONSTRUCTION ..........................................................................15
`
`V. APPLICABLE LEGAL STANDARDS ........................................................18
`
`
`A. Obviousness ...............................................................................................19
`
`1. Claims Cannot Be Found Obvious if an Element Is Absent. ................19
`
`2. A Petition Must Provide Articulated Reasoning with Rational
`Underpinning to Combine and/or Modify References. .........................20
`
`A. Overview of Primary Reference: Pike .......................................................22
`
`
`
`
`
`
`
`
`VI. SUMMARY OF THE PRIOR ART REFERENCES ....................................22
`
`
`B. Overview of Secondary References: Chowdhuri ......................................25
`
`
`VII. THE PETITION SHOULD BE DENIED UNDER 35 U.S.C. § 325(d). ......28
`
`
`A. Substantially the Same Art and Arguments Were Previously Presented
`to the Office. ..............................................................................................29
`
`B.
`
`Petitioner Has Failed to Demonstrate that the Office Committed Material
`Error. ..........................................................................................................33
`
`
`
`
`
`Patent Owner’s Preliminary Response
`
`
`
`ii
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 4 of 79 PageID #:
`4617
`
`VIII. THE PETITION DOES NOT DEMONSTRATE THAT THE
`CHALLENGED CLAIMS ARE UNPATENTABLE. .................................35
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`A. Ground 1: Pike Does Not Teach or Suggest at Least One Limitation of
`Each Independent Claim. ...........................................................................37
`
`1. Pike Does not Disclose “Data Groups.” ................................................37
`
`2. Pike Does not Disclose Delineating “Data Groups” Prior to
`Partitioning. ............................................................................................41
`
`3. Pike Does not Disclose “Intermediate Data” that Is “Identifiable” to
`a “Data Group.” .....................................................................................45
`
`4. Pike Does not Disclose “Data Groups” with Different Schemas. .........49
`
`5. Pike Does not Disclose a Key in Common Between the (1) “Data
`Groups”/Schema of the Input Data, and (2) the Intermediate Data.......52
`
`6. Pike Does not Disclose Processing Intermediate Data According to
`its “Data Group,” or Merging the Intermediate Data Based on a Key in
`Common Between the Intermediate Data and the “Data Groups.” .......54
`
`B. Grounds 2 and 3: Combining Chowdhuri with Pike Fails to Render the
`Independent Claims Obvious. ....................................................................57
`
`1. Chowdhuri Does not Disclose “Data Groups.” .....................................57
`
`2. Chowdhuri Does not Disclose Delineating “Data Groups” Prior to
`Partitioning. ............................................................................................61
`
`3. Chowdhuri Does not Disclose “Providing Each Data Partition to a
`Selected One of a Plurality of Mapping Functions.” .............................62
`
`4. Chowdhuri Does not Disclose “Intermediate Data” that Is “Identifiable”
`to a “Data Group.” .................................................................................63
`
`5. Chowdhuri Does not Disclose “Data Groups” with Different
`Schemas. ................................................................................................66
`
`Patent Owner’s Preliminary Response
`
`
`
`iii
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 5 of 79 PageID #:
`4618
`
`6. Chowdhuri Does not Disclose a Key in Common Between the (1)
`“Data Groups”/Schema of the Input Data, and (2) the Intermediate
`Data. .......................................................................................................67
`
`7. Chowdhuri Does not Disclose Processing Intermediate Data
`According to its “Data Group,” or Merging the Intermediate Data
`Based on a Key in Common Between the Intermediate Data and the
`“Data Groups.” .......................................................................................68
`
`
`
`
`
`
`IX. CONCLUSION ..............................................................................................69
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`Patent Owner’s Preliminary Response
`
`
`
`iv
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 6 of 79 PageID #:
`4619
`
`TABLE OF AUTHORITIES
`
`
`Cases
`
`Advanced Bionics, LLC v. Med-El Elektromedizinische Geräte GmbH,
` IPR2019-01469 (PTAB Feb. 13, 2020) ........................................................ passim
`
`American Airlines, Inc. et al. v. R2 Solutions LLC,
` IPR2023-00689 (PTAB Mar. 7, 2023) .................................................................35
`
`CFMT, Inc. v. YieldUp Int’l Corp.,
` 349 F.3d 1333 (Fed. Cir. 2003) ............................................................................20
`
`Garmin Int’l, Inc. v. Patent of Cuozzo Speed Techs. LLC,
` Case No. IPR2012-00001, Paper 15 (PTAB Jan. 9, 2013) ...................................20
`
`Graham v. John Deere Co.,
` 383 U.S. 1 (1966) ..................................................................................................19
`
`In re Kahn,
` 441 F.3d 977 (Fed. Cir. 2006) ..............................................................................21
`
`In re Rijckaert,
` 9 F.3d 1531 (Fed. Cir. 1993) ................................................................................20
`
`In re Royka,
` 490 F.2d 981 (C.C.P.A. 1974) ..............................................................................20
`
`InTouch Techs., Inc. v. VGo Comm’ns., Inc.,
` 751 F.3d 1327 (Fed. Cir. 2014) ............................................................................20
`
`KSR Int’l Co. v. Teleflex Inc.,
` 550 U.S. 398 (2007) ................................................................................. 20, 21, 67
`
`LG Elecs., Inc. v. Cellular Commc’ns Equip. LLC,
` Case No. IPR2016-00197, Paper 7 (PTAB April 29, 2016) ......................... 21, 68
`
`N.V. v. Abbott Labs.,
` 512 F.3d 1363 (Fed. Cir. 2008) ............................................................................21
`
`
`
`
`Patent Owner’s Preliminary Response
`
`
`
`v
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 7 of 79 PageID #:
`4620
`
`Rules, Regulations, and Statutes
`
`35 U.S.C. § 103 ........................................................................................................19
`
`35 U.S.C. § 103(a) ...................................................................................................19
`
`35 U.S.C. § 316(e) ...................................................................................................19
`
`35 U.S.C. § 325(d) ........................................................................................ 1, 28, 35
`
`37 C.F.R. § 42.108(c) ...............................................................................................19
`
`Other Authorities
`
`PTAB Consolidated Trial Practice Guide (Nov. 2019) ...........................................19
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`Patent Owner’s Preliminary Response
`
`
`
`vi
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 8 of 79 PageID #:
`4621
`
`Exhibit
`
`2001
`
`
`2002
`
`
`2003
`
`
`2004
`
`
`2005
`
`
`2006
`
`
`2007
`
`
`2008
`
`
`2009
`
`PATENT OWNER’S EXHIBIT LIST
`
`Description
`
`
`[RESERVED]
`
`
`Fundamentals of MapReduce
`
`
`MapReduce Design Patterns
`
`
`[RESERVED]
`
`
`Google’s MapReduce patent – what does it mean for Hadoop?
`
`
`Information Disclosure Statement
`
`
`US6341289
`
`
`US20060117036A1
`
`
`“Hash join in MySQL 8,” available at https://dev.mysql.com/blog-
`archive/hash-join-in-mysql-8/.
`
`
`Patent Owner’s Preliminary Response
`
`
`
`vii
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 9 of 79 PageID #:
`4622
`
`I.
`
`INTRODUCTION1
`
`The Petition should be denied under 35 U.S.C. § 325(d). The Petition presents
`
`substantially the same art and arguments that were already before the USPTO during
`
`prosecution of the ’610 patent. Thus, under Advanced Bionics, it was incumbent on
`
`Petitioner to demonstrate how the USPTO erred in a way material to patentability.
`
`See Advanced Bionics, LLC v. Med-El Elektromedizinische Geräte GmbH,
`
`IPR2019-01469, Paper 6 at 7 (PTAB Feb. 13, 2020) (precedential). But the Petition
`
`is silent as to any alleged error committed by the USPTO. The Board should thus
`
`exercise its discretion under § 325(d) and deny institution.
`
`The Petition should also be denied under 35 U.S.C. § 314(a) because it fails
`
`to establish a reasonable likelihood that the Board would find any of the challenged
`
`claims unpatentable. The ’610 patent claims novel and non-obvious enhancements
`
`to MapReduce architecture that are nowhere to be found in the prior art and would
`
`not be obvious to a POSITA. Each of the Petition’s grounds have glaring holes,
`
`
`
` Databricks represented that the Petition “raises grounds of unpatentability
`
` 1
`
`identical to those already raised in IPR2024-00303.” Petition at 1. Patent Owner’s
`
`arguments here are thus identical to those found in the Preliminary Response filed
`
`in IPR2024-00303.
`
`Patent Owner’s Preliminary Response
`
`
`
`
`1
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 10 of 79 PageID #:
`4623
`
`neither of the cited references teach or suggest key limitations of the challenged
`
`claims, and the combination of the references is self-evidently inoperable.
`
`II. BACKGROUND
`
`
`A. Overview of the ’610 Patent
`
`
`
`The ’610 patent, granted to Yahoo!, claims a novel MapReduce architecture
`
`that can be applied to affect mapping and reducing across data sets comprising data
`
`of different schema. Such architecture, adapted to receive and process data from
`
`heterogenous sources, greatly expanded the applicability of MapReduce to
`
`dramatically improve big data analytics. This improvement was significant at the
`
`time of the ’610 patent (and remains significant today), enhancing the overall
`
`capabilities of computer functionality as it relates to distributed database processing.
`
`
`
`1. Introduction to MapReduce
`
`As laid out in the ’610 specification, the concept of MapReduce consists of a
`
`“‘map’ function [that] maps key-value pairs to new (intermediate) key-value pairs,”
`
`and a “‘reduce’ function [that] represents all mapped (intermediate) key-value pairs
`
`sharing the same key to a single key-value pair or a list of values.” ’610 patent at
`
`1:17-20. At its core, “MapReduce implementations enable the use of massive
`
`Patent Owner’s Preliminary Response
`
`
`
`
`2
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 11 of 79 PageID #:
`4624
`
`clusters of commodity computers to provide a simplified programming and
`
`execution model for processing large sets of data in parallel.” Id. at 9-12.
`
`In a simple example found in U.S. Patent No. 7,590,620 to Pike (“Pike,”
`
`Petitioner’s primary reference), MapReduce can be used to “count[] the number of
`
`occurrences of each distinct word in a large collection of documents.” Ex. 1002 at
`
`10:36-39. In this example, the map function “outputs a key/value pair for every word
`
`w in every document in the collection, where the key/value pair is <w, 1>.” Id. at
`
`10:40-43. It is likely that with a voluminous number of documents, each map
`
`function “will produce hundreds or thousands of records of the form <word, 1>.” Id.
`
`at 10:51-52. After the mapping phase, the reduce functions “sort the intermediate
`
`data values, merge or otherwise combine sorted intermediate data values having the
`
`Patent Owner’s Preliminary Response
`
`
`
`
`3
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 12 of 79 PageID #:
`4625
`
`same key.” Id. at 8:16-19. In this example, the effect is that “[t]he Reduce function
`
`simply adds up the count values.” Id. at 10:52-53.
`
`The below graphic illustrates the word count example discussed in Pike:
`
`
`
`Ex. 2002 at 5. In this example, the input data consists of a collection of words (Deer,
`
`Bear, River, and Car). Applying traditional MapReduce in the “Splitting” step, the
`
`input data is split to be supplied to different map processes. Next, in the “Mapping”
`
`step, each map function creates lists of key-value pairs. A key-value pair works much
`
`like a dictionary—the words are the “keys,” and the definitions are the “values.” In
`
`the example above, each mapping function generates a list of key-value pairs where
`
`the words are the “keys” and the values are “1” to indicate that each word, as a
`
`discrete item, occurs one time (e.g., “Deer, 1”). See Ex. 1002 at 10:40-43. Obviously,
`
`some of the words occur multiple times, and that is accounted for later in the process.
`
`At the “Shuffling” step, all key-value pairs having the same key are grouped together
`
`to ensure that pairs with the same key are submitted to the same reducers. At the
`
`Patent Owner’s Preliminary Response
`
`
`
`
`4
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 13 of 79 PageID #:
`4626
`
`“Reducing” step, the reducers determine how many occurrences of that particular
`
`key exist for the received data and can generate another key-value pair, accordingly.
`
`See id. at 8:16-19. Ultimately, the output in this example is a list of all of the key-
`
`value pairs from each reducer that represents the number of times the word appears
`
`in the input text (e.g., “Deer, 2” indicating that the word “Deer” appeared twice).
`
`See id. at 10:52-53; 8:16-19.
`
`MapReduce can also be used to accomplish other tasks, such as “joins” or
`
`“merges” of data or lists that already exist as key/value pairs. See id. at 9:65-66 (“The
`
`input data blocks . . . may in some embodiments be treated as key/value pairs . . .”).
`
`The below graphic illustrates such a merger:
`
`Ex. 2003 at 8.
`
`Patent Owner’s Preliminary Response
`
`
`
`
`
`
`5
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 14 of 79 PageID #:
`4627
`
`Here, the input data is already provided as a list of key-value pairs. In this
`
`example, the input data is a list of features in a graph (e.g., roads and intersections)
`
`that are numbered; the numbers are the keys and the feature types are the values. In
`
`the mapping phase, new keys are assigned that correspond to the intersection that
`
`the particular feature participates in (e.g., roads 1, 2, and 5, and intersection 3, all are
`
`involved with intersection 3). See Ex. 1002 at 7:29-43. Thus, in this example, the
`
`key generated in the mapping phase is the intersection ID, and the value is the key-
`
`value pair corresponding to the graph feature. The values are shuffled and passed to
`
`the reducers based on their common key. See id. at 8:16-19. Then, the reducers
`
`“reduce” the data to yield the output data, i.e., the reducers can consolidate all of the
`
`features pertaining to a particular common key, such that all of the features
`
`participating in intersection 3 fall with intersection 3, and all of the features
`
`participating in intersection 6 fall with intersection 6. See id.
`
`As is apparent from the above examples, MapReduce offers a way to take
`
`large, unwieldy datasets and extract quantifiable and useful
`
`information
`
`algorithmically. As discussed in the ’610 patent, however, in 2006, MapReduce had
`
`significant limitations that restricted its usefulness.
`
`2. Shortcomings of Prior Technologies
`
`The ’610 patent discloses shortcomings in the prior art and then explains, in
`
`detail, the technical way the claimed inventions resolve or overcome those
`
`Patent Owner’s Preliminary Response
`
`
`
`
`6
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 15 of 79 PageID #:
`4628
`
`shortcomings. The specification discusses
`
`that “conventional MapReduce
`
`implementations do not have facility to efficiently process data from heterogeneous
`
`sources” and that “it is impractical to perform joins over two relational tables that
`
`have different schemas.” ’610 patent at 3:9-20. “Schema” can refer to, e.g., “fields”
`
`in a particular collection of data. For example, in Figure 3 of the ’610 patent, the
`
`“All Emp” table (302) and “All Dept” table (304) have different “schema” because
`
`the “All Emp” table has the fields “DeptID” and “EmpName,” while the “All Dept”
`
`table has the fields “DeptID” and “DeptName”:
`
`
`
`Id. at FIG. 3; 3:19-20.
`
`The specification explains that traditional MapReduce cannot efficiently join
`
`the “All Emp” table (302) and “All Dept” table (304) to achieve the “All Emp and
`
`Patent Owner’s Preliminary Response
`
`
`
`
`7
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 16 of 79 PageID #:
`4629
`
`Dept” table (306). For example, the below graphic demonstrates how traditional
`
`MapReduce would process the “All Emp” and “All Dept” tables:
`
`
`
`In this example, the “key” for the mapping phase is the “DeptID” (which,
`
`mirroring the merger example in Ex. 2003, is a field that each entry shares or
`
`participates in) followed by a value of “DeptID, EmpName” or “DeptID,
`
`DeptName” (again, mirroring the merger example above). As can be seen here, even
`
`though there is a key-in-common amongst the “All Emp” and “All Dept” data sets
`
`at the input, there is no joining of the data in any usable way. Instead, the output is
`
`a collection of jumbled lists that show both employee names and department names
`
`for a single key instead of indicating, e.g., DeptID, employee name, and the
`
`associated department name. For instance, the output list for key 34 in this example
`
`contains “Clerical” (a department name) mixed with “Smith,” “Robinson,” and
`
`“Jasper” (employee names)—there is no merger of the two tables to reflect, e.g., “34,
`Patent Owner’s Preliminary Response
`8
`
`
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 17 of 79 PageID #:
`4630
`
`Smith, Clerical.” As datasets get bigger and more complex, this problem would only
`
`be compounded.
`
`
`
`3. Improvement to MapReduce Described by ’610 Patent.
`
`The ’610 patent offers an improvement over traditional MapReduce through
`
`the use of “data groups.” The specification summarizes this enhancement:
`
`In accordance with an aspect, an input data set is treated as a
`plurality of grouped sets of key/value pairs, which enhances the utility
`of
`the MapReduce programming methodology. Utilizing such
`grouping, map processing is carried out independently on two or more
`related datasets (e.g., related by each being characterized by a schema
`with a key in common). The intermediate results of the map processing
`(key/value pairs) for a particular key are processed together in a single
`reduce function by applying a different iterator to intermediate values
`for each group. Different iterators can be composed inside reduce
`functions in ways however desired.
`Thus, for example, the enhanced MapReduce programming
`methodology may be easily employed to carry out distributed relational
`database processing.
`
`’610 patent at 1:31-44.
`
`The specification goes on to explain how implementation of “data groups,”
`
`realizes these improvements:
`
`In general, partitioning the data sets into data groups enables a
`mechanism to associate (group) identifiers with data sets, map
`
`Patent Owner’s Preliminary Response
`
`
`
`
`9
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 18 of 79 PageID #:
`4631
`
`functions and iterators (useable within reduce functions to access
`intermediate data) and, also, to produce output data sets with (group)
`identifiers. It is noted that the output group identifiers may differ from
`the input/intermediate group identifiers.
`
`Id. at 3:58-64.
`
`The ’610 patent provides a helpful schematic that illustrates how the “data
`
`groups” improve MapReduce:
`
`Id. at FIG. 5 (annotated).
`
`In the above schematic, the input data in portion 502 consists of the Employee
`
`table 302 and the Department table 304 from Figure 3, which share a common key
`Patent Owner’s Preliminary Response
`10
`
`
`
`
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 19 of 79 PageID #:
`4632
`
`(i.e., DeptID). See id. at 8:15-24. Each of these “data groups” include a “mechanism
`
`for identifying data from that group”2 that is depicted as following the data through
`
`the mapping phase 504 and into the reducing phase 510 (i.e., “Emp” and “Dept”
`
`emphasized in green). As shown here, the data of each of the “data groups” are
`
`partitioned into a “plurality of data partitions” in between portions 502 and 504, and
`
`each partition is provided to a “selected one of a plurality of mapping functions” in
`
`portion 504. The mapping functions form “intermediate data” at portion 506, and the
`
`“intermediate data” is shown as being “identifiable” to its parent “data group” via
`
`the “Emp” and “Dept” mechanisms. The “intermediate data” also shares the
`
`common key (DeptID) with the two “data groups.” After sorting in portion 508,
`
`“intermediate data” corresponding to each “data group” is provided to “reduce
`
`functions within the portion 510.” Id. at 8:34-37. By “processing the intermediate
`
`data for each data group in a manner that is defined to correspond to that data group”
`
`in the reducing phase, portion 512 depicts data that is “a merging of the
`
`
`
` As discussed in more detail below, the Eastern District of Texas construed the term
`
` 2
`
`“data group” to mean “a group of data and a mechanism for identifying data from
`
`that group.” Petitioner purports to apply this construction. See Petition at 10. The
`
`Board should likewise adopt that construction.
`
`Patent Owner’s Preliminary Response
`
`
`
`
`11
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 20 of 79 PageID #:
`4633
`
`corresponding different intermediate data based on the key in common” (i.e., table
`
`306 in Figure 3).
`
`Thus, the ’610 patent provides a clear technological improvement to existing
`
`MapReduce systems via a novel MapReduce architecture where mapping and
`
`reducing functions can be applied to data from heterogeneous data sources (i.e., data
`
`sources having different schema) to accomplish the merger of heterogeneous data
`
`based on a key in common among the heterogeneous data. The specification goes so
`
`far as to provide specific examples of how a “data group”-oriented MapReduce
`
`architecture can be deployed in programming. The ’610 patent shows, for example,
`
`that a “data group” identifier can be passed as a parameter to a mapping function and
`
`thereby travel through to the resulting intermediate data, followed by a reduce
`
`function that employs the “data group” identifier to call particular programming
`
`objects (e.g., iterators) associated with specific “data groups”:
`
`
`
`
`
`Id. at 6:29-33, 39-48. Figures 3-5 illustrate a similar embodiment. In this
`
`embodiment, the “data group” identifier (annotated in red) is indicated to the
`
`Patent Owner’s Preliminary Response
`
`
`
`
`12
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 21 of 79 PageID #:
`4634
`
`mapping and/or reduce functions, such that different “data groups” can be mapped
`
`differently and/or processed in a way that corresponds to a particular “data group”:
`
`
`
`Id. at 7:45, 55, 64-65.
`
`4. The Claimed Technology of the ’610 Patent Improves
`MapReduce.
`
`The independent claims of the ’610 patent claim the improved MapReduce
`
`
`
`architecture outlined in the specification. A crucial feature is the implementation of
`
`what is referred to as a “data group.” As explained above, by utilizing “data groups,”
`
`a data set with data of different schema can be fed to a collection of mapping
`
`functions to ultimately be reduced and/or merged because the “mechanism for
`
`identifying data from that group” (which is part of the “data group”) follows the data
`
`through the mapping phase and into the reducing phase. Thus, the MapReduce
`
`architecture can use specialized processing based on the “data group” from which
`
`the data being reduced originated. These claimed features enable the MapReduce
`
`architecture in the reducing phase to accomplish the merger of intermediate data
`
`hailing from different data groups.
`
`For example, Claim 1 describes that “data groups,” each with a different
`
`schema, are partitioned, and the resulting partitions are provided to “a selected one
`Patent Owner’s Preliminary Response
`13
`
`
`
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 22 of 79 PageID #:
`4635
`
`of a plurality of mapping functions” such that “the data of the first data group is
`
`mapped differently than the data of the second data group.” Ex. 1026 at 1. This
`
`requires that the architecture differentiates between “data groups” at the mapping
`
`phase to obtain “intermediate data” on a per-“data group” basis, which avoids
`
`jumbling of data having different schemas into the same map and thus eliminating
`
`the usefulness of the reducing phase. After partitions of each of the “data groups”
`
`are provided to selected mapping functions, the claim describes that the mapping
`
`functions output “intermediate data” for a particular “data group” and that the
`
`“different schema [of the data groups] and corresponding different intermediate data
`
`have a key in common.” Id. This “key in common” requirement is necessary,
`
`because it ultimately serves as the key on which the data of the “data groups” is
`
`merged in the reducing phase.
`
`Claim 1 further requires that the “intermediate data” be “identifiable” to its
`
`parent “data group” (i.e., via the “mechanism for identifying data from that [data]
`
`group”). Id. As revealed later in the claim, the MapReduce architecture “process[es]
`
`the intermediate data for each data group in a manner that is defined to correspond
`
`to that data group, so as to result in a merging of the corresponding different
`
`intermediate data based on the key in common.” Id. In other words, the reduce
`
`functions recognize that received “intermediate data” stems from particular “data
`
`groups,” and can therefore employ processing that corresponds to each “data group”
`
`Patent Owner’s Preliminary Response
`
`
`
`
`14
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 23 of 79 PageID #:
`4636
`
`to accomplish a combined reducing phase that merges the data (each with a different
`
`schema) together. The specification provides helpful context regarding this step,
`
`disclosing that, in one embodiment, the reducing phase can employ “data group”-
`
`specific iterators to accomplish merger, and this is only possible because of the
`
`improved MapReduce architecture’s implementation of “data groups.” See, e.g.,
`
`’610 patent at 6:35-7:23. Without “intermediate data” that is identifiable to a
`
`particular “data group,” Claim 1’s MapReduce architecture would not be able to
`
`distinguish one type of “intermediate data” from another, which would (in one
`
`example) lead to the useless amalgamation of data discussed in Section II.A.2 above.
`
`Hence, the claimed features describe a MapReduce architecture that accounts
`
`for different “data group” schema to accomplish a reduction and merger of data
`
`having different schema. As explained in the specification, such task was ineffective
`
`using traditional MapReduce architectures. Id. at 3:9-20
`
`III. LEVEL OF ORDINARY SKILL
`
`
`For the limited purpose of this Preliminary Response, Patent Owner does not
`
`contest Petitioner’s definition of a person of ordinary skill in the art, but it reserves
`
`the right to do so in the event that trial is instituted.
`
`
`
`IV. CLAIM CONSTRUCTION
`
`
`Patent Owner’s Preliminary Response
`
`
`
`
`15
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 24 of 79 PageID #:
`4637
`
`The Eastern District of Texas has twice construed terms of the ’610 patent,
`
`ordering the following constructions (see Ex. 1007 at 13-31; Ex. 1008 at 8-19):
`
`Term/Phrase
`
`Agreed Construction
`
`“a plurality of mapping functions
`that are each user-configurable”
`
`“two or more mapping functions that are each
`configurable by a user”
`
`“data group”
`
`“a group of data and a mechanism for
`identifying data from that group”
`
`“a method of processing data of a
`data set over a distributed system,
`wherein the data set comprises a
`plurality of data groups,
`the
`method comprising”
`
`Limiting preamble in Claim 1
`
`“identifiable
`group”
`
`to
`
`[that/the] data
`
`plain meaning
`
`“independently output a plurality
`of lists of values”
`
`plain meaning
`
`Patent Owner’s Preliminary Response
`
`
`
`
`16
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 25 of 79 PageID #:
`4638
`
`“the data of the first data group is
`mapped differently than the data of
`the second data group”
`
`plain meaning
`
`“data set”
`
`plain meaning
`
`“data partition”
`
`plain meaning
`
`plain meaning
`
`“including processing the
`intermediate data for each data
`group in a manner that is defined
`to correspond to that data group,
`so as to result in a merging of the
`corresponding different
`intermediate data based on the key
`in common”
`
`“the at least one output data group
`is a plurality of output data groups”
`
`plain meaning
`
`
`
`R2 and Databricks have not yet engaged in claim construction proceedings in
`
`their dispute in the Eastern District of Texas.
`
`For the purposes of this Preliminary Response, Patent Owner submits that the
`
`Patent Owner’s Preliminary Response
`
`
`
`
`17
`
`

`

`Case 4:23-cv-01147-ALM Document 62-13 Filed 11/12/24 Page 26 of 79 PageID #:
`4639
`
`Board should adopt and apply the Eastern District of Texas’s construction of “data
`
`group” as “a group of data and a mechanism for identifying data from that group.”
`
`Petitioner itself purports to apply this construction.3 See Petition at 10. And this
`
`construction of “data groups” is proper in light of the patent itself. As explained by
`
`the court, the specification discloses that partitioning data “into data groups enables
`
`a mechanism to associate (group) identifiers” with the data (’610 patent at 3:58-59),
`
`and thus, “the word ‘group’ inherently implies some mechanism to identify the
`
`group. . . . Finding otherwise would potentially leave the scope of ‘data group’
`
`amorphous, and some construction of the disputed claim language will assist” in
`
`understanding the claims. Ex. 1007 at 24-25 (internal citations omitted). The Board
`
`should accordingly construe “data group” to mean “a group of data and a mechanism
`
`for identifying data from that group.”
`
`
`V. APPLICABLE LEGAL STANDARDS
`
`
`The Board may only grant a petition for inter partes review where “the
`
`information presented in the petition … sho

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.

We are unable to display this document.

PTO Denying Access

Refresh this Document
Go to the Docket