`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
` UNITED STATES PATENT AND TRADEMARK OFFICE
`
`
`
`
`
`
`BEFORE THE PATENT TRIAL AND APPEAL BOARD
`
`
`
`
`
`
`APPLE INC.,
`Petitioner,
`
`v.
`
`Zentian Limited
`Patent Owner.
`____________________
`
`Case IPR2023-00033
`Patent No. 7,587,319
`____________________
`
`
`
`DECLARATION OF DELIANG WANG, Ph.D., IN SUPPORT OF
`PATENT OWNER’S PRELIMINARY RESPONSE
`
`
`
`
`
`
`
`
`
`Case IPR2023-00033
`DECLARATION OF DELIANG WANG, PH.D
`TABLE OF CONTENTS
`
`
`Introduction ...................................................................................................... 1
`I.
`A.
`Engagement ........................................................................................... 1
`B.
`Background and qualifications .............................................................. 1
`C. Materials considered.............................................................................. 3
`Relevant legal standards .................................................................................. 4
`II.
`A.
`Person of ordinary skill in the art .......................................................... 4
`B.
`Burden of proof ..................................................................................... 6
`C.
`Claim construction ................................................................................ 6
`D. Obviousness ........................................................................................... 7
`III. Overview of the ’319 Patent ............................................................................ 8
`IV. The ’319 Patent’s limitations 46(b) and (d) .................................................... 9
`
`
`
`
`
`
`
`
`
`
`
`
`
`- i -
`
`
`
`
`
`
`
`
`
`
`
`I.
`
`I, DeLiang Wang, Ph.D., do hereby declare as follows:
`
`Introduction
`A.
`Engagement
`1.
`I have been retained by Patent Owner Zentian Limited (“Zentian” or
`
`“Patent Owner”) to provide my opinions with respect to Zentian’s Preliminary
`
`Response to the Petition in Inter Partes Review proceeding IPR2023-00033, with
`
`respect to U.S. Pat. 7,587,319. I am being compensated for my time spent on this
`
`matter. I have no interest in the outcome of this proceeding and the payment of my
`
`fees is in no way contingent on my providing any particular opinions.
`
`2.
`
`As part of this engagement, I have also been asked to provide my
`
`technical review, analysis, insights, and opinions regarding the materials cited and
`
`relied upon by the Petition, including the prior art references and the supporting
`
`Declaration of Mr. Schmandt.
`
`3.
`
`The statements made herein are based on my own knowledge and
`
`opinions.
`
`Background and qualifications
`B.
`4. My full qualifications, including my professional experience and
`
`education, can be found in my Curriculum Vitae, which includes a complete list of
`
`my publications, and is attached as Ex. A to this declaration.
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`5.
`
`I have spent my professional and academic career as a researcher in
`
`the field of speech processing and machine learning (including deep learning). I
`
`am currently a University Distinguished Scholar and Professor in the Department
`
`of Computer Science and Engineering at Ohio State University.
`
`6.
`
`I received B.S. in 1983 and M.S. in 1986 from Peking (Beijing)
`
`University, both in computer science. I received a Ph.D. in computer science in
`
`1991 from the University of Southern California.
`
`7.
`
`I have received numerous awards and honors, including the U.S.
`
`Office of Naval Research Young Investigator Award, the Best Paper Awards from
`
`the Institute of Electrical and Electronics Engineers (“IEEE”) Computational
`
`Intelligence Society and the IEEE Signal Processing Society, and the Helmholtz
`
`Award from the International Neural Network Society. I am Co-Editor-in-Chief of
`
`Neural Networks, a premier journal in the field of neural networks and deep
`
`learning, and also served as President of the International Neural Network Society.
`
`8.
`
`I am an IEEE Fellow and an ISCA Fellow. I have published 175
`
`articles in major scientific journals and more than 250 papers in leading conference
`
`proceedings. In addition, I have supervised 29 graduate students who earned their
`
`PhDs in computer science and engineering, including those currently employed by
`
`
`
`
`
`
`
`
`2
`
`
`
`
`
`
`
`
`
`
`
`
`leading IT companies preforming ASR and related work. More details are given in
`
`the attached curriculum vitae.
`
`9.
`
`I am a recognized expert in the field of robust ASR (automatic speech
`
`recognition) technology, including scientific methods, and algorithm development
`
`and testing. Robust ASR aims to develop ASR algorithms that can suppress, or
`
`remain unaffected by, background interference (such as noise). ASR algorithms
`
`developed in my laboratory have been recognized as some of the best in the world;
`
`our algorithms achieved the highest recognition rate in the CHiME-2 challenge in
`
`2016, in the CHiME-4 challenge in 2020, and the LibriCSS challenge in 2021. My
`
`research contributions and achievements in the fields of speech processing were
`
`featured in the March 2017 issue of IEEE Spectrum, the most circulated technical
`
`magazine in the world. I am one of the most published authors in peer-reviewed
`
`scientific journals in the fields of speech and audio processing.
`
`C. Materials considered
`In the course of preparing my opinions, I have reviewed and am familiar
`10.
`
`with the ’319 patent, including its written description, figures, and claims. I have
`
`also reviewed and am familiar with the Petition in this proceeding, the supporting
`
`Declaration of Mr. Schmandt, and the relied upon prior art, including Thelen, Bailey,
`
`and Chen. I have also reviewed the materials cited in this declaration. My opinions
`
`
`
`
`
`
`
`
`3
`
`
`
`
`
`
`
`
`
`are based on my review of these materials as well as my 30 years of experience,
`
`
`
`research, and education in the field of art.
`
`II. Relevant legal standards
`I am not an attorney. I offer no opinions on the law. But counsel has
`11.
`
`informed me of the following legal standards relevant to my analysis here. I have
`
`applied these standards in arriving at my conclusions.
`
`A.
`12.
`
`Person of ordinary skill in the art
`I understand that an analysis of the claims of a patent in view of prior
`
`art has to be provided from the perspective of a person having ordinary skill in the
`
`art at the time of invention of the ’319 patent. I understand that I should consider
`
`factors such as the educational level and years of experience of those working in the
`
`pertinent art; the types of problems encountered in the art; the teachings of the prior
`
`art; patents and publications of other persons or companies; and the sophistication
`
`of the technology. I understand that the person of ordinary skill in the art is not a
`
`specific real individual, but rather a hypothetical individual having the qualities
`
`reflected by the factors discussed above.
`
`13.
`
`I understand that the Petition applies a priority date of February 4, 2002,
`
`for the challenged claims, Pet. 5, and I apply the same date.
`
`14.
`
`I further understand that the Petition defines the person of ordinary skill
`
`in the art at the time of the invention as having had a master’s degree in computer
`
`4
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`engineering, computer science, electrical engineering, or a related field, with at least
`
`
`
`two years of experience in the field of speech recognition, or a bachelor’s degree in
`
`the same fields with at least four years of experience in the field of speech
`
`recognition. The Petition adds that further education or experience might substitute
`
`for the above requirements. I do not dispute the Petition’s assumptions at this time,
`
`and my opinions are rendered on the basis of the same definition of the ordinary
`
`artisan set forth in the Petition.
`
`15.
`
`I also note, however, that an ordinarily skilled engineer at the time of
`
`the invention would have been trained in evaluating both the costs and benefits of a
`
`particular design choice. Engineers are trained (both in school and through general
`
`experience in the workforce) to recognize that design choices can have complex
`
`consequences that need to be evaluated before forming a motivation to pursue a
`
`particular design choice, and before forming an expectation of success as to that
`
`design choice. In my opinion, anyone who did not recognize these realities would
`
`not be a person of ordinary skill in the art. Thus, a person who would have simply
`
`formed design motivations based only on the premise that a particular combination
`
`of known elements would be possible would not be a person of ordinary skill
`
`regardless of their education, experience, or technical knowledge. Likewise, a person
`
`who would have formed design motivations as to a particular combination of known
`
`
`
`
`
`
`
`
`5
`
`
`
`
`
`
`
`
`
`elements based only on the premise that the combination may provide some benefit,
`
`
`
`with no consideration of the relevance of the benefit in the specific context and in
`
`relation to the costs or disadvantages of that combination, would also not have been
`
`a person of ordinary skill in the art, regardless of their education, experience, or
`
`technical knowledge. In my opinion, a person of ordinary skill in the art would have
`
`been deliberative and considered, rather than impulsive.
`
`16. Throughout my declaration, even if I discuss my analysis in the present
`
`tense, I am always making my determinations based on what a person of ordinary
`
`skill in the art (“POSA”) would have known at the time of the invention. Based on
`
`my background and qualifications, I have experience and knowledge exceeding the
`
`level of a POSA, and am qualified to offer the testimony set forth in this declaration.
`
`B.
`17.
`
`Burden of proof
`I understand that in an inter partes review the petitioner has the burden
`
`of proving a proposition of unpatentability by a preponderance of the evidence.
`
`C. Claim construction
`I understand that in an inter partes review, claims are interpreted based
`18.
`
`on the same standard applied by Article III courts, i.e., based on their ordinary and
`
`customary meaning as understood in view of the claim language, the patent’s
`
`description, and the prosecution history viewed from the perspective of the ordinary
`
`artisan. I further understand that where a patent defines claim language, the
`
`6
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`definition in the patent controls, regardless of whether those working in the art may
`
`
`
`have understood the claim language differently based on ordinary meaning.
`
`D. Obviousness
`I understand that a patent may not be valid even though the invention
`19.
`
`is not identically disclosed or described in the prior art if the differences between the
`
`subject matter sought to be patented and the prior art are such that the subject matter
`
`as a whole would have been obvious to a person having ordinary skill in the art in
`
`the relevant subject matter at the time the invention was made.
`
`20.
`
`I understand that, to demonstrate obviousness, it is not sufficient for a
`
`petition to merely show that all of the elements of the claims at issue are found in
`
`separate prior art references or even scattered across different embodiments and
`
`teachings of a single reference. The petition must thus go further, to explain how a
`
`person of ordinary skill would combine specific prior art references or teachings,
`
`which combinations of elements in specific references would yield a predictable
`
`result, and how any specific combination would operate or read on the claims.
`
`Similarly, it is not sufficient to allege that the prior art could be combined, but rather,
`
`the petition must show why and how a person of ordinary skill would have combined
`
`them.
`
`21.
`
`I understand that where an alleged motivation to combine relies on a
`
`particular factual premise, the petitioner bears the burden of providing specific
`
`7
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`support for that premise. I understand that obviousness cannot be shown by
`
`
`
`conclusory statements, and that the petition must provide articulated reasoning with
`
`some rational underpinning to support its conclusion of obviousness. I also
`
`understand that skill in the art and “common sense” rarely operate to supply missing
`
`knowledge to show obviousness, nor does skill in the art or “common sense” act as
`
`a bridge over gaps in substantive presentation of an obviousness case.
`
`III. Overview of the ’319 Patent
`22. U.S. Patent 7,587,319, titled “Speech recognition circuit using parallel
`
`processors,” is directed to an improved speech recognition circuit that “uses parallel
`
`processors for processing the input speech data in parallel.” Ex. 1001, 1:4-6. The
`
`’319 patent teaches multiple processors “arranged in groups or clusters,” with each
`
`group or cluster of processors connected to one of several “partial lexical memories”
`
`that “contains part of the lexical data.” Ex. 1001, 2:64-3:3. “Each lexical tree
`
`processor is operative to process the speech parameters using a partial lexical
`
`memory and the controller controls each lexical tree processor to process a lexical
`
`tree corresponding to partial lexical data in a corresponding partial lexical memory.”
`
`Ex. 1001, 3:3-7. The ’319 patent further teaches that the invention “provides a circuit
`
`in which speech recognition processing is performed in parallel by groups of
`
`processors operating in parallel in which each group accesses a common memory of
`
`lexical data.” Ex. 1001, 3:44-47.
`
`
`
`
`
`
`
`8
`
`
`
`
`
`
`
`
`
`
`
`
`IV. The ’319 Patent’s limitations 46(b) and (d)
`23. Limitation 46(b) recites “a plurality of lexical memories containing in
`
`combination complete lexical data for word recognition, each lexical memory
`
`containing part of said complete lexical data.” A person of ordinary skill would
`
`understand limitation 46(b) to require (1) multiple lexical memories; (2) with each
`
`lexical memory containing part of the “complete lexical data,” and (3) the lexical
`
`memories together containing “complete lexical data for word recognition.”
`
`24. Limitation 46(d) recites: “said [plurality of] processors being arranged
`
`in groups of processors, each group of processors being connected to a lexical
`
`memory.” A person of ordinary skill would understand limitation 46(d) to require
`
`(1) multiple processors; (2) arranged in groups of processors; (3) with multiple
`
`groups each containing said multiple processors; and (4) with each group of multiple
`
`processors being connected to a lexical memory.
`
`25. Together, limitations 46(b) and 46(d) require multiple groups of
`
`processors, each group containing a plurality of processors, and each group
`
`respectively connected to one of multiple lexical memories, with each lexical
`
`memory containing part of the complete lexical data, and all of the lexical memories
`
`combined containing all of the complete lexical data.
`
`26. Figure 2 of the patent, annotated below, illustrates this architecture by
`
`showing two groups of lexical tree processors, with each group containing multiple
`
`9
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`processors 1-k, and each group of processors connected to a dedicated “acoustic
`
`
`
`model memory,” such that there are at least two acoustic model memories for at least
`
`two groups of processors. Fig. 2 (annotated).
`
`
`
`
`
`I also note that the ’319 patent distinguishes that said architecture from
`
`
`
`
`27.
`
`two prior known alternative designs, which the patent describes as less
`
`advantageous. The ’319 patent teaches that: “By providing a plurality of processors
`
`in a group with a common memory, flexibility in the processing is provided without
`
`being bandwidth limited by the interface to the memory that would occur if only a
`
`single memory were used for all processors. The arrangement is more flexible than
`
`the parallel processing arrangement in which each processor only has access to its
`
`10
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`own local memory and requires fewer memory interfaces (i.e. chip pins).” Ex. 1001,
`
`
`
`3:50-58. I understand from those teachings that the patent distinguishes its design
`
`from (1) a one-memory-to-all-processors design, which it describes as bandwidth
`
`(access) limited in the processor to memory interface; and (2) a one-memory-to-one-
`
`processor design, which would require more memory interfaces and afford less
`
`flexibility in controlling the extent of parallel processing of speech parameters.
`
`28.
`
`I understand that in order to meet the processor to memory architecture
`
`of the challenged claims, the Petition relies on the combination of Thelen, Bailey,
`
`and Chen. Pet. 23-33, 38-46.
`
`29. The Petition acknowledges that Thelen’s recognizers 1-3 all access a
`
`single memory storage 340. Pet at 24, 30. Thelen’s Figure 3 illustrates a single
`
`storage 340 serving each of the recognizers (alleged processors) labeled as Rec 1,
`
`Rec 2, Rec 3, and Testing Recognizer, and the single storage 340 containing all of
`
`the “models” 341-348. Ex. 1030, Fig. 3 (Petition’s annotations).
`
`
`
`
`
`
`
`
`11
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`30. As Thelen’s Fig 3 illustrates, Thelen neither teaches limitation 46(b)
`
`nor 46(d), much less the two limitations combined.
`
`31. With respect to limitation 46(b), Thelen does not teach multiple lexical
`
`memories, with each lexical memory containing part of the complete lexical data,
`
`and the lexical memories together containing “complete lexical data for word
`
`recognition.” Instead, Thelen teaches the one-memory-to-all-processors (or one-
`
`
`
`
`
`
`
`
`12
`
`
`
`
`
`
`
`
`
`memory-to-all-recognizers) design that the ’319 patent expressly distinguished from
`
`
`
`and improved upon.
`
`32. With respect to limitation 46(d), Thelen does not teach a plurality of
`
`processors “arranged in groups of processors, each group of processors being
`
`connected to a lexical memory.” Rather, Thelen teaches individual “recognizers,”
`
`which are never arranged in groups, and which are all connected to a single complete
`
`storage memory 340, rather than one of several lexical memories containing partial
`
`lexical data, as the claims require. It is useful to keep in mind that Thelen’s teachings
`
`mainly targeted huge vocabulary speech recognition, and the main innovation was
`
`to employ a plurality of large vocabulary speech recognizers, each targeting a subset
`
`of the huge vocabulary.
`
`33.
`
`I understand that the Petition also relies on two embodiments from
`
`Bailey—the embodiments of Bailey’s Figures 3A and 3B. Pet. 30-33, 36-38.
`
`34.
`
`In the embodiment of Bailey’s Fig. 3A (below), each individual
`
`processor (or pattern recognition engine, labeled PRE) is connected to its own
`
`dedicated memory (labeled MEM) such that there is one processor to one memory.
`
`
`
`
`
`
`
`
`13
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`Ex. 1031, Fig. 3 (Petition’s annotated version).
`
`35. Bailey teaches, “each PR engine 525a-525c [is] interfaced with its own
`
`private memory 615a-615c.” Ex. 1031, 11:1-3. The embodiment of Bailey’s Fig. 3A
`
`is thus the one-memory-to-one-processor design distinguished by the ’319 patent.
`
`Accordingly, while Bailey’s Fig. 3A teaches a “plurality” of memories, it does not
`
`teach the memory architecture of the ’319 patent, but rather teaches the prior art
`
`design that the ’319 patent explicitly distinguished as inferior.
`
`
`
`
`
`
`
`
`14
`
`
`
`
`
`
`
`
`
`36. The design of Bailey’s Figure 3B is identical to the other design
`
`
`
`distinguished by the ’319 patent. Bailey’s Figure 3B illustrates a design in which all
`
`of the processors PRE are connected to a single memory MEM 615. Ex. 1031, Fig.
`
`3B, 11:1-4.
`
`
`
`
`
`
`
`
`15
`
`
`
`
`
`
`
`
`
`
`
`37. The design of Bailey’s Figure 3B is thus the same as Thelen’s, i.e., one
`
`
`
`in which “only a single memory [is] used for all processors,” as distinguished by the
`
`’319 patent.
`
`38. As the ’319 patent explained, the design of Thelen and Bailey’s Fig. 3B
`
`is bandwidth limited compared to the ’319 patent’s claimed architecture. Ex. 1001,
`
`3:44-58. Thus, while Bailey’s Fig. 3B (like Thelen) teaches connecting a plurality
`
`of processors to one memory, Bailey’s does not teach the design of the challenged
`
`claims. Bailey’s Fig 3B does not teach multiple groups (clusters) of processors, each
`
`group containing multiple processors, and each group of processors connected to a
`
`shared lexical memory that contains a portion of the overall lexical data, with the
`
`combined lexical memories containing all of the lexical data.
`
`39. Thus, neither of Bailey’s two embodiments teaches the claimed
`
`innovative processor-to-memory architecture of the challenged claims.
`
`40.
`
`I understand the Petition also relies on Chen for the disclosure of
`
`multiple clusters of processors, each cluster containing multiple processors
`
`connected to a cluster shared memory. Pet. at 38-40.
`
`41.
`
`I note that Chen is not a reference in the field of speech recognition.
`
`Rather, Chen teaches a general-purpose parallel-computing architecture, designed to
`
`provide high performance for all sorts of computational tasks. Chen has no teachings
`
`
`
`
`
`
`
`
`16
`
`
`
`
`
`
`
`
`
`directed to speech recognition systems; in particular, Chen provides no teachings
`
`
`
`regarding “lexical memories,” much less “lexical memories containing in
`
`combination complete lexical data for word recognition,” with “each lexical memory
`
`containing part of said complete lexical data.” Pet. 38-40 (admitting Chen does not
`
`teach lexical memories or lexical data). Thus, Chen also does not teach the claimed
`
`design of limitations 46(b) and (d).
`
`42.
`
`I understand that the Petition relies on a series of modifications to each
`
`of Thelen, Bailey, and Chen to arrive at what the Petition calls “a common sense
`
`extension” of the actual teachings of those three references. Pet. 44. According to
`
`the Petition, Thelen, Bailey, and Chen would have been reconfigured such that:
`
`(1) Thelen’s recognizers would be modified to be in “groups,” Pet. 40;
`
`(2) Each “group” of Thelen’s recognizers would consist of multiple
`
`recognizers for the same subject matter context, e.g., multiple recognizers for
`
`“sports,” and multiple recognizers for “health,” etc., Pet. 42-43;
`
`(3) Each newly created “group” of Thelen’s recognizers would be connected
`
`to a dedicated shared memory, such that there would be multiple dedicated shared
`
`memories in total, Pet. 40;
`
`(4) Each of the dedicated shared group memories would be a lexical
`
`memory, Pet. 40;
`
`
`
`
`
`
`
`
`17
`
`
`
`
`
`
`
`(5) Each dedicated shared group memory would only store the recognition
`
`model for the same specific “context” as the recognizers in the group, Pet. 42-43;
`
`
`
`
`
`
`and
`
`(6) The shared group memories together would contain complete lexical
`
`data, Pet. 40, 43.
`
`43.
`
`I note, however, that each modification in the above theory relies on an
`
`arrangement that is not taught in Thelen, Bailey, or Chen.
`
`44. First, Thelen does not teach “groups” of multiple recognizers for the
`
`same subject matter, as the Petition’s steps (1) and (2) above require. Rather, Thelen
`
`teaches an embodiment in which “the number of recognition models corresponds to
`
`the number of recognizers; each recognizer being associated with an exclusive
`
`recognition model in a fixed one-to-one relationship.” Ex. 1030, 7:43-46, 7:30-33.
`
`In this embodiment, there is only one recognizer for each subject matter, not multiple
`
`recognizers per subject matter, as the Petition’s theory requires.
`
`45. Alternatively, Thelen teaches a “preferred” embodiment in which “the
`
`system comprises more models than active recognizers[.]” Ex. 1030, 7:47-48. In this
`
`“preferred” embodiment, there is one recognizer for multiple recognition models,
`
`not multiple recognizers for one subject matter, as the Petition’s theory requires.
`
`Thus, Thelen’s preferred embodiment teaches the opposite of what the Petition’s
`
`
`
`
`
`
`
`
`18
`
`
`
`
`
`
`
`
`
`theory would require, i.e., an architecture in which multiple recognition models are
`
`
`
`grouped to one processor (recognizer), rather than multiple processors being
`
`grouped to one recognition model (handling a single subject matter) as described in
`
`the Petition’s step (2).
`
`46. Second, as I stated earlier, none of Thelen, Bailey, or Chen teaches a
`
`design in which multiple groups of speech recognition processors, with multiple
`
`processors per group, are each connected to one dedicated lexical memory, as steps
`
`(3) and (4) of the Petition’s theory (above) require. Rather, Thelen and Bailey merely
`
`teach the prior art designs the ’319 patent expressly distinguished, and Chen
`
`provides no teachings regarding speech recognition or lexical memories at all.
`
`47. Third, none of Thelen, Bailey, or Chen teaches storing one recognition
`
`model per lexical memory, with multiple lexical memories combined storing all
`
`recognition models, as steps (5) and (6) of the Petition’s theory (above) require.
`
`Rather, Thelen teaches storing all recognition models in one storage memory 340.
`
`With respect to the embodiment of Fig. 3B, Bailey also teaches storing all lexical
`
`data in a single storage memory. As for the embodiment of Fig. 3A, Bailey also
`
`teaches storing all of its lexical data in each of the dedicated private memories, such
`
`that each private memory 615a, b, and c contains all of the lexical data, not part of
`
`the lexical data. Ex. 1031, 13:12-13 (describing “the private memory 615 which
`
`
`
`
`
`
`
`
`19
`
`
`
`
`
`
`
`
`
`holds all the prototype patterns.”); id. 11:22-23 (“Each PR engine contains its own
`
`
`
`library of prototype patterns 615a to 615c respectively.”); id. at 11:34-37 (“Refer to
`
`FIG. 3(B) which illustrates the system substantially as described with reference to
`
`FIG. 3(A) bus [sic] wherein each separate PR engine shares the same prototype
`
`library 615.”). Bailey never teaches that the private memories in Fig. 3A contain
`
`portions of the overall lexical data. And Chen teaches nothing regarding lexical
`
`memories or lexical data at all.
`
`48. Accordingly, each aspect of the Petition’s combination theory relies on
`
`arrangements and modifications that are not taught or suggested by any of Thelen,
`
`Bailey, or Chen.
`
`49. Furthermore, I disagree with the Petition’s statement that arriving at the
`
`design claimed in limitations 46(b) and (d) would have been a “common sense
`
`extension” of Thelen, Bailey, and Chen. As explained above, the Petition’s theory
`
`requires modifying numerous aspects of each of Thelen, Bailey, and Chen to arrive
`
`at a design that is not disclosed in any of those references. In my experience,
`
`modifying the teachings of three different references to arrive at a design that is not
`
`taught by any of them is not typically the product of “common sense.”
`
`50. For the same reasons, the Petition’s combination would not have been
`
`expected to yield “predictable result,” as persons of ordinary skill in the art would
`
`
`
`
`
`
`
`
`20
`
`
`
`
`
`
`
`
`
`not have found “predictable” the outcome of rearranging and redesigning Thelen,
`
`
`
`Bailey, and Chen in ways that are not taught or even hinted in any of those
`
`references.
`
`51.
`
`I am aware that the Petition contends that its proposed combination
`
`would have improved Thelen’s “efficiency.” I note, however, that the proposed
`
`modification would have required far more hardware—more processors and more
`
`memories—than what either Thelen or Bailey teaches. An ordinary artisan would
`
`have known that this would have come at significant additional cost and complexity
`
`compared to Thelen’s and Bailey’s existing systems. The Petition does not account
`
`for that consideration—or indeed, any costs or trade-offs implied by its combination.
`
`52.
`
` Finally, in my opinion the Petition’s rearrangement of Thelen, Bailey,
`
`and Chen goes well beyond the level of ordinary skill in the art. In the Petition’s
`
`combination, the artisan would have needed the skill and a reason to (1) dedicate
`
`multiple recognizers to each of Thelen’s recognition models, creating multiple
`
`groups each containing multiple recognizers, contrary to Thelen’s own teachings and
`
`motivations; (2) associate each group of recognizers with a dedicated memory,
`
`which was not taught by Thelen or Bailey; (3) ensure that each dedicated memory
`
`contained part of the lexical data, contrary to Bailey and Thelen’s teachings; and (4)
`
`ensure that the sum of the dedicated memories together contained the complete
`
`
`
`
`
`
`
`
`21
`
`
`
`
`
`
`
`
`
`lexical data, which was not even remotely taught in Thelen or Bailey. In my
`
`
`
`experience, an ordinary artisan with the qualifications and work experience
`
`described by the Petition would not have arrived at those choices through ordinary
`
`creativity. In particular, I note that the Petition identifies no logical path or
`
`motivation that would have led the ordinary artisan, using ordinary creativity, to start
`
`from Thelen, Bailey, and Chen and arrive at the design of limitations 46(b) and (d).
`
`53.
`
`I hereby declare that all statements made herein of my own knowledge
`
`are true and that all opinions expressed herein are my own; and further that these
`
`statements were made with the knowledge that willful false statements and the like
`
`are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of
`
`the United States Code.
`
`
`
`DeLiang Wang, Ph.D.
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`Executed on March 15, 2023
`
`22
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`EXHIBIT A
`EXHIBIT A
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
` UNITED STATES PATENT AND TRADEMARK OFFICE
`
`
`
`
`
`
`BEFORE THE PATENT TRIAL AND APPEAL BOARD
`
`
`
`
`
`
`APPLE INC.,
`Petitioner,
`
`v.
`
`Zentian Limited
`Patent Owner.
`____________________
`
`Case IPR2023-00037
`Patent No. 10,971,140
`____________________
`
`
`
`DECLARATION OF DELIANG WANG, Ph.D., IN SUPPORT OF
`PATENT OWNER’S PRELIMINARY RESPONSE
`
`
`
`
`
`
`
`
`
`Case IPR2023-00037
`DECLARATION OF DELIANG WANG, PH.D
`TABLE OF CONTENTS
`
`
`Introduction
`Engagement
`Background and qualifications
`
`I.
`1
`A.
`1
`B.
`1
`C. Materials considered
`3
`II. Relevant legal standards
`4
`III. Overview of the ’140 Patent
`8
`IV. Replacing Jiang’s generic processor with Chen’s supercomputer processing
`9
`V. Using each of Chen’s shared cluster memories as an acoustic model memory
`16
`VI. It would not have been obvious to configure each of Chen’s eight or more
`19
`
`Person of ordinary skill in the art
`A.
`Burden of proof
`B.
`Claim construction
`C.
`D. Obviousness
`
`4
`6
`6
`7
`
`architecture would not have been obvious
`
`for storing acoustic model data would not have been obvious
`
`processors “to compute a probability” as recited in the challenged claims
`
`
`
`
`
`
`
`
`
`
`
`- i -
`
`
`
`
`
`
`
`
`
`
`I, DeLiang Wang, Ph.D, do hereby declare as follows:
`
`I.
`
`Introduction
`A.
`Engagement
`1.
`I have been retained by Patent Owner Zentian Limited (“Zentian” or
`
`“Patent Owner”) to provide my opinions with respect to Zentian’s Preliminary
`
`Response to the Petition in Inter Partes Review proceeding IPR2023-00037, with
`
`respect to U.S. Pat. 10,971,140. I am being compensated for my time spent on this
`
`matter. I have no interest in the outcome of this proceeding and the payment of my
`
`fees is in no way contingent on my providing any particular opinions.
`
`2.
`
`As part of this engagement, I have also been asked to provide my
`
`technical review, analysis, insights, and opinions regarding the materials cited and
`
`relied upon by the Petition, including the prior art references and the supporting
`
`Declaration of Mr. Schmandt.
`
`3.
`
`The statements made herein are based on my own knowledge and
`
`opinions.
`
`Background and qualifications
`B.
`4. My full qualifications, including my professional experience and
`
`education, can be found in my Curriculum Vitae, which includes a complete list of
`
`my publications, and is attached as Ex. A to this declaration.
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`5.
`
`I have spent my professional and academic career as a researcher in
`
`the field of speech processing and machine learning (including deep learning). I
`
`am currently a University Distinguished Scholar and Professor in the Department
`
`of Computer Science and Engineering at Ohio State University.
`
`6.
`
`I received B.S. in 1983 and M.S. in 1986 from Peking (Beijing)
`
`University, both in computer science. I received a Ph.D. in computer science in
`
`1991 from the University of Southern California.
`
`7.
`
`I have received numerous awards and honors, including the U.S.
`
`Office of Naval Research Y