`
`[191‘
`
`||||l|lllllllllllIllllIlllllllllllllllllll|||||llllllllllIlllllllllllllllll
`5,307,266
`
`[11] Patent Number:
`
`USOOS307266A
`
`‘
`
`Hayashi et a1.
`Apr. 26, 1994
`[45] Date of Patent:
`
`[54]
`
`[75]
`
`INFORMATION PROCESSING SYSTEM
`AND METHOD FOR PROCESSING
`DOCUMENT BY USING STRUCTURED
`KEYWORDS
`
`FOREIGN PATENT DOCUMENTS
`0032194A1
`7/1981 European Pat. Off.
`.
`0280866A2
`9/1988 European Pat. Off.
`.
`0361464A2 4/1990 European Pat. Off.
`.
`
`Inventors: Takehisa Hayashi, Sagamihara;
`Kouki Noguchi, Kokubunji; Tsuneya
`Kurihara, Tokyo; Masahiro Abe,
`Iruma, all of Japan
`
`Primary Examiner—Roy N. Envall, Jr.
`Assistant Examiner—A. Bodendorf
`Attorney, Agent, or Firm—Antonelli, Terry, Stout &
`Kraus
`
`[73] Assignee: Hitachi, Ltd., Tokyo, Japan
`
`[57]
`
`ABSTRACT
`
`[21] App]. No.': 741,750
`
`[22] Filed:
`
`Aug. 7, 1991
`
`Foreign Application-Priority Data
`[30]
`Aug. 22, 1990 [JP]
`Japan .................................. 2-219039
`
`
`[51]
`Int. c1;s .............................................. G06F 15/40
`[52] US. Cl. ....................... 364/419.07; 364/419.17
`[58] Field of Search ......................... 364/419; 395/600
`
`[56]
`
`References Cited
`U.S. PATENT DOCUMENTS
`
`4,868,733 9/1989 Fujisawa et al. ..
`
`4,958,284 9/ 1990 Bishop et al.
`..
`4,972,349 1 1/1990 Klienberger .......
`
`4,991,087
`2/1991 Burkowski et al.
`..... 364/200
`......
`. .364/900
`4,992,972 2/ 1991 Brooks et al.
`
`..... 364/419
`5,099,426 3/1992 Carlgren et a].
`
`..... 395/600
`..
`5,123,103 6/1992 Ohtaki et a].
`
`5,168,565 12/1992 Morita ................................. 395/600
`
`A document processing system for processing docu-
`ments by using structured keywords comprises an out-
`put system and a receiver system. The output system
`includes a first storage for storing a structured keyword
`dictionary containing structured keywords among
`which relations are systematically structured, and link-
`age unit providing linkage information for establishing
`correspondences between constituent parts of an input
`document and corresponding ones of the keywords.
`- The receiver system is coupled to the output system and
`includes a second storage for storing structured key-
`words among which relations are systematically struc-
`tured, and retrieving unit having inputs supplied with
`the document and the linkage information for retrieving
`the document to thereby form data of a predetermined
`edition format by using the structured keyword read
`out from the second storage. Data transfer between the
`output system and the receiver systems can be per-
`formed either on-line or off—line.
`
`15 Claims, 17 Drawing Sheets
`
`IZOOINFMIATlON SEMI! SYSTE'M
`
`W Fir-«W
`
`
`
`
`
`
`
`mmnmSEWINGUSER
`
`IFFIXIM UNIT
`
`
`
`JFIII
`llYERFACE COIIUOIICATM
`
`
`
`
`
`PM OTHER
`lUWTION
`SEMI“ USERS
`
`,20I momma sacrum mm
`
`' §COIMICM’ION”ETD"
`
`
`
`union-mm"I!
`
`
`
`Page 1 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 1 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 1 of 17
`
`5,307,266
`
`FIG.
`
`I
`
`_
`
`
`,200_I NFORMATION SENDER SYSTEM
`Io RETRIVAL INFORMATION
`
`3|
`
`%
`2
`
`E
`
`5
`«n
`g
`f-
`
`5
`g
`—
`
`z
`%
`=
`2
`E
`0
`82
`E
`<
`g
`‘5‘
`
`DOCUMENT — .
`
`.
`
`.
`
`§
`
`E
`
`I“:
`E
`35
`a)
`
`
`
`
`3
`
`
`DOCUMENT—
`KEYWORD LINK
`AFFIXING UNIT
`
`BUFFER
`
`000” ENL‘
`
`5
`
`I; g
`2 g
`g 5
`2 .—
`
`8 —
`
`|
`
`.
`
`
`
`gCOMMUNICATIONNETWORK
`
`
`
`
`
`A
`I _
`STRUCTUR D
`.
`KEYWORD
`DICTIONARY
`
`FROM OTHER
`INFORMATION
`SENDING USERS
`
`I
`.
`I
`
`,20I INFORMATION RECEIVER SYSTEM
`-____L__ __________-______. -
`
`I3I
`
`,
`‘
`13930;: °
`I02 DICTIONARY
`
`'°'
`
`,
`
`.
`
`'
`
`HO RETRIEVAL
`INFORMATION STORAGE
`,
`
`I2I
`
`4'
`
`
`u:
`8
`k
`F
`'5
`'
`n:
`2‘»
`
`I
`
`'
`
`
`
`
`I‘—
`
`z
`g
`¢ m
`o 2
`E u.
`if?
`5 a
`u
`
`
`
`
`
`DOCUMENT
`DATA BUFFER
`
`
`'50
`COLLECTED
`TA STORAG
`———————_.-———._______..———.——
`
`Page 2 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 2 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 2 of 17
`
`5,307,266
`
`FIG. 2
`
`STRUCTURED KEYWORD
`
`KEYWCRD OF HIGHER
`RANK CONCEPT
`
`LINK
`
`KEYWORD
`
`INK
`
`stwoao or LOWER .
`RANK CONCEPT
`
`
`
`
`
`
`
`
`svuonm KEYWORD
`
`' ll
`
`
`
`Page 3 of 26
`
`' MINDGEEK EXHIBIT 1006
`
`Page 3 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 3 of 17
`
`5,307,266
`
`MIC OPROCESSOR
`
`CHIP NAMEIID)
`
`FIG. 3
`
`STRUCT
`
`K
`
`ARCHITECTURE
`
`"CISC
`
`$1.5, SERIES
`
`86 SERIES
`
`68 SERIES
`
`RISC
`
`3-3- SERIES
`CC
`“" ERIES
`
`DATE
`
`EM— SERIES
`k DATE OF PUBLICATION (MONTH,YEAR)
`
`I
`
`DATE OF SHIPPING (MONTH ,YEAR)
`
`MANUFACTURER NAME
`
`COMPANY A
`
`COMPANY B
`
`1
`
`COMPANY c
`
`PERFORMANCE
`
`OPERATION PERFORMANCE (MIPS)
`
`CLOCK FREQUENCY (MHZ)
`
`\
`\
`
`POWER CONSUMPTION (WI
`
`SEMICONDUCTOR TECHNOLOGY
`
`‘
`
`PROCESS
`
`kCMOS
`
`ECL
`
`II
`
`THE GENERATION
`
`kaopm
`
`‘_
`
`I. 3pm
`
`|l
`
`Page 4 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 4 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 4 of 17
`
`5,307,266
`
`FIG. 4A
`
`START
`
`INPUTTING 0F KEYWORD
`REPRESENTING TITLE
`OF DOCUMENT
`
`
`
`
`
`40|
`
`
`
`
`EXTRACTION OF KEYWORD
`THROUGH MATCHING
`PROCESSING OF STRUCTURED
`
`
`KEYWORD DICTIONARY
`AND DOCUMENT DATA
`
`
`402
`
`SELECTION OF KEYWORDS
`BY USER
`
`403
`
`
`
`FORMING 0F LINK TO
`KEYWORD CORRESPONDING
`DESCRIPTION CANDIDATE
`
`EYA'FAARSING OF DOCUMENT
`
`
`
`MANUAL
`DESIGNATION
`
`
`OF KEYWORD
`
`
`
`
`
`CONFIRMATION OF VALIDITY
`
`OF FORMED LINK.
`ERROR CORRECTION AND
`ADDITIONAL DOCUMENT
`INPUT BY USER
`
`
`
` KEYWORDS
`
`ARE SUFFICIENT
`
`
`?
`
`406
`
`407
`
`Page 5 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 5 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`, Sheet 5 of 17
`
`5,307,266
`
`FIG. 48
`
`FIG. 4C
`
`DATA COLLECTON 40
`
`PREPARATORY
`
`PROCESSING
`
`0.0
`
`EDITING
`
`caIECTIoN OF DATA
`
`[BY RETRIEVAL AND
`
`
`
`
`DATA COLLECTION
`PREPARATORY
`PROCESSING
`
`
`
`4000
`
`
`
`
`
`DATA COL L ECTION
`
`PROCESSING
`FIG. 4
`
`
`
`DESIGNATION OF
`
`EDITING FORMAT
`
`'4IOO
`
`
`
`
`4200
`
`DESIGNATION OF
`RETRIEVAL-ORIENTED
`ITEM, CONDITION AND
`
`STRUCTURED KEYWORD
`
`(FIG. 4E)
`‘
`
`
`
`
`
`
`Page 6 of 26
`
`—
`
`MINDGEEK EXHIBIT 1006
`
`Page 6 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 6 of 17
`
`‘
`
`5,307,266
`
`FIG. 4D [DESIGNATION OF T400
`
`EDITING FORMAT
`
`INPUTTING 0F
`TABLE HEADER
`ITEMS
`
`RETRIEVAL
`
`PREPARATION OF
`FIELD FOR
`DESIGNATING
`CONDITION AND
`ITEMs FOR
`
`4'20
`
`4I30
`
`FIG. 4E
`
`DESIGNATION 0F RETRIEVAL—
`ORIENTED ITEMS. CON DITION AND
`STRUCTURED KEYWORD
`
`4200
`
`
`DESIGNATIONDF CONDITION FOR
`RETRIEVAL IN CONDITION FIELD OF
`
`
`EDITING FORMAT (FIG. 4F)
`
`
`
`
`DESIGNATE OF ITEM FOR RETRIEVAL IN
`ITEM FIELD OF EDITING FORMAT
`
`
`
`(FIG. 4m
`
`4I20
`
`42 20
`
`Page 7 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 7 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 7 of 17
`
`5,307,266
`
`FIG. 4F
`DESIGNATION 0F CONDITION FOR
`42IO
`RETRIEVAL IN CONDITION FIELD
`
`@D
`
`
`
`
`INPUTTING OR SELECTION OF
`CANDIDATES FOR ONE OR MORE
`KEYWORDS REPRESENTING TITLE
`
`
`OF DOCUMENT FOR RETRIEVAL
` PRESENTATION OF
`
`
`KEYWORD CANDIDATE FOR
`
`CORRECTING KEYWORD
`
`GIVEN BY USER SO AS TO
`
`MATCHING PROCESSING OF KEYWORD
`BE INCLUDED IN ST'
`'m
`
`
`CANDIDATES WITH KEYWORDS IN
`
`KEYWORD DICTIONARY
`STRUCTURED KEYWORD DICTIONARY
`
`42I2-I
`
`
`
`YES
`
`.
`
`DETERMINATION OF DOMAIN or
`STRUCTURED KEYWORD DICTIONARY
`
`FOR RETRIEVAL
`
`42!?)
`
`
`
`ESTABLISHMENT OF CCNDIT ION FOR
`RETRIEVAL BY INPUTTING OR
`SELECTION OF KEYWORD
`
`MATCHING PROCESSING WITH KEYWmDS
`IN STRUCTURED KEYWORD DICTIONARY
`
`KEYMORO CANDIDATES
`
`CONFORM INC TO
`STRUCTURED KEYWORD
`DICTIONARY
`
`
` PRESENTATION OF
`
`
`
`MATCHING PROCESSIIG OF IEYWORDS
`OF HIGHER AND LOWER RANK
`CONCEPTS
`‘
`
`
`
`42I6-I
`
`.
`
`YES
`
`DETERMINATION OF CONDITION
`FOR RETRIEvAL
`
`42l7
`
`Page 8 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 8 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 8 of 17
`
`5,307,266
`
`FIG. 46
`
`[DESIGNATION 0F ITEMS FOR].r4220
`
`RETRIEVAL IN ITEM FIELD
`
`
`
` DESIGNATION OF ITEMS FOR
`
`MATCHING PROCESSING WITH
`
`RETRIEVAL BY INPUTTING 0R
`
`SELECTION OF KEYWORDS
`
`
`
`
`PRESENTATION OF
`KEYWORD CANDIDATES IN
`
`
`CONFORMANCE WITH _
`
`
`STRUCTURED KEYWORD
`
`4222 , DICTIONARY
`
`
`
`KEYWORDS IN STRUCTURED
`
`
`
`KEYWORD DICTIONARY 4224
`4223
`
`
`
`YES
`
`DETERMINATION OF ITEMS
`FOR RETRIEVAL
`
`Page 9 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 9 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 9 of 17
`
`5,307,266
`
`FIG. 4H
`
`[DATA COLLECTION PROCESSINQ’GmO
`
`RETRIEVAL THROUGH MATCHING
`PROCESSING OF CONDITION FOR RETRIEVAL ’
`WITH STRUCTURED KEYWORDS
`
`BIOO
`
`
`
`
`GIOO-I
`
`OINCIDENC
`IS FOUND ?
`
`YES
`
`FIG. 41
`
`EDITION OF DATA IN ACCORDANCE 6200
`WITH EDITING FORMAT
`
`
`
`
`
`
`
`MATCHING PROCESSING OF STRUCTURED
`KEYWORDS OF ITEMS FOR RETRIEVAL AND
`THOSE OF RETRIEVAL INFORMATION
`
`
`
`EXTRACTION OF DOCUMENT CONSTITUENT
`CORRESPONDING TO KEYWORD BY USING
`
`
`DOCUMENT-KEYWORD LINKAGE
`INFORMATION
`
`
`
`ENTRY OF KEYWORD CORRESPONDING
`DOCUMENT CONSTITUENT IN ITEM FIELD
`IN CONFORMANCE WITH EDITION FORMAT
`
`
`
`
`62|O
`
`6220
`
`6230
`
`Page 10 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 10 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 10 of 17
`
`5,307,266
`
`FIG. 5A
`
`THIS TIME. COMPANY A HAS QEXELQEEQ A HIGH- PERFORMANCE
`‘31
`2
`
`Llh0F fl§_QSTRUCTURE ADOPTING _|._§_F"‘IL
`
`MICROPROCESSOR
`
`.....
`
`___|____
`1%
`I
`CM08.
`THE PROCESSOR HAS INTEGER OPERATION PERFORMANCE OF
`.1
`.
`I_|§_I
`M
`
`’87 :_\
`'*3l MAY,
`L__-_§L__..I 55
`
`FIG. 58
`. MICROPROCESSOR
`°DEVELOPMENT
`(KEYWORDS DESIGNATED
`BY SENOER SYSTEM
`AS SUBJECT MATTER
`OF DOCUMENT)
`
`FIG. 5C
`[DESIGNATED
`
`STRUCTURED KEYWORD
`
`:I
`
`MICROFROCESSOR L
`MANUFUPACTURER
`NAME LI
`SEMICONDUCTOR
`TECHNOLOGY
`
`FIG. SD
`
`”xMWEL
`mammrg
`
`THE
`GENERATION g
`PROCESS I;
`
`ARCHITECTURE L4.
`
`PERFORMANCE
`
`OPERATION
`PERFORMANCEIMIPS)§
`
`CHIP NAME g1
`
`DATE
`
`\DATE OF PUBLICATION
`(MONTH, YEAR) g
`
`Page 11 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 11 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 11 of 17
`
`5,307,266
`
`
`
`ITEMS/CONDITION,
`FOR RETRIEVAL
`
`EDITING
`FORMAT
`
`
`
`
`
`
`. FIG. 8
`
`"0
`
`(M,YINA.DI
`
`NAME
`
`MIMIPSI
`
`
`
`
`
`
`
`
`
`Page 12 of 26 .
`
`MINDGEEK EXHIBIT 1006
`
`Page 12 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 12 of 17
`
`5,307,266
`
`FIG. 7
`
`00 : DESIGNAnON OF CONDITION FOR RETRIEVAL
`
`(55355555555555.0555505.5.
`
`YM = MICROPROCESSOR
`
`'
`
`? DATE
`
`DATEOFPLBLICATATION
`(MONTH, EAR)
`
`?
`
`&(RETRIEVE : JANUARY.’87 (Y M I
`
`CI
`
`1 ENTRY:(YM)
`
`;
`
`02 : ENTRY:(MICROPROCESSOR
`
`_? MANUFACTURERNAME) ;
`
`03 : ENTRYKMICROPROOESSOR
`
`?ARCHITECTURE
`
`~70:
`
`-702
`
`—703
`
`'704
`
`-'
`705
`
`-706
`
`-707
`
`-708
`
`-709
`
`-7l0
`
`-7I I
`
`?(SELECT3CISC,RISC”; "7l2
`
`a4 : ENTRY:(MI_CROPROCESSOR
`
`? PERFORMANCE
`
`? OPERATION PEFORMANCE
`(MIPSH
`
`-7I3
`
`-7I4
`
`-7I5
`
`Page 13 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 13 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`U.S_. Patent
`
`Apr. 26, 1994
`
`Sheet 13 of 17
`
`5,307,266
`
`COLH:
`
`
`R
`ETRIEVAL INFORMA I
`STORAGE
`
`‘ sr
`RUCTURED
`Kama; r
`
`271
`
`K YWOR
`
`EBug???"
`
`% RUCTUF‘ED
`OCUMEN
`DATA BUFFER
`
`1 J
`
`,950
`__J..___
`COLLECTED
`
`"I
`0000 EN
`DATA BUFFER
`
`|
`
`FIG. 9A
`
`
`mmmF.WW“?!.wI.—UMMN—KDUU
`
`mPu.
`
`..9
`
`B
`
`
`
`umgazmm5.29.2.m:2....mum:oszuwm2952292.
`
`
`
`
`
`-————_——-
`
`Page 14 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 14 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`
`
`
`
`
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 14 of 17
`
`5,307,266
`
`FIG.
`
`0
`
`293222230.2.-4
`
`"0|
`
`”IO
`
`“2|
`
`RETRIEVAL INFW
`STORAGE
`
`m
`
`
`
` .2203»!
`i.8555.$8um..
`
`
`
`
`
`
`2mm:szzmm225220.22.
`
`
`
`
`
`2mm:92;.qu29.522922.
`
`til
`
`"70 I
`
`IIBO
`
`v
`
`.‘
`
`.“
`
`LLA IO ‘
`CO
`C
`ORRECTDV (F
`EDITED
`DATA
`AND
`DATA
`
`Page 15 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 15 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 15 of 17
`
`5,307,266
`
`FIG.
`
`II
`
`MICROPROCESSOR
`
`(D
`
`( KEYWORD- DOCUMENT )
`LINKAGE INFORMATION
`
`MANUFACTURER _______ " ‘ ‘ " "
`NAME
`fluff": 3 Hi
`
`SEMICONDUCIOR
`TECHNOLOGY
`HE GENERATICN — _.— {gang Na
`m ————— {cEésZJ 1L3.
`
`I— "‘ —' "' - 1
`ARCHITECTURE —————— -LRISC
`JILL‘l
`
`PERFORMANCE
`
`.
`[— ----- '—|
`2E£Fa§mmm —— "@1491; mi
`r____
`CHIP NAME
`—————— ‘15 40540 1m
`
`OATA 0F
`PUBLICATION IMONTNYEAR
`
`"
`.
`_ r‘
`"LMfiY- 87 1131
`
`[:3 STRUCTURED KEYWORDS
`
`l" " ‘T " "I KEYWORD CORRESPONDING
`I. ._ __ .. .I DOCUMENT CONSTITUENTS
`
`(DOCUMENT CONSTIUENT)
`
`LOCATION DISIGNATING
`LINKAGE INFORMATION
`
`Page 16 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 16 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 16 of 17
`
`5,307,266
`
`FIG.
`
`IZA
`
`FIG. IZB
`
`{I l50 COLLECTED EDITED DATA
`
`II§O COLLECTED DOCUMENT DATA
`
`
`
`
`DOCUMENT
`IDENTIFYING
`INFORMATION
`-—-————_- —-—-—
`
`
`
`
`
`Page 17 of 26
`
`MINDGEEK EXHIBIT 1006
`
`_____—__.-_.__—_._.-____.__—__._______~_-—_
`
`
`
`STRUCTURE!)
`KEY WORDS
`
`
`
`
`
`
`— K
`
`
`EYWORD-
`mCUMENT LINKNI
`
`INFORMATION
`
`
`
`gRRIORPQDNDING
`OOClfiENT
`CONSTITUENTS
`
`
`
`Page 17 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`US. Patent
`
`Apr. 26, 1994
`
`Sheet 17 of 17
`
`5,307,266
`
`FIG.
`
`3
`
`{2200 INFORMATION SENDER SYSTEM
`
`_aCMLSKDRO_E2E
`
`
`.0CNREE
`
`mum9mm!uao.0mmmwmmxmmm
`mmmHmm.E2mAMDnvmEMA
`
`smmrmwmmomu.mmn
`
`IMAM
`
`INFORMATION RECEIVER
`SYSTEM
`
`225222228r
`2922232225 2|2
`
`2203.52
`
`3525::
`
`
`
`mgr—23.2.20:3.2:22ou
`
`222923228
`
`mQEzEz.
`
`
`
`
`
`
`
`35E...58pm_mum...2.9mm52:52.383.23%2958......
`
`Page 18 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 18 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`
`
`
`
`1
`
`5,307,266
`
`INFORMATION PROCESSING SYSTEM AND
`METHOD FOR PROCESSING DOCUMENT BY
`USING STRUCTURED KEYWORDS
`
`BACKGROUND OF THE INVENTION
`. The present invention relates to information process-
`ing method and. system for automatically collecting
`desired information from a large amount of information.
`As an information processing system for acquisition
`of information, there is heretofore known an informa-
`tion retrieval system which is so arranged as to make
`access to a database or a knowledge base in which infor-
`mation has previously been stored or accumulated, as is
`described in IRA-60440443.
`Further, as the methods for retrieval of information,
`there are known a method in which the user designates
`items for retrieval in accordance with items of a table
`constituting a part of a database on the basis of informa-
`tion concerning a data storage structure adopted in the
`database and a method of simplifying designation of the
`items for retrieval by resorting to an associative re-
`trieval and a synonym processing. Besides, there has
`already been proposed a method according to which
`documents added with keywords are stored as they are
`for allowing extraction of document constituent parts
`for which coincidence is found with the keywords and
`a method according to which a stored document is
`retrieved when the keywords available for the retrieval
`coincide with synonyms detected from all the texts of
`that document.
`'
`
`The first mentioned prior art method is however
`disadvantageous in that information other than the pre-
`determined table items Can not be processed because of
`the tabular structure of the database. If the number of
`table items is increased in an effort to cope with the
`above problem, then the structure of the database be-
`comes complicated, involving difficulty in maintenance
`and management thereof.
`‘
`In thecase of the document retrieving methods in
`which keywords are used for retrieval, the requisite
`information as wanted by the user can be obtained only
`when the user having read the extracted document part
`can understand the content thereof. As a consequence,
`when information is to be collected for a specific item
`or matter from many documents, the burden to be borne
`by the user will increase significantly, giving rise to a
`problem.
`
`SUMMARY OF THE INVENTION
`
`It is therefore an object of the present invention to
`provide an information processing system which is ca-
`pable of automatically collecting necessary or de-
`manded information from a large amount of stored
`information.
`
`10
`
`15
`
`20
`
`25
`
`30
`
`35
`
`40
`
`45
`
`50
`
`55
`
`It is another object of the present invention to pro-
`vide an information processing system which is substan-
`tially immune to the shortcomings of the prior art
`method, such as difficulty in maintenance and manage-
`ment, and the serious burden imposed on the user and
`others upon automatic collection of information.
`A further object of the invention is to provide a data-
`base retrieving method and system capable of collecting
`automatically those data which meet the demand of the
`user by allowing extraction of the content of a docu-
`ment having meaning implied by keywords as desig-
`nated.
`
`65
`
`2
`In view of the above and other objects which will be
`apparent as description proceeds,
`there is provided
`according to an aspect of the present invention an infor-
`mation processing system comprising a combination of
`.a sender system and a receiver system, wherein the
`sender system includes a structured keyword dictionary
`containing keywords among which relations are sys-
`tematically structured, a unit for adding linkage infor-
`mation to constituent parts of a document as inputted
`which bear respective relations to the keywords se‘
`lected from the structured keyword dictionary and a
`unit for sending out retrieval information containing the
`structured keywords, the linkage information and the
`document data added with the linkage information,
`while the receiver system includes a retrieving unit
`responsive to reception of the retrieval information
`from the sender system for retrieving the document
`data by using the linkage information and the structured
`keywords.
`The structured keyword mentioned above may be
`implemented on a knowledge domain basis so as to have
`at least one of the links including a link to a keyword
`representing a higher rank concept, a link to a keyword
`representing a lower rank keyword and a link to a key-
`word representing a synonym, as is illustrated in FIGS.
`2 and 3 of the accompanying drawings.
`Correspondence between the keyword selected from
`the structured keywords and the corresponding constit-
`uent part of the document should be established in light
`of the structure of the structured keywords such that a
`keyword of concern representing an upper rank con-
`cept of the semantic content of a constituent part of the
`document is linked to that document part which thus
`represents the lower rank concept of the keyword of
`concern, as will be elucidated later on by reference to
`FIG. 5.
`
`Further, the retrieving unit for retrieving the docu.
`ment data with the aid of retrieval information and the
`structured keywords may be composed of a functional
`part for designating a keyword needed for the retrieval
`by consulting the structured keyword dictionary, 3
`storage for storing the structured keywords, a retriev-
`ing unit for retrieving document data by using the struc-
`tured keywords stored in the storage and the retrieval
`information, and a second storage for storing the result
`of the retrieval.
`
`In this conjunction, the second storage for storing the
`result of the retrieval may preferably be imparted with
`a function for editing the data resulting from the re-
`trieval
`in accordance with a designated or inputted
`format for editing and storing the result of editing so
`that automatic editing of the document data as retrieved
`can be performed.
`For retrieving the document data with the aid of
`retrieval information and the structured keyword desig-
`nated and stored for the retrieval, the constituent part of
`the document representing the lower rank concept of
`that keyword should preferably be extracted by using
`the linkage information.
`information for re-
`The retrieval information (i.e.
`trieval) may preferably include in addition to at least the
`structured keyword, the linkage information for indi-
`cating correspondence between the structure keyword
`and a corresponding constituent part of the document
`and document data added with the linkage information
`as described above, at least one of information resulting
`from copying or extraction of a constituent part of the
`document corresponding to the structured keyword,
`
`Page 19 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 19 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`5,307,266
`
`3
`document part location designating linkage information
`indicating the position or location of that constituent
`part in the document Or identification information for
`identifying an original document to which the docu-
`ment part belongs. In that case, the editing can corre-
`spondingly be simplified while facilitating confirmation
`and correction of the document data to another advan—
`tage.
`information supplied to the
`the retrieval
`Besides,
`receiver system from the sender system may be trans-
`mitted through a communication network, as illustrated
`in FIG. 1.
`,
`,
`Alternatively, the retrieval information to be trans-
`ferred as the output/input information may include at
`least information capable of being written in an informa-
`tion carrying medium and read therefrom, as in the case
`of an embodiment of the invention shown in FIG. 9.
`According to another aspect of the invention, there is
`further proposed for achieving the previously men-
`tioned objects an information processing method,
`wherein a process for establishing correspondences
`between the structured keywords and corresponding
`document data includes an input procedure for allowing
`a user to input a keyword representing subject matter of
`a document, a proCedure for extracting a keyword
`through matching processing of the keywords con-
`tained in a structured keyword dictionary with the
`document data, a linkage forming procedure for form-
`ing a link to a candidate for constituent parts (descrip-
`tion) in the document which corresponds to the key-
`word through syntax analysis (parsing) of the document
`data and a procedure for allowing the user to confirm
`the validity of the formed link or correct the link, as will
`hereinafter be described in detail by reference to FIG. 4.
`According to another aspect of the invention, there is
`provided an information processing method in which a
`process of collecting document data through retrieval
`for editing includes an editing format designating proce-
`dure, a procedure for designating items to be retrieved,
`conditions for the retrieval and the structured keywords
`and a data collecting procedure,
`wherein the editing format designating procedure
`includes a procedure for inputting an editing format and
`items to be retrieved and a procedure for designating
`fields for the conditions and the items for retrieval,
`the procedure for designating the items to be re-
`trieved, the condition for retrieval and the struCtured
`keyword includes a procedure for designating the re-
`trieval condition to be entered in the retrieval condition
`designating field of the editing format and a procedure
`for designating the retrieval items to be entered in the
`retrieval item designating field of the editing format,
`and
`
`the data collecting procedure includes a retrieval
`procedure for performing in response to the input of the
`retrieval information a matching processing between
`the structured keyword of the retrieval information and
`that of the retrieval condition, and a data editing proce-
`dure for editing the document data in accordance with
`the editing format, wherein the data editing procedure
`includes a matching processing procedure for perform-
`ing a matching processing between the structured key-
`word of the item for retrieval and that of the retrieval
`information, a document part extracting procedure for
`extracting a constituent part of document correspond-
`ing to the keyword by using document-keyword link-
`age information (i.e. linkage information for interlinking
`a document part and a keyword) contained in the re-
`
`4
`trieval information, and a storing procedure for storing
`the document part corresponding to the keyword in the
`retrieval item designating field in accordance with the
`editing format.
`The keywords used for the retrieval of information
`according to the present invention are conformed to the
`structured keyword dictionary in which the relations
`between or among the keywords are systematically
`structured on a knowledge-domain basis (i.e. in each
`domain of knowledge), wherein those keywords linked
`together in a standardized semantic relationships, such
`as the relationship among concepts of higher and lower
`ranks, are used in the respective relevant knowledge
`domain. Accordingly, there arises scarce differences of
`individuals in understanding the keywords because the
`semantic relations between the keywords are easy to
`understand distinctiVely.
`In conjunction with establishment of correspon-
`dences between the the constituent parts of a document
`and the structured keyword, it is noted that those con-
`stituent parts of the document which should semianti-
`cally belong to a same keyword may assume various
`meanings. Under the circumstances, it is taught accord-
`ing to the invention to previously establish correspon-
`dences between the keywords selected from the struc-
`tured keyword dictionary and the constituent parts of
`the document by using the linkage information. Conse-
`quently, according to the invention, the user can get rid
`of trouble of handling unnecessarily lots of data for the
`retrieval. Furthermore, difficulty in maintenance and
`management can significantly be mitigated by virtue of
`establishment of correspondences between the key-
`words and the constituent parts of the document as well
`as owing to utilization of the standardized structured
`keyword dictionary as a basis.
`As will be seen from the foregoing, because the link-
`age inforrnation for establishing correspondences be-
`tween the keywords and the constituent parts of a docu-
`ment which semantically correspond to the above key-
`words are added to the constituent parts of a document,
`there can easily be extracted the constituent parts of the
`document, such as words, phrases/clauses and sen-
`tences which semantically correspond to the keywords
`designated by the user for the item for which he or she
`wants to acquire information.
`Additionally, by storing internally of the information
`processing system the keywords designated by the user
`for the item for which information is to be acquired,
`extracting the constituent parts of the document corre-
`sponding semantically to the designated keywords from
`those documents supplied by way of an information
`network or other various media and storing the ex-
`tracted document constituent parts, there can be real-
`ized an automatic data collection.
`By designating the editing format for editing the data
`collected, the data desired by the user can be progres-
`sively and increasingly stored and accumulated. By way
`of example, as the editing format, a table framework
`may be provided, whereon the keywords may be desig-
`nated at locations corresponding to the items of the
`table. When fresh document data are supplied to the
`information processing system according 'to the inven-
`tion,
`the document constituent parts corresponding
`semantically to the individual keywords mentioned
`above can be extracted by using the linkage information
`and then can be written in the table at corresponding
`columns. By repeating this procedure, the table can be
`autonomously and increasingly expanded.
`
`10
`
`15
`
`20
`
`25
`
`30
`
`35
`
`45
`
`50
`
`'
`
`55
`
`65
`
`Page 20 of 26
`
`MINDGEEK EXHIBIT 1006
`
`Page 20 of 26
`
`MINDGEEK EXHIBIT 1006
`
`
`
`5
`Finally, by attaching to the extracted document con-
`stituent parts the linkage infOrmation indicating the
`locations in a document from which the constituent
`
`parts have been extracted or by attaching the identifica-
`tion number of the document to which the extracted
`' document constituent parts have belonged, the user can
`straightforwardly read out the relevant parts of the
`document and easily confirm whether or not the corre-
`spondences between the keywords and the document
`parts are correct, whereon an error, if any, can be cor-
`rected.
`
`10
`
`BRIEF DESCRIPTION OF THE DRAWINGS
`
`FIG. 1 is a functional block diagram showing a docu-
`ment processing system according to an embodiment of 15
`the invention;
`FIG. 2 is a view for illustrating an example of a struc-
`ture of a structured keyword employed according to
`the teaching of the invention;
`FIG. 3 is a view for illustrating, by way of example
`only, linkages among structured keywords;
`FIGS. 4A to 4I are flow charts for illustrating exem-
`plary procedures involved in operation of the document
`processing carried out by the system shown in FIG. 1;
`FIGS. 5A to 5D and FIGS. 6 to 8 are views for illus-
`
`20
`
`25
`
`trating, by way of example, in What manner a document
`is processed according to an embodiment of the inven—
`tion by using the system shown in FIG. 1;
`FIGS. 9A and 9B and FIG. 10 are functional block
`diagrams showing document processing systems ac—
`cording to further embodiments of the invention, re-
`spectively;
`FIG. 11 and FIGS. 12A and 12B are views for illus-
`
`trating, by way of example only, operations of the docu-
`ment processing systems according to the further em-
`bodiments of the invention; and
`FIG. 13 is a functional block diagram showing yet
`another embodiment of the document processing sys-
`tem according to the invention.
`
`DESCRIPTION OF THE PREFERRED
`EMBODIMENTS
`
`Now, the present invention will be described in detail
`in conjunction with preferred or exemplary embodi-
`ments thereof by reference to the accompanying draw-
`mgs.
`FIG. 1 shows in a functional block diagram a general
`arrangement of a document processing system accord-
`ing to a first embodiment of the invention. In this figure,
`a reference numeral 200 denotes generally an informa-
`tion sender system adapted to generate retrieval infor-
`mation (i.e. information for retrieval) to be transmitted
`through a communication network 500 to an informa-
`tion receiver system which is generally denoted by a
`numeral 201 and arranged to perform processing for
`retrieval and editing by utilizing the retrieval informa-
`tion as supplied. Thus, it can be said that the document
`processing system illustrated in FIG. 1 is implemented
`in the form of a retrieval information transmission/-
`reception system, so to say. In the information sender
`system 200, a reference numeral 1 designates a docu-
`ment data storage, and a numeral 2 denotes a keyword
`dictionary storage. Any given one of the keywords
`contained in the keyword dictionary 2 is so structured
`as to have at least one link, including a link leading to a
`keyword representing an upper rank concept of the
`given keyword, a link to a keyword representing a
`lower rank concept and a link to a keyword having
`
`30
`
`35
`
`45
`
`50
`
`55
`
`65
`
`5,307,266
`
`6
`semantically same meaning (i.e. synonym) as the given
`keyword on the basis of the field or domain of the
`knowledge to which the given keyword belongs (i.e. on
`a knowledge-domain basis), as is illustrated in FIG. 2.
`Thus, the keyword dictionary 2 may be termed as a
`structured keyword dictionary containing keywords
`which are systematically structured by means of inter-
`keyword linkages or relations established as mentioned
`above. This dictionary 2 will hereinafter be termed the
`structured keyword dictionary, while the keywords
`related to one another by the links will be referred to as
`the structured keywords. Turning back to FIG. 1, a
`reference numeral 3 denotes a unit for selecting from
`the structured keyword dictionary 2 the structured
`keywords in the domain to which the subject matter of
`a given document contained in the document data stor-
`age 1 relates, for the purpose of adding to the selected
`document the structured keywords and linkage infor-
`mation which is required for establishing correspon-
`dence between the structured keywords and relevant
`constituent parts of the selected document. A numeral
`31 denotes a user interface for actually establishing the
`correspondences between the structured keywords and
`the constituent parts of the document in accordance
`with the output information of the unit 3. Further, a
`retrieval information storage unit generally denoted by
`10 serves for storing the retrieval information and in-
`cludes a buffer storage 12 for storing the structured
`keywords (also designated by 12) as selected, a buffer 11
`for storing the link-affixed document data (11) added
`with the'information of the linkages between the con-
`stituent parts of the document and the structured key-
`words (hereinafter simply referred to as the document-
`keyword linkage information), and a buffer 13 for stor-
`ing the document-keyword linkage information (13)
`itself. A reference numeral 21 denotes an interface
`
`through which the retrieval information 10 is sent out
`onto the communication network 500.
`In the information receiver system 201 which can be
`connected to the information sender system 200, a refer-
`ence numeral 121 denotes a communication interface
`through which the retrieval
`information described
`above is received, and a numeral 110 denotes a storage
`unit for storing the retrieval information as received.
`This storage unit 110 also includes a buffer 112 for stor-
`ing the structured keywords (also designated by 112), a
`buffer 111 for storing the link-affixed document data
`(111) and a buffer 113 for storing the document-
`keyword linkage information, as in the case of the stor-
`age unit 10 incorporated in the information sender sys-
`tem 200. At least one receiver system connected to the
`sender systems can receive the retrieval information 10
`sent out onto the communication network 500 through
`the communication interface 121, w