throbber
Declaration of Rachel J. Watters on Authentication of Publication
`
`I, Rachel J. Watters, am a librarian, and the Director of Wisconsin TechSearch
`
`("WTS"), located at 728 State Street, Madison, Wisconsin, 53706. WTS is an
`
`interlibrary loan department at the University of Wisconsin-Madison. I have worked as
`
`a librarian at the University of Wisconsin library system since 1998. I have been
`
`employed at WTS since 2002, first as a librarian and, beginning in 2011, as the Director.
`
`Through the course of my employment, I have become well informed about the
`
`operations of the University of Wisconsin library system, which follows standard library
`
`practices.
`
`This Declaration relates to the dates of receipt and availability of the following:
`
`Beckwith, R. and Miller, G.A. (1990) Implementing a lexical
`network. International Journal of Lexicography, 3(4), 302-312.
`
`Standard operating procedures for materials at the University of Wisconsin(cid:173)
`
`Madison Libraries. When an issue was received by the Library, it would be checked in,
`
`stamped with the date of receipt, added to library holdings records, and made available
`
`to readers as soon after its arrival as possible. The procedure normally took a few days
`
`or at most 2 to 3 weeks.
`
`Exhibit A to this Declaration is true and accurate copy of the journal issue cover
`
`with library date stamp of International Journal of Lexicography (1990), from the
`
`University of Wisconsin-Madison Library collection. Exhibit A also includes an
`
`excerpt of pages 302 to 312 of that issue, showing the article entitled Implementing a
`
`1
`
`Page 1 of 17
`
`GOOGLE EXHIBIT 1028
`
`

`

`Declaration of Rachel J. Watters on Authentication of Publication
`
`lexical network (1990). Based on this information, the date stamp on the journal cover
`
`indicates Implementing a lexical network (1990) was received by the University of
`
`Wisconsin-Madison Libraries on January 4, 1991.
`
`Based on the information in Exhibit A, it is clear that the issue was received by
`
`the library on or before January 4, 1991 , catalogued and available to library patrons
`
`within a few days or at most 2 to 3 weeks after January 4, 1991.
`
`I declare that all statements made herein of my own knowledge are true and that
`
`all statements made on information and belief are believed to be true; and further that
`
`these statements were made with the knowledge that willful false statements and the like
`
`so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18
`
`of the United States Code.
`
`Date: March 3, 2020
`
`Wisconsin TechSearch
`Memorial Library
`728 State Street
`Madison, Wisconsin 53706
`
`Director
`
`2
`
`Page 2 of 17
`
`

`

`ISSN 0950-3846
`
`International Journal of
`
`Lexicography
`
`Volume 3 Number 4
`Winter 1990
`
`Page 3 of 17
`
`

`

`International Journal of
`Lexicography
`
`Editor: Robert Ilson (58 Antrim Mansions, Antrim Road, London NW3 4XU, UK)
`Editorial Board
`European Association for Lexicography (EURALEX) - Executive Board:
`M Alvar Ezquerra, Ju D Apresjan, B T Atkins, H Bejoint, R R K Hartmann,
`F E Knowles, 0 Norling-Christensen, M Snell-Hornby, S Ter-Minasova, A Zampolli ,
`R F Ilson (co-opted)
`Dictionary Society of North America (DSNA)-Executive Board:
`J Algeo, W Frawley, D B Guralnik , V G McDavid , L T Milic, R J Steiner, L K
`Vandersall
`
`J Aitchison (London)
`J M Channell (Nottingham)
`COBUILD (Birmingham)
`Cordell Collection of Dictionaries
`(Terre Haute)- David E Vancil
`A P Cowie (Leeds)
`D Crystal (Bangor)
`Dictionaries Editorial Committee
`(OUP)- Lesley Burnett
`Dictionary Research Centre
`(Exeter)- R R K Hartmann
`F Dubois Charlier (Paris)
`Erlanger Zentrum fiir
`W orterbuch forsch ung (Erl a ngen(cid:173)
`N iirnberg)- F J Hausmann
`C J Fillmore (Berkeley)
`
`W Frawley (Newark, Delaware)
`Y Ikegami (Tokyo)
`Istituto di Linguistica
`Computazionale (Pisa)(cid:173)
`A Zampolli
`LADS (Liege)
`T McArthur (Cambridge)
`W Martin (Amsterdam)
`Igor Mel 'cuk (Montreal)
`MLA Discussion Group on Lexicography
`A W Read (New York)
`J Rey-Debove (Paris)
`SILEX (Lille)- P Corbin
`L Urdang (Old Lyme)
`H E Wiegand (Heidelberg)
`L Zgusta (Urbana-Champaign)
`
`Subscriptions
`Published quarterly, a t an annual subscription of £35 in the UK, US$68 in N. America, £42 elsewhere;
`single issues £10.50 in the UK, US$20 in N. America, £12.50 elsewhere. Prices include postage by
`surface mail or, for subscribers in the USA, Canada , India, Japan, Australia and New Zealand, by Air
`Speeded Post. Airmail rates are available on request.
`Contributors to the Journal are entitled to one year's subscription at half price.
`
`Order information
`Payment is required with all orders and subscriptions are accepted and entered by the volume. Payment
`may be made by cheque or Eurocheque (payable to Oxford University Press), National Girobank
`(account 500 1056), credit card (Access, Visa, American Express, Diners Club), or UNESCO coupons.
`Please send orders and requests for sample copies to the Journals Subscriptions Department , Oxford
`University Press, Pinkhill House, Southfield Road, Eynsham ox8 IJJ, UK. Telex 83147 OXPRESG.
`
`Advertising
`Advertisements are welcomed and rates will be quoted on request. Enquiries should be addressed to the
`Journals Advertising Coordinator, Journals Production Department, Oxford University Press, Pinkhill
`House, Southfield Road , Eynsham ox8 IJJ , UK.
`Authorization to photocopy items for internal or personal use, or the internal or personal use of
`specific clients, is granted by Oxford University Press for users in the USA registered with the
`Copyright Clearance Center (CCC) Transactional Reporting Service, provided that the base fee of
`$3.00 per copy is paid directly to CCC, 27 Congress Street, Salem, MA 01970. For those organizations
`that have been granted a photocopy licence by CCC a separate system of payment has been arranged.
`The fee code for users of the Transactional Reporting Service is: 0950-3846/90 $3.00.
`© 1990 Oxford University Press
`
`Page 4 of 17
`
`

`

`International Journal
`of Lexicography
`
`Volume 3 Number 4 Winter 1990
`
`SPECIAL ISSUE
`
`WordNet:
`An On-Line Lexical Database
`
`GUEST EDITOR
`GEORGE A. MILLER
`
`Page 5 of 17
`
`

`

`International Journal of
`Lexicography
`Volume 3 Number 4 Winter 1990
`
`Contents
`
`Introduction to WordNet: An On-line Lexical Database.
`George A. Miller, Richard Beckwith, Christiane Fellbaum,
`Derek Gross and Katherine J. Miller
`
`Nouns in WordNet: A Lexical Inheritance System.
`George A. Miller
`
`Adjectives in WordNet. Derek Gross and Katherine J. Miller
`
`English Verbs as a Semantic Net. Christiane Fellbaum
`
`Implementing a Lexical Network. Richard Beckwith and
`George A. Miller
`
`The EURALEX Bulletin
`
`235
`
`245
`
`265
`
`278
`
`302
`
`Oxford University Press
`
`Page 6 of 17
`
`

`

`Implementing a Lexical Network*
`Richard Beckwith and George A. Miller, Princeton University
`
`Abstract
`
`The computer implementation of the lexical database described in the preceding papers
`falls naturally into two parts. WordNet is the lexical database; LexPert is the suite of
`software tools used to build and access the database. In addition to the lexical
`information contained in the source files written by the lexicographers, WordNet also
`contains an index of the familiarity of each word.
`LexPert includes routines for verifying the syntax of the lexical source files, for
`converting the source files into a database, for adding familiarity indices to the database,
`for searching the database, for dealing with inflectional morphology, and for displaying
`information for the user. The specifications for these routines are described.
`
`Lexicographers must be concerned with the presentation as well as the content
`of their work, and this concern is heightened when presentation moves from the
`printed page to the computer monitor. Printed dictionaries have become
`relatively standardized through many years of publishing (Vizetelly, 1915);
`expectations for electronic lexicons are still up for grabs. Indeed, computer
`technology itself is evolving rapidly; an indefinite variety of ways to present
`lexical information is possible with this new technology, and the advantages
`and disadvantages of many possible alternatives are still matters for experimen(cid:173)
`tation and debate. Given this degree of uncertainty, manner of presentation
`must be a central concern for the electronic lexicographer.
`WordNet is a pioneering excursion into this new medium. Considerable
`attention has been devoted to making it useful and convenient, but the
`solutions described here are unlikely to be the final word on these matters. It is
`hoped that readers will not merely note the shortcomings of this work, but will
`also be inspired to make improvements on it.
`One's first impression of WordNet is likely to be that it is an on-line
`thesaurus. It is true that sets of synonyms are basic building blocks, and with
`nothing more than synsets the system would have all the power of a thesaurus.
`When short glosses are added to the synsets, it resembles an on-line dictionary
`that has been supplemented with synonyms for cross referencing (Calzolari,
`1988). But as readers of the preceding articles will appreciate, WordNet
`includes much more information than that. In an attempt to model the lexical
`
`* Contributions to the computer programs in LexPert have been made by Marie Bienkowski,
`George Collier, Michael Colon, David Crabb, Andrew Gomory, Derek Gross, Brian Gustafson,
`Yana Kane, Benjamin Martin, Antonio Romero, Daniel Teibel, and Benjamin Wilkes. 'Sun
`Workstation' and 'Unix' are trademarks of Sun Microsystems and AT&T Bell Laboratories,
`respectively.
`
`International Journal of Lexicography, Vol. 3 No. 4
`
`© 1990 Oxford University Press
`0950-3846/90 $3.00
`
`Page 7 of 17
`
`

`

`Implementing a Lexical Network 303
`
`knowledge of a native speaker of English, WordNet has been given detailed
`information about relations between word forms and synsets. How this
`relational structure should be presented to the user raises questions that outrun
`the experience of conventional lexicography.
`In developing this on-line database, it has been convenient (even, perhaps,
`necessary) to divide the work into two interdependent tasks which bear a vague
`similarity to the traditional tasks of writing and printing a dictionary. One task
`was to write the source files that contain the basic lexical data described in the
`preceding articles; the contents of those files are the lexical substance of
`WordNet. The second task was to create a set of computer programs that
`would accept the source files and do all the work leading ultimately to the
`generation of a display for the user; those programs, collectively, are called
`LexPert. The purpose of this article is to provide a general description of the
`functions performed by the LexPert programs.
`Both W ordNet and LexPert were developed on Sun 3 and Sun 4 worksta(cid:173)
`tions under the Unix programming environment. It is planned to develop
`versions that will run on less powerful computers, but they are not yet available
`as this is written.
`
`Source Files
`
`The source files are the files written by lexicographers. They are the product of a
`detailed relational analysis of lexical semantics: a variety of lexical and semantic
`relations are used to represent the organization of lexical knowledge. Two kinds
`of building blocks are distinguished in the source files: word forms and word
`meanings. Word forms are represented in their familiar orthography; word
`meanings are represented by synsets - lists of synonymous word forms. Two
`kinds of relations are recognized, lexical and semantic. Lexical relations hold
`between word forms; semantic relations hold between word meanings, or
`between word forms and word meanings. The contents of the source files have
`been described in some detail in the preceding articles.
`The following strings in the source files are treated as word forms: (1) an
`orthographic representation of a word, (2) an orthographic representation
`followed by an integer for disambiguation, (3) an orthographic representation
`followed by a usage note, as for postnominal adjectives, or (4) an expression in
`square brackets, used to code word:word relations. Since the information
`associated with each word form necessarily includes the synset it belongs to, the
`word forms are semantically unambiguous and it is possible to associate with
`them such additional information as any direct antonyms they may have, the
`kinds of sentence frames they can enter into, the sets of inflections they can
`take, or usage notes.
`Word meanings are represented by synsets. Strings in the source files that
`conform to the following syntactic rules are treated as synsets:
`
`(1) Each synset begins with a curly bracket, '{'
`(2) Each synset is terminated with a curly bracket, '}'
`(3) Each element within a synset consists of
`
`Page 8 of 17
`
`

`

`304 Richard Beckwith and George A. Miller
`
`(a) to code semantic relations: a word form, optionally preceded by a
`filename to indicate a word in a different file, followed by a comma,
`followed either by a space or a relational pointer and a space, or
`(b) to code lexical relations: an expression inside square brackets, '[' and ']',
`consisting of two or more elements of type (a), or
`(c) a gloss inside parentheses, '(' and ')'.
`This syntax is not inherently meaningful but adherence to it eases the
`computational burden of processing the source files. The information associ(cid:173)
`ated with each word meaning in the source files includes its semantic relations
`to other meanings. Table 1 collects those relations and their pointers in a single
`place; for elaboration, consult the preceding articles.
`In addition to the source files for nouns, verbs, and adjectives, WordNet also
`includes information about the familiarity of each word in the source files.
`
`Index of Familiarity
`One of the best known and most important psycholinguistic facts about the
`mental lexicon is that some words are much more fami liar than others. The
`familiarity of a word is known to influence a wide range of performance
`variables: speed of reading, speed of comprehension, ease of recall, probability
`of use. The effects are so ubiquitous that experimenters who hope to study
`anything else must take great pains to equate the words they use for familiarity .
`To ignore this variable in a lexical database that is supposed to reflect
`psycholinguistic principles would be unthinkable.
`In order to incorporate differences in familiarity into WordNet, a syntacti(cid:173)
`cally tagged index of familiarity is associated with each word form. This index
`does not reflect all of the consequences of differences of familiarity - some
`theorists would ask for strength indices associated with each relation - but
`accurate information on all of the consequences is not easily obtained. The
`present index is a first step.
`Frequency of use is usually assumed to be the best indicator of familiarity.
`The closed class words that play an important syntactic role are the most
`frequently used, of course, but even within the open classes of words there are
`large differences in frequency of occurrence that are assumed to correlate with -
`
`Table 1. The relations, R, and their pointers, p, in WordNet (x p-y iff
`Rp(x)= y).
`
`Nouns
`
`Synonym
`Antonym
`Hyponym
`Hypernym
`Meronym
`Holonym
`
`{ .. }
`[.!)
`
`@
`#
`%
`
`Verbs
`
`Adjectives
`
`Synonym
`Antonym
`Similar
`
`{ .. }
`(.!)
`&
`
`Synonym
`Antonym
`Troponym
`Hypernym
`Presupposition
`Inclusion
`Cause
`
`{ .. }
`[.!)
`
`@
`*
`*
`>
`
`Page 9 of 17
`
`

`

`Implementing a Lexical Network 305
`
`or to explain - the large differences in familiarity. The frequency data that are
`readily available in the technical literature, however, are inadequate for a
`database as extensive as WordNet. Thorndike and Lorge (1944) published data
`based on a count of some 5,000,000 running words of text, but they reported
`their results only for the 30,000 most frequent words. Moreover, they defined a
`'word' as any string of letters between successive spaces, so their counts for
`homographs are untrustworthy; there is no way to tell , for example, how often
`lead occurred as a noun and how often as a verb. Francis and Kucera (1982) tag
`words for their syntactic category, but they report results for only 1,014,000
`running words of text - or 50,400 word types, including many proper names -
`which is not a large enough sample to yield reliable counts for infrequently used
`words. (A comfortable rate of speaking is about 120 words/minute, so that
`1,000,000 words corresponds to 140 hours, or about two weeks of normal
`exposure to language.)
`Fortunately, an alternative indicator of familiarity is available. It has been
`known at least since Zipf ( 1945) that frequency of occurrence and polysemy are
`correlated. That is to say, on the average, the more frequently a word is used the
`more different meanings it will have in a dictionary. An intriguing finding in
`psycholinguistics (Jastrezembski, 198 I) is that polysemy seems to predict lexical
`access times as well as frequency does. Indeed, if the effect of frequency is
`controlled by choosing words of equivalent frequencies, polysemy is still a
`significant predictor of lexical decision times.
`Instead of using frequency of occurrence as an index of familiarity, therefore,
`WordNet uses polysemy. This measure can be determined from an on-line
`dictionary. If an index value of 0 is assigned to words that do not appear in the
`dictionary, and if values of I or more are assigned according to the number of
`senses the word has, then an index value can be made available for every word
`in every syntactic category. Associated with every word in WordNet, therefore,
`there is an integer that represents a count ( of the Collins English Dictionary) of
`the number of senses that word has when it is used as a noun, verb, or adjective.
`A simple example of how the familiarity index might be used is shown in
`Table 2. If, say, the superordinates of bronco are requested, WordNet can
`respond with the sequence of hypernyms shown in Table 2. Now, if all the terms
`
`Table 2. Hypernyms of bronco and their familiarity indices.
`
`Word
`
`Polysemy
`
`bronco
`@ -+horse
`@ -+equid
`@ -+odd-toed ungulate
`@ -+ herbivore
`@ -+mammal
`@ -+vertebrate
`@ -+anima l
`@ -+ organism
`
`14
`0
`0
`
`4
`2
`
`Page 10 of 17
`
`

`

`306 Richard Beckwith and George A. Miller
`
`with a familiarity index (polysemy count) of O or 1 are omitted, which are
`primarily technical terms, the hypernyms of bronco include simply: bronco
`@ -+horse@ -+animal@ -+organism. This shortened chain is much closer to
`what a layman would expect. The index of familiarity should be useful,
`therefore, when making suggestions for changes in wording. If a user requests a
`more familiar word, LexPert can search through WordNet for one with a
`higher index.
`WordNet would be a better simulation of human semantic memory if a
`familiarity index could be assigned to form-meaning pairs, rather than to word
`forms. The noun dog, for example, is used far more often with the meaning
`{dog, canine, pooch} than with the meaning {dog, detent, pawl, click}, yet both
`are presently assigned the same index, 16.
`
`Grinder
`
`In order to facilitate machine retrieval of information in WordNet, several
`stages of processing are applied to the source files. The program that does this
`processing is Grinder.
`The first step is to check that synsets in the source files are well-formed. This
`check is performed by a component routine, Verify, which is available for
`lexicographers to use prior to submitting their source files to Grinder. Verify
`also looks for two kinds of pointer errors in the noun and verb files: either a
`pointer is vacuous because it designates a non-existent synset, or it is
`ambiguous because it designates more than one synset. These errors must be
`corrected before a file can be incorporated into the database.
`Since each syntactic category has a different semantic structure, Grinder's
`next step depends on the syntactic category being processed. For nouns,
`Grinder combines the 25 source files into two: an index file and a data file. The
`index file is an alphabetical list of all the nouns in WordNet, along with the
`absolute addresses of their synsets in the data file and any information that is
`associated with undisambiguated word forms (e.g. , the index of familiarity).
`The data file contains the synsets, which in turn contain their lexical and
`semantic pointers. Grinder inserts any converse pointers that were omitted by
`the lexicographer (e.g., it inserts pointers to hyponyms), and then substitutes
`the addresses of the appropriate synsets in place of the lexicographers' semantic
`pointers. Each address bears a label that codes the kind of relation. If the
`relation is lexical, the address is for a specific word in a specific synset; if the
`relation is semantic, the address is for a synset. The synsets also contain the
`definitional gloss, if there is one in the source file.
`The source files for verbs are processed in much the same way as the noun
`files: they are combined into an index file and a data file. Associated with each
`verb form, however, is a sentence frame indicating something about the
`syntactic contexts in which the verb can be used - different verbs in the same
`synset may have different privileges of occurrence.
`Adjectives are also converted into an index and a data file, but adjectives
`demand special treatment. The two adjective source files are very different from
`each other. Pertainyms cannot be entered into the database until the addresses
`
`Page 11 of 17
`
`

`

`Implementing a Lexical Network 307
`
`of the pertinent noun or verb forms are determined. And, although the clusters
`of ascriptive adjectives are related only by antonymy and similarity, antonymy
`is a lexical relation between adjective forms, whereas similarity is a special
`relation between adjective meanings and adjective forms.
`Once the database is created by Grinder, there are two types of processed
`files: index files and processed versions of the original source files. A pair of
`these processed files exists for each syntactic category, resulting in a total of six
`such files.
`Grinder evolved progressively as work on WordNet proceeded, so the
`present account is, in effect, shooting at a moving target. Grinder was first
`written in Lisp, then C, and is now written in a combination of C and the Unix
`utility programs lex and yacc, thus enhancing the speed and portability of
`LexPert generally, and Grinder in particular.
`
`Retrieving Lexical Information
`
`In order to give a user access to information in the database, an interface is
`required. Interfaces enable end users to retrieve the lexical data and display it
`via a window-based tool or the command line. When considering the role of the
`interface, it is important to recognize the difference between a printed diction(cid:173)
`ary and a lexical database. LexPert creates its responses to a user's requests on
`the fly . Unlike an on-line version of a printed dictionary, where information is
`stored in a fixed format and displayed on demand, WordNet's information is
`stored in a format that would be meaningless to an ordinary reader. The
`interface provides a user with a variety of ways to retrieve and display lexical
`information. Different interfaces can be created to serve the purposes of
`different users, but all of them will draw on the same underlying lexical
`database.
`In the SunView interface, which is used on the Sun workstations, there is a
`window that provides a set of menus that make querying the database easy and
`intuitive. Users must initially indicate the word that they want to query -
`the
`target or seed word for the search - either by highlighting it with the mouse or
`entering it at a prompt. After the word is selected, buttons appear that indicate
`the syntactic categories WordNet has assigned to that word. The user clicks on
`one of the buttons, causing an appropriate menu to appear. This menu contains
`only choices that are relevant to the specific syntactic category that was
`indicated by the user; options that are relevant for that syntactic category but
`unavailable for that word are dimmed. Once the menu appears, the user selects
`one of the menu items, which causes the program to search for that information
`and display it in the window. The user can cancel a query by releasing the
`mouse button outside the menu area.
`The first step that the program takes in any search is to locate the word in the
`index files. If it fails to find an entry for the word, the program will search for a
`morphological variant of the word (see below). If it fails in this second attempt,
`an error message is printed in the window or (when using the command-line
`version) to the standard output (typically the terminal screen) indicating that
`no information is available for this word.
`
`Page 12 of 17
`
`

`

`308 Richard Beckwith and George A. Miller
`
`The search process is the same regardless of the type of search requested. The
`first step is to retrieve the index entry located in the appropriate index file - this
`will contain a list of addresses of the synsets in the data file in which the word
`appears. Then each of these synsets in the data file is searched for the requested
`information, which is then retrieved and formatted for output. Search is
`complicated by the fact that each synset containing the seed word also contains
`pointers to other synsets in the data file that may need to be retrieved and
`displayed, depending on the search type. For example, each synset in the
`hypernymic pathway points to the next synset in the hierarchy. If a user
`requests a recursive search on hypernyms (see Table 2), a hierarchical retrieval
`process is repeated until a synset is encountered that contains no further
`hypernymic pointers.
`All searches, regardless of the information they are searching for , are
`accomplished by a small set of flexible functions that were written in C + + .
`Table 3 contains a brief description of the different functions, which are used in
`various combinations to accomplish all of the searches available in WordNet.
`(Converse search primitives are available for presupposition, troponymy,
`cause, and inclusion, just as for hypernymy/hyponymy and meronymy/holo(cid:173)
`nymy, but appropriate names for those semantic relations have not yet been
`selected.) In order to maximize their flexibility, each function takes a small
`number of arguments that control its behavior. Thus, the same function may be
`used in recursive and non-recursive searches and in the retrieval of many
`different types of semantic information.
`The search code is a program built out of the search primitives in Table 3.
`Because LexPert uses a library of search primitives, it is possible to add new
`types of searches quite easily. The program just described has been developed in
`order to construct and test WordNet and contains a special set of searches that
`have been found useful for that purpose. That particular program does not
`exhaust the search capacity of the system; other workers might design different
`interfaces using a different search code. The present description will be limited
`to a few particular examples, simply to illustrate how search primitives can be
`combined to perform different searches.
`The simplest search is the search for synonyms, which merely calls the
`synonymy search primitive. The search is restricted to a single syntactic
`category and returns each of the synsets that the seed word is in. For example, if
`the user searches for synonyms of the noun dog, the search program returns two
`synsets: {dog, click, detent, pawl} and {pooch, dog, canine}. If the user searches
`for synonyms of the verb dog, the program returns one synset: {track, go after,
`dog, tag, trail, chase}.
`The antonym search involves more than the antonymy search primitive, in
`that it returns more than direct antonyms. Antonymy is a lexical relation, so the
`antonymy search primitive returns only words that are direct antonyms.
`LexPert's search code for antonyms reports both direct and indirect antonyms,
`so it must combine the antonymy search primitive with the synonymy search
`primitive or, in the case of adjectives, with the search primitive for similars. For
`example, if the user asks for antonyms of the noun weakness, LexPert returns
`strength. If the user asks for the antonyms of the verb weaken, LexPert returns
`
`Page 13 of 17
`
`

`

`'D
`0
`<.,.)
`
`0 .... ;,;-
`?
`~ z 0
`)< ;s·
`r-0
`
`5·
`;?.
`0 a 0
`3
`
`'O
`
`Pl
`(IQ
`
`name of source file
`definition with synsets
`sense count of seed
`returns true if seed is adjective cluster head
`restrictions on relative location of head noun
`similars (near synonyms) of seed's cluster head
`frames that are permissible with seed
`
`verbs that seed properly temporally includes
`verbs that seed causes
`verbs that are a manner of seed
`verbs presupposed by seed
`things that seed is part of
`parts of seed
`children of seed
`parents of seed
`
`N V A
`N V A
`N V A
`A
`A
`A
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`-
`
`V
`
`V
`V
`V
`V
`
`-
`
`-
`
`-
`
`-
`
`-
`
`N
`N
`N V
`N V
`
`-
`
`-
`
`source
`gloss
`familiarity
`head
`usage
`similar
`frame
`
`inclusion
`cause
`troponymy
`presupposition
`holonymy
`meronymy
`hyponymy
`hypernymy
`
`information
`Other
`
`relations
`Synset
`
`antonyms of seed
`synonyms of seed
`
`N V A
`N V A
`
`antonymy
`synonymy
`
`relations
`Word
`
`Information returned
`
`Category
`
`Function
`
`Table 3. Summary of primitive search commands.
`
`Page 14 of 17
`
`

`

`310 Richard Beckwith and George A. Miller
`
`strengthen, but then, because weaken is a synonym of de-escalate, the program
`also returns escalate. If the user asks for antonyms of the adjective weak,
`LexPert returns not only strong, but 15 indirect antonyms.
`Meronymy is coded only in the noun file, so part-oriented searches work only
`with nouns. The meronymy search primitive returns the synsets that are coded
`as parts in the synset containing the seed word. But meronymy is iQherited. In
`order to give a user all of the parts of the seed, not only the parts codea with the
`seed but all parts coded with hypernyms of the seed must be listed. LexPert's
`search code for this request must combine the meronymy search primitive with a
`recursive search for hypernyms. Moreover, since parts can have parts, the
`search must also look for meronyms of meronyms.
`Three options to the search code that can be invoked for words in any
`syntactic category are gloss, fami liarity, and source file. These options return
`the lexicographers' definitions of synsets, an index of the relative polysemy of
`the seed word, and the names of the WordNet raw source files that contain the
`seed word. The gloss and familiarity search primitives are meant to be used in
`combination with other searches. If a user asks for definitions, LexPert will
`return a definition with each synset that has one. If a user asks for familiarity,
`LexPert will return a sense count with every word that has one. The source file
`option can be used alone; it will report in which raw file the lexicographer
`included the seed word.
`
`Morphy
`
`Dictionaries ordinarily hang their information on head words, without separate
`listings for inflectional (or many derivational) forms of the word. In a printed
`dictionary, that practice causes little trouble; with a few highly irregular
`exceptions, morphologically related words are generally similar enough in
`spelling to the reference form that the eye, aided by boldface type, quickly picks
`them up. In an electronic dictionary, on the other hand, when an inflected form
`is requested, the response is likely to be a frustrating announcement that the
`word is not in the database. Users are required to know the reference form of
`every word they want to look up. In order to spare users the trouble of affix
`stripping, therefore, LexPert includes Morphy, a large program that gives
`WordNet some intelligence about English morphology.
`Morphy is designed as a stand-alone front end to the WordNet database. It
`includes inflectional morphological variants of English words, thus allowing the
`user (and automated software such as that described below) to find the root
`morphemes of inflectional variants as well as particular morphological variants
`of a root. For example, singular nouns are asso.ciated with their plurals;
`verb infinitives have pointers to (a) past and simple present tense, (b) the
`gerundive/progressive form, (c) past participles, and (d) third per

This document is available on Docket Alarm but you must sign up to view it.


Or .

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge
throbber

Still Working On It

This document is taking longer than usual to download. This can happen if we need to contact the court directly to obtain the document and their servers are running slowly.

Give it another minute or two to complete, and then try the refresh button.

throbber

A few More Minutes ... Still Working

It can take up to 5 minutes for us to download a document if the court servers are running slowly.

Thank you for your continued patience.

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.





Document Unreadable or Corrupt

Refresh this Document
Go to the Docket

We are unable to display this document.

Refresh this Document
Go to the Docket