`US 20020168986Al
`
`(19) United States
`(12) Patent Application Publication
`Lau et al.
`
`(10) Pub. No.: US 2002/0168986 Al
`Nov. 14, 2002
`(43) Pub. Date:
`
`(54) VOICE ACTIVATED WIRELESS WCATOR
`SERVICE
`
`(76)
`
`Inventors: David Lau, Dallas, TX (US); Sharon
`Miesen, McKinney, TX (US) ; Justin
`Richmond, Dallas, TX (US)
`
`Correspondence Address:
`Slater & MatsiJ, L.L.P.
`Suite 1000
`17950 Preston Rd.
`DaUas, TX 75252-5793 (US)
`
`(21) Appl. No.:
`
`09/841,849
`
`(22) Filed:
`
`Apr. 25, 2001
`
`Related U.S. Application Data
`
`(60) Provisional application No. 60/199,792, filed on Apr.
`26, 2000. Provisional application No. 60/199,775,
`filed on Apr. 26, 2000.
`
`Publication Classification
`
`Int. Cl.7 ....................................................... H04Q 7/ 20
`(51)
`(52) U.S. Cl .
`......................... 455/456; 455/414; 455/426;
`455/563
`
`ABSTRACT
`(57)
`A voice activated wireless locator service provides an inter(cid:173)
`active voice response system whereby a user is walked
`through a sequence of voice prompt<; to select a desired
`destination. Using either geo-coded information provided
`during the voice call or speech recognition of a shortcut
`pbra<;e as.5ociated in a database with a street address, and
`using a locator service database, the system can provide
`spoken instructions using a text to speech engine, providing
`the name, location, and driving directions to one or more
`destinations matching the criteria provided by the u<;er.
`Becau<;e the system uses speech recognition technology, it is
`particularly well suited for wireless access devices, such as
`cellular telephones, which are often used when a u5er is
`engaged in other activities, such as driving. Once the loca(cid:173)
`tion infom1ation and driving directions are retrieved and
`provided vocally to the user, the same information can be
`provided in a text format using a WAP data connection and
`a database access synchronized to the voice activated data(cid:173)
`base entries.
`
`17
`
`DATABASE
`
`14
`
`DATABASE
`
`15
`
`• TTS
`• RECOGNITION
`
`LAN
`
`WAP
`SERVER
`
`6
`
`,,,
`
`100
`
`12
`
`7
`
`WAf' GATEWAY
`CELL NElWORK
`
`""--...
`
`4
`
`2
`
`
`
`Patent Application Publication Nov.14, 2002 Sheet 1 of 4
`
`US 2002/0168986 Al
`
`17
`
`DATABASE
`
`LOCATOR
`SERVICE
`
`14
`
`DATABASE - - • TTS
`• RECOGNITION
`
`LAN
`
`WAP
`SERVER
`
`15
`
`100
`~
`
`12
`
`7
`
`WAP GATEWAY
`
`CELL NETWORK
`
`FIG. 1
`
`4
`
`8
`
`20
`
`22
`
`24
`
`26
`
`LINE INTERFACE
`
`CALL MANAGER
`
`SPEECH ENGINE
`
`LAN CONNECTION
`
`6
`
`TTS ENGINE
`
`28
`
`DATABASE
`
`17
`
`~ Q ee
`
`0 00
`0 00
`0 00
`0 00
`
`'
`
`2
`
`DATABASE
`
`15
`
`FIG. 2
`
`
`
`~ 1--' e-.
`N g
`rJ1
`0
`
`> I--
`
`0-.
`00
`\0
`00
`
`0 -.
`N
`~ ....
`rJ1 =(cid:173)~
`N s
`
`~
`
`~~
`I-(cid:173)
`~
`
`~ = =♦ 0 =
`! =:
`""= = ;-= ....
`
`z 0
`~·
`~ = ....
`O"' =:
`""= ::
`
`FIG. 3
`
`DRIVING DIRECTIONS AND STORE INFORMATION
`
`114--'1 SERVERS ,
`
`LOCATION
`
`l l B _.ri DIRECTIONS
`
`ENDPOINTS
`START ANO
`G£0COD:D0116 !DRIVING
`
`\VAP PAGE
`
`104
`
`INPUT
`1WAP
`
`102
`
`lOBb
`\
`
`DRIVING DIRECTIONS >
`STORE INFO ANO
`
`I
`
`<:::::::::::SP::::::EE=C=H :;:~PUT
`
`108o
`\
`
`24
`
`106
`
`20
`
`'yOICE CALL ORIGINATION
`
`112
`
`28
`SPEECH
`
`TEXT TO
`
`CLOSEST STORE
`REQUEST FOR
`
`110
`
`API QUERY
`MAP QUEST
`
`DRIVING DIRECTIONS
`
`LOCATOR
`
`STATE, LAT AND LONG
`ADDRESS, CITY,
`STORE: NAME, H,
`ON CLOSEST
`INFORt.MTION
`
`RECTIONS
`AND DRIVING
`INfORMATION
`STORE
`
`DI
`
`RECOGNITION
`
`STORE INFORMATION AND
`
`r-------1 DATABASE
`
`SPEECH
`MANAGER
`
`CALL
`
`SETUP
`CALL
`
`~-====~1------l-------!.!AN:..:..::D~S:::.:I:...::.AR'-'-T....;..P..c..OI_NT-==
`
`\
`8
`
`22
`
`Tl'PE AND SUBTYPE
`
`DATABASE
`
`17
`
`15
`
`
`
`Patent Application Publication Nov.14, 2002 Sheet 3 of 4
`US 2002/0168986 Al
`C?fROU FIG. 4b
`
`( START
`I
`
`'
`
`150,
`
`r
`l
`
`WHAT WOULD YOU LIKE TO FIND?
`SHOPPING
`
`TRANSPORTATION
`
`152
`\
`WHAT WOULD YOU
`LIKE TO BUY?
`( CLOTHES
`)
`(
`)
`Glf'T
`,----1- - - - 7
`, r--------,,
`I I
`I I
`YOU WOULD
`: :
`LIKE TO FIND
`: :
`I I A (CLOTHING/
`I 1
`1 1 GIFT) STORE.
`I h__
`1 1 IS THAT CORRECT? 1 I
`l 60
`I I
`I I
`I L - - - - - - - - J I
`I ,
`- --. , - - -. I
`1 I YES l 1 N0 l 1
`L __ --- __ _J
`
`_f., -=i--'
`
`( POINTS OF INTEREST ~
`[
`)
`
`RESTAURANT
`
`158
`I
`WHAT KIND OF
`ATTRACTION WOULD
`YOU LIKE TO VISIT?
`( MUSEUM )
`( MONUMENT )
`,----.1 __ __ 7
`, i- YOU-WOULD -11
`
`: 1 LIKE TO VISIT 1 :
`1 I A (MUSEUM/
`I 1
`I I
`I I
`NATIONAL
`168 ~ : MONUMENT).
`: :
`1 I IS THAT CORRECT? I 1
`1L--------J1
`-- _.,/ - _.,,,
`I
`,
`- --.. ,,----. I
`1JYES l 1 NO 1 1
`L--1---- __ .J
`
`156
`\
`DO YOU NEED
`WHAT KIND OF
`A METRO STATION
`RESTAURANT
`OR TRAVEL AGENT?
`WOULD YOU LIKE?
`154.../ (
`( CHINESE
`)
`)
`METRO
`( FAST FOOD )
`[ TRAVEL AGENT )
`r ----1---- 7 r----1----7
`r--------1
`r--------,
`1 I 1 1 YOU WOULD
`1
`I 1
`1
`1
`: 1 YOU WOULD LIKE 1 :
`: 1 LIKE TO FIND 1
`1 1 TO FIND A (METRO 1 1 1 1 A (CHINESE(
`1 :
`I 1 1 I FAST FOOD
`STATION/
`I 1
`I I
`__,.-1 I TRAVEL AGENT)
`I t--. i---164
`I 1 1 I
`162 1 1 IS THAT CORRECT? 1 1 1 1 RESTAURANT.
`1 1
`· I 1 1 I IS THAT CORRECT? I 1
`I I
`I L ________ J I I L ___ __ ___ J I
`I ,,- - - -. , - -
`-.
`I
`I ,,-- --. ,,-- - -. I
`1 1Y£S l1 NO '11 1 Y£Sl1 NO 11
`L ________ .J L__ _ _ ___ _ ...J
`
`_T., -- _.,
`
`FIG. 4a
`
`.... -=f.,, -- _.,
`~ TO FIG. 4b
`
`
`
`Patent Application Publication Nov.14, 2002 Sheet 4 of 4
`
`US 2002/0168986 Al
`
`170
`
`NEAR£ST (SUBTYPE, TYPE) IS (BUSINESS NAME).
`(DISTANCE) MILES AWAY. WOULD YOU LIKE TO HEAR THE ADDRESS,
`THE PHONE NUMBER, THE DRIVING DIRECTIONS, OR ARE YOU FINISHED?
`
`(
`
`ADDRESS
`
`) ( PHONE NUMBER )
`
`DRIVING DIRECTIONS
`
`FINISHED
`
`172
`
`(PLAY THE REQUESTED
`INFORMATION)
`
`FIG. 4b
`
`WOULD YOU LIKE TO FIND
`ANOTHER PLACE, OR
`ARE YOU FINISHED?
`FINISHED
`ANOTHER
`
`174
`
`EXIT
`
`TO FIG. 4a
`
`180
`
`182
`
`WELCOME TO
`THE WireNix.com
`LOCATOR SERVICE
`
`WHERE DO YOU
`WANT TO GO?
`
`~
`
`184
`
`188
`
`YOUR RESULTS
`BUSINESS NAME
`ADDRESS
`TELEPHONE #
`ORIVIN~ OIRECIIQNS
`NEW LOCATION
`
`186
`
`190
`
`• DRIVING
`DIRECTION
`NEXT
`PREVIOUS
`DONE
`
`RECOGNITION AND
`LOCATION RESULTS
`FIG. 5
`
`
`
`US 2002/0168986 Al
`
`Nov. 14, 2002
`
`1
`
`VOlCE ACTIVATED wmELESS LOCATOR
`SERVICE
`[0001] This Application claims benefit of U.S. Provisional
`Application No. 60/199,792 filed on Apr. 26, 1999. This
`Application is related to commonly assigned U.S. Provi(cid:173)
`sional Patent Application No. 60/199,775 filed Apr. 26, 1999
`and entitled Voice Activated Wireless Navigation of a Com(cid:173)
`puter Network, and to co-pending, commonly assigned U.S.
`patent application Ser. No. ___ , filed concurrently here-
`with and entitled Voice Activated Wireless Navigation of a
`Computer Network, which patent applications are hereby
`incorporated by reference.
`
`FIELD OF THE INVENTION
`[0002] The invention relates generally to wireless access
`to a locator service and more specifically to a voice activated
`wireles.s system for locating desired destinations, such as
`restaurants, shopping, and the like.
`
`BACKGROUND OF THE INVENTION
`[(H)03] Two of the most rapidly growing and developing
`areas of technology today are wireless communications and
`the Internet. Not surprisingly, these two technologies are
`experiencing a rapid convergence, much as wire-based tele(cid:173)
`phony and personal computers converged in the 1990 's and
`continue to do so today.
`[0004] One of the primary motivating factors behind the
`convergence of wireless telephony and Internet technology
`is the ubiquitous presence of the World Wide Web in all
`facets of society. E-mail, e-commerce, entertainment, busi(cid:173)
`ness-to-business commerce, and many other resources are
`commonly available as World Wide Web resources. Not
`surprisingly, consumers desire to have such resources be as
`convenient and mobile as today's hand-held devices, such as
`cellular telephones and personal digital assistants (PDA's).
`Because the Internet and World Wide Web developed based
`upon wire-based telephony and relatively powerful comput(cid:173)
`ers, several technological hurdles must be overcome before
`the World Wide Web can be accessed from a wireless device
`with sufficient ease and convenience to make the Web a truly
`wireless resource.
`[0005] One service provided by the World Wide Web that
`is rapidly growing in usage and popularity is so-called
`locator services. A locator service functions essentially as an
`electronic yellow pages. A user can identify a particular type
`of destination, such as a restaurant, type of store, or public
`park, and the locator service will identify the name and
`address of destinations in the vicinity that match the user's
`criteria. In order to use the locator service, however, the user
`must know the service's web address or URI.
`[0006] One shortcoming in a typical current wireless
`access device is the limited means for inputting data, such as
`the uniform resource identifier (URI) of a desired Web
`resource. Whereas the typical Web user uses a personal
`computer (PC) with a mouse and keyboard for inputting
`information, a wireles.s access device user generally must
`rely upon a cumbersome and tedious process of inputting a
`URI one letter at a time using the limited keypad capabilities
`of a typical cellular telephone or PDA. This is because cell
`phone and PDA's were developed to provide other func(cid:173)
`tions, and were not originally intended for the type of data
`input intensive operations Web browsing often entails.
`
`[0007] The shortcomings of wireless access devices are
`exacerbated by the fact that such devices are typically used
`when the end-user is outside of his or her home, oftentimes
`engaged in other activities such as walking or driving. Under
`those circumstances, it is most undesirable that the user be
`distracted from the primary task (such as driving) in order to
`tediously input a request for location services one letter at a
`time.
`[0008] Therefore, a need exists for a system and method
`whereby World Wide Web resources, as well as other
`resources available over the Internet or some other computer
`network, can be easily accessed using the functionality
`provided in a typical wireless access device.
`[0009] Another shortcoming in the prior art is that most
`locator services available on the World Wide Web were
`designed for stationary users, such as consumers located at
`a stationary desktop workstation. These systems fail to
`recognize that the location of a mobile user (such as a user
`on a mobile cellular telephone) can be an important factor in
`identifying or selecting from among several destinations
`(such as restaurants) that would be of interest/benefit to the
`user. Therefore, a need exists in the art for a system that not
`only allows for easy and convenient inputting of information
`to a locator service, but also one that provides a flexible
`response based at least in part on the user's current location.
`
`SUMMARY OF TI-IE INVENTION
`
`[0010]
`In one aspect the present invention provides for a
`voice activated wireless locator service comprising a wire(cid:173)
`less access device, a speech server configured to receive
`incoming voice calls from the wireless access device and
`including a call manager, a speech recognition engine, a text
`to speech engine, and a locator service accessible to the
`speech server, receiving from the speech server a type of
`destination and a location of the wireles.s acces.s device, and
`returning to the speech server location information relating
`to at least one destination matching the type of destination
`received as input. The system further includes a WAP server
`configured to receive request from the wireless access
`device and to retrieve the location information relating to the
`at least one destination matching the type of destination
`received as input, and to display said location information on
`the wireless access device.
`[0011]
`In another aspect, the invention provides for a
`method of providing voice activated locator services to a
`user using a wireless access device. The method includes
`initiating a voice connection between the wireless access
`device and a speech server, initiating a call flow in which
`voice prompts are provided to the user and the user's spoken
`responses are received, and converting the user's spoken
`responses into database query commands. The method fur(cid:173)
`ther includes forwarding the database query commands to a
`locator service, along with location information for the
`wireless acces.s device, receiving destination information
`from the locator service, converting the destination infor(cid:173)
`mation into a voice prompt and presenting same to the user
`over the voice connection, and converting the destination
`information into a graphical representation and presenting
`the same to the user over a data connection between the
`wireless access device and a WAP server.
`[0012]
`In yet another aspect, the invention provides for a
`method of providing a user of a wireless acces.s device with
`
`
`
`US 2002/0168986 Al
`
`Nov. 14, 2002
`
`2
`
`voice driven access for specifying the starting location used
`to provide subsequent locator services without reference to
`the actual physical location of the wireless device. The
`method includes initiating a voice connection between the
`wireless access device and a speech server, initiating a call
`flow in which voice prompts are provided to the user, and
`receiving as input both a spoken request for locator services
`and a spoken shortcut phrase. The method further includes
`comparing the shortcut phra<;e to entries stored in a street
`address database and determining the street address associ(cid:173)
`ated with the shortcut phrase, as well as converting the
`user's spoken responses into database query commands. The
`method further includes forwarding the database query
`commands and the starting street address to a locator ser(cid:173)
`vice, receiving destination information from the locator
`service, converting the destination information into a voice
`prompt and presenting same to the user over the voice
`connection, and converting the destination information into
`a graphical representation and presenting the same to the
`user over a data connection between the wireless access
`device and a WAP server.
`
`BRIEF DESCRIPTION OF THE DRAWINGS
`[(H)l3] FIG. 1 illustrates in block diagram format, the
`hardware components of a preferred embodiment system for
`providing voice activated wireless locator services.
`[0014] FIG. 2 illustrates in greater detail a speech server
`for use in a preferred embodiment system.
`[0015] FIG. 3 illustrates in block diagram format, the
`software 1..'0mponents of a preferred embodiment system for
`providing voice activated wireless locator services.
`[0016] FIGS. 4a and 4b illustrates steps in the operation
`of the preferred embodiment system.
`[0017] FIG. 5 illustrates schematically the operation of an
`exemplary preferred embodiment call flow process.
`
`DETAILED DESCRIPTION OF THE
`PREFERRED EMBODIMENTS
`
`[0018] A first preferred embodiment system and method
`will be described with reference to FIGS. 1 and 2, with
`FIG. I being illustrative of various hardware components of
`the system and FIG. 2 being illustrative of the software
`component<;. The system, referred to generally as 100,
`includes a wireless access device 2, which is preferably a
`Wireless Access Protocol (WAP) compatible cellular tele(cid:173)
`phone handset, such as the Motorola iDEN "plus" WAP
`phone available from Motorola Corp., Schaumburg, Ill.
`Cellular phone 2 runs a WAP compatible browser, specially
`configured for the limited memory and storage capabilities
`of a cellular phone, such as the UP Browser available from
`Open Wave Systems, Inc. of Redwood City, Calif. Alterna(cid:173)
`tively, wireless access device 2 could be a personal digital
`as.5i5tant (PDA), such as a Palm Pilot VII, available from
`Palm Computing, configured to include a WAPWeb browser
`and cellular or wireless communication capabilities. For
`clarity, wireless acces.5 device 2 may be referred to as a
`cellular phone in the following description, even though
`other embodiment devices, such as PDA's are also contem(cid:173)
`plated.
`[0019] As illustrated, wireles.5 access device 2 is prefer(cid:173)
`ably configured to transmit either "data" or "voice." In
`
`practice, both "data" and "voice" are transmitted as analog
`or digital signals using similar radio frequency modulation
`and communication schemes. The difference between data
`and voice is the protocol used in handling the received signal
`at the receiving end. " Data" communications will be
`demodulated and treated as digital information, whereas
`"voice" communications will be de-modulated, then passed
`to a digital-to-analog converter (DCA) to re-create a voice
`signal.
`
`[0020] Voice communications are transmitted over a cel(cid:173)
`lular service provider network 4 to the public switched
`telephone network (PSTN) 6 and thence to the desired
`destination (as indicated by the telephone number dialed). In
`the illustrated case, the desired destination is a speech server
`8, for which additional details will be provided below.
`
`[0021] Data communications will also be transmitted from
`wireless acces.5 device 2 through cellular service provider
`network 4 and then to a WAP gateway 7, which serves as a
`sort of translator and border crossing between the wireless
`communications network 4 and the Internet 12. WAP gate(cid:173)
`way 7 accepts incoming requests in WAP protocol and
`forwards those requests onto the Internet using TCP/IP
`protocols. Likewise, TCP/IP protocol requests originating
`on the Internet will be translated to WAP protocol by WAP
`gateway before being passed on to cellular service network
`4. Once translated to TCP/IP network protocols, the requests
`from wireless access device 2 can be transmitted over the
`Internet 12 to a specified destination, such as WAP server 10.
`
`In the preferred embodiments, WAP/Web server 10
`[0022]
`(sometimes described herein a<; a Web server 10 and some(cid:173)
`times referred herein as a WAP server 10, depending upon
`the functionality being provided at that time), provides
`standard Web server functionality, such a<; receiving incom(cid:173)
`ing requests for resources and serving up Web pages or other
`Web resources in respon5e. When the request originates
`from a WAP-compliant browser on a wireless access device,
`pages and resources designed for the limited graphics capa(cid:173)
`bilities of wireless access devices are served up. When the
`request originates from a full-featured browser, pages and
`resources designed for the full graphics capabilities of that
`browser are served up. A preferred example of such a server
`is Microsoft IIS, available from Microsoft Corp., Redmond,
`Wa<;h. The server can run OD a x86 based platform, such as
`a Dell Pentium based Server, available from Dell Computer
`Corp., Austin, Tex.
`
`[0023] WAP Server 10 may obtain additional resources
`and services from speech server 8. In the preferred embodi(cid:173)
`ments, speech server 8 is a compact PCI chassis housing
`appropriate cards and Unix based processors for line inter(cid:173)
`face 20, a call manager 22, a speech recognition engine 24,
`and a Local Area Network (LAN) 26. Connected to speech
`server 8 is a location database 15 and locator service 114.
`Location database 15 is preferably a SQL compliant rela(cid:173)
`tional database configured to handle the demands of the
`particular system to which it is attached. The database can be
`located locally to speech server 8 or may be located OD
`another computer, a<; illustrated, and connected by a LAN or
`similar network connection of sufficient speed and band(cid:173)
`width. As will be described in more detail below, location
`database 15 contains geo-coded location information for
`destinations of interest to a user. In the preferred embodi(cid:173)
`ments, destinations such as stores and restaurant<; will be
`
`
`
`US 2002/0168986 Al
`
`Nov. 14, 2002
`
`3
`
`described; the teachings apply equally to other destinations,
`however, such as residences, parks, places of interest, hos(cid:173)
`pitals, schools, public meeting places, government offices,
`and the like. Also connected to speech server 8 is a street
`address database 17.
`
`[0024) Street address database 17 is preferably a SQL
`compliant relational database configured to handle the
`demands of the particular system to which it is attached. The
`database can be located locally to speech server 8 or may be
`located on another computer, as illustrated, and connected
`by a LAN or similar network connection of sufficient speed
`and bandwidth. As will be described in more detail below,
`street address database 17 contains shortcut phrases associ(cid:173)
`ated with geo-coded locations of interest to a user.
`
`[0025) Locator service 114 is shown connected to speech
`server 8 by way of LAN 13. Alternatively, and preferably,
`locator service 114 is connected to speech server 8 by way
`of an Internet connection, using well known TCP/IP proto(cid:173)
`cols or other network protocols.
`
`[0026) Further details will now be provided regarding
`speech server 8 with reference to FIG. 2. As shown, speech
`server 8 includes a line interface 20, a call manager 22, a
`speech recognition engine 24, and a Local Area Network
`(lAN) connection 26. Speech server 8 is preferably an x86
`based workstation, such as a Pentium based personal com(cid:173)
`puter.
`
`interface 20 provides interface between
`[0027) Line
`speech server 8 and the public switched telephone network
`6. An exemplary line interface card is the D/41 available
`from Dialogic Corp., which provides four ports for incoming
`calls. In commercial embodiments, greater call handling
`capacity would be preferable.
`
`[0028) Call manager 22 operates as a manager and arbi(cid:173)
`incoming calls and outgoing
`trator of resources for
`responses, as will be described in greater detail below.
`Speech recognition engine 24 is preferably a Nuance 6.2.2
`speech recognition engine, available from Nuance Commu(cid:173)
`nications Menlo Park, Calif. Finally, LAN connection 26
`provides interface between speech server 8 and other com(cid:173)
`ponents also connected to a LAN 13 (FIG. 1), such as WAP
`server 10 and also text to speech ("TTS") 28. TfS engine 28
`is preferably a Fonix FAAST 4.0 ITS product, or Lernout &
`Hauspie, Inc. " RealSpeak" TTS product, or similar ITS
`program. In other embodiments, ITS engine 28 can run on
`the same computer and be considered as part of speech
`server 8. Preferably, however, the ITS engine runs on a
`separate computer in order to provide for quicker response
`times and to mitigate the effects of competition for computer
`resources.
`
`[0029) Operation of system 100 will be described with
`reference to FIG. 3, which illustrates the software compo(cid:173)
`nents of system 100 and their interaction. Operation begins
`with a user placing a WAP data call to WAP server 10 (FIG.
`1) using wireless access device 2. This is illustrated sche(cid:173)
`matically in FIG. 3 by arrow 102 between wireless access
`device 2 and WAP Page 104. WAP page 104 is served up by
`WAP server 10 when a data connection is first initiated by
`a user. Included on WAP page 104 is a hyper-link or similar
`resource that will cause initiation of a voice call between
`wireles.s access device 2 and speech server 8. When the link
`is selected by the user, the Web browser in wireles.s access
`
`device 2 will make system calls to the device's operating
`system to originate the voice call, as indicated by arrow 106.
`At this time, data connection 102 between wireless access
`device 2 and WAP server 10 will be suspended, but will not
`be terminated. In other embodiments, the end-user initiates
`access to speech server 8 by dialing the speech server's
`telephone number using wireless access device 2. The
`telephone number can be input manually using the device's
`numeric keypad, or may be stored in the device's memory
`and selected from a menu or list. Alternatively, the user
`might select an icon from a graphical user interface provided
`on the device 2, which icon bas associated with it the
`telephone number for speech server 8.
`
`[0030) Call setup firmware 108 represenLs firmware resid(cid:173)
`ing on the line interface card 20 of speech server 8 (FIG. 2).
`Once the call is established, call manager 22 initiates a call
`flow. A call flow is which is a sequence of voice prompL5
`(either pre-recorded or generated by ITS engine 28),
`received responses (which are recognized by speech recog(cid:173)
`nition engine 24) and calls to other resources, such as calL5
`to databases 15 and 17. Voice prompts are provided to the
`end user and answers are received as indicated by arrows
`108a and 108b.
`
`[0031] An example of the initial portion of a call flow is
`provided in FIGS. 4a and 4b. The call flow begins at step
`150 where a voice prompt is provided to the end user. This
`voice prompt can be either a pre-recorded message that is
`accessed and served up by call manager 22 or, for greater
`flexibility, the prompt could be a text message that is
`converted to speech through calls to TIS engine 28. As
`shown, the voice prompt at 150 requests information as to
`what destination the user seeks. As illustrated, four possible
`spoken responses will be considered valid when received
`and translated by speech recognition engine 24. These four
`valid responses are "shopping,""transportation,""points of
`interest" and "restaurant." Optionally, the voice prompt can
`list the four destination types that are available as valid
`responses. These are provided for example only. Numerous
`other destinations can be accommodated as well.
`
`[0032] The user responds to the prompt by answering with
`bis or her destination type. The response is pas.sect by call
`manager 22 to spee-ch recognition engine 24 where the
`response i5 converted into a text-like format and compared
`to the grammar of acceptable responses. Depending upon
`the user's response, the call flow ,,viii proceed to either step
`152, 154, 156, or 158. Although not illustrated for clarity,
`also contemplated in the preferred embodiments is a branch
`for error handling in the event the user's response is not
`recognized by speech recognition engine 24 or is recog(cid:173)
`nized, but is an invalid response. Under those circumstances,
`the call flow could loop back to step 150 for a pre-set number
`of times, or could transfer the call to a human operator for
`farther assistance. Other error handling routines, such as
`requesting responses in DTMF keypad format could also be
`employed.
`
`[0033) Once the user bas indicated the type of desired
`destination, a specific sub-type is requested in one of steps
`152, 154, 156, or 158 (depending on the type of destination
`identified in step 150). Using step 152 as an example, which
`will be reached if the user identified shopping in response to
`the prompt at s tep 150, call manager 22 then provides the
`voice prompt "what would you like to buy?"The prompt
`
`
`
`US 2002/0168986 Al
`
`Nov. 14, 2002
`
`4
`
`could optionally also list out specific sub-types that are
`available, such as "clothes,""gifts," and the like. Again, the
`user's response is received, forwarded to speech recognition
`engine 24, and compared to a list of valid responses to
`determine the desired subtype of destination. Error handling
`routines a<; described above may be employed if a valid
`response is not received.
`[0034) Optionally, the call flow may progress to a confir(cid:173)
`mation prompt, such as provided in step 160. This step
`requires TIS engine 28 in addition to pre-recorded voice
`prompt<; because this prompt will insert the type and subtype
`of destination that bas been determined in steps 150 and 152,
`respectively (or 150 and 154, 150 and 156, or 150 and 158).
`One skilled in the art will readily recognize how to insert the
`type and sub-type generated by TI'S engine 28 into the
`appropriate locations of a pre-recorded prompt, or alterna(cid:173)
`tively the entire prompt at steps 160, 162, 164, and 166 may
`be TI'S generated. If the user responds in the negative to the
`confirmation prompt, the call flow proceeds back to step 150
`to give the user another opportunity lo identify the type and
`subtype of desired destination. Alternatively, an error han(cid:173)
`dling routine can be called in response to the negative
`response.
`[0035) Referring again to FIG. 3, call manager 22 next
`sends a Common Object Module (COM) call to locator 110,
`as indicated by communication arrow 112. The COM call
`provides the desired type and subtype of destination to
`locator 110. Locator 110 is preferably a specialized program
`or function call that coordinates between the databases,
`locator services, and speech server, as described herein. The
`locator 110 queries database 15 for all entries having a
`matching type and subtype and retrieves for those entries the
`name, street address, and geo-coded location data (such as
`latifllde and longifllde information).
`In order to determine which of the matching des(cid:173)
`[0036)
`tinations is closest, locator 110 than forwards to the geo(cid:173)
`coded information for each of the destinations to a locator
`service 114, as indicated by communication arrow 116. This
`request is preferably sent across an Internet connection, as
`the locator service 114 is typically a third party vendor of
`locator services, such as MapQuest.
`[0037] Assuming the desired destination is Chinese res(cid:173)
`taurants, then the two, or three, or however many entries in
`database 15 that match the type "restaurant" and the subtype
`"Chinese" will be retrieved by locator 110, along with the
`name, addres.s and geo-coded data for those restaurants. The
`geo-coded data for each restaurant is forwarded to the
`locator service 114, as described above, in order to deter(cid:173)
`mine which restaurant is closest.
`[0038] The invention provides for the user to select how to
`indicate the s tarting location, i.e., the current actual location
`of the user to be provided to the locator service in order to
`determine which destination is closest. The call flow will
`play a prompt presenting the option of using the actual
`location of wireless access device 2 (which can be deter(cid:173)
`mined from cell network 4 using known signal triangulation
`techniques or alternatively from geo-coded information pro(cid:173)
`vided by the wireless access device itselt). Alternatively, the
`user can identify his or her location using one of several
`previously stored starting locations or Quick.PlacesTM as
`explained in detail below. The user makes the selection by
`speaking the desired shortcut phrase, or by requesting to use
`the actual location.
`
`[0039]
`If the user selected to use the actual location, the
`geo-coded information for the actual location of wireless
`access device 2 is provided lo the locator service. In some
`embodiments, the latitude and longitude or other geo-coded
`data for the wireless access device can be provided by
`incorporating a global satellite positioning system (not
`shown) in the device itself. A more preferred embodiment is
`to employ cellular site triangulation in the cellular service
`net\1/ork itself, whereby the geo-coded data for each wireless
`access device is passed on to speech server 8 (or any other
`termination point of a call originating at the wireless device)
`as part of the call and communication services provided by
`the \vireless service provider. Details regarding the triangu(cid:173)
`lation proces.s are not necessary for an understanding of the
`invention and have hence been omitted for clarity.
`
`[0040] The following paragraphs describe bow the con(cid:173)
`cept of speech shortcuts can be applied to selecting a starting
`street address using voice recognition of a speech shortcut to
`identify and select tbe desired street address. T he combina(cid:173)
`tion of a speech shortcut and a street address is known as a
`QuickPlace. TM
`
`[0041]
`Initially, the QuickPlaces must be created and
`stored for future reference. Returning to FIG. 2, street
`address database 17 is shown connected to speech server 8
`by way of LAN 13. Database 17 is preferably a SOL
`compliant relational database, as is well known in the art,
`although any appropriately configured database is sufficient.
`QuickPlaces are stored to street address database 17 in
`several ways. The simplest manner of storing QuickPlaces
`would be for a PC user to access a Web page served up by
`Web server 10, which Web page provides text fields whereby
`a user can input a street address and an associated short cut
`phrase