`Wesemann et al.
`
`(10) Patent N0.:
`(45) Date of Patent:
`
`US 6,731,724 B2
`May 4, 2004
`
`US006731724B2
`
`1/1999 Kalveram ................. .. 704/233
`5,864,804 A *
`5/1999 Tatchell etal.
`..
`. 379/88.01
`5,905,774 A *
`9/1999 Culbreth etal.
`. 379/88.25
`5,953,393 A *
`;;(tfi16116ta1~ ............. 379/197
`2 *
`3/2000 Furukawa 8181.
`....... 379/88.01
`6,044,133 A *
`7/2000 Trandal et 81
`379/189
`6 088 428 A *
`7/2000 Hunt et 81.
`....::::::....:..379/88.04
`6:094:476 A *
`11/2000 Surace et 81.
`6,144,938 A
`6,167,251 A * 12/2000 Segal 61; 81.
`............... .. 455/405
`1/2001 Wu ......................... .. 379/67.1
`6,173,042 B1 *
`
`6,195,417 B1 *
`2/2001 Dans . . . . .
`. . . .. 379/352
`.................. .. 379/88.02
`6,198,947 B1 *
`3/2001 Barber
`6,349,132 B1 *
`2/2002 Wesemann et al.
`.... .. 379/88.01
`T
`L1lSTlg£il'TLCI1 61261.
`. . . . . . . ..
`6,501,832 B1 * 12/2002 Saylor et al.
`.......... .. 379/88.04
`2002/0001370 A1 *
`1/2002 Walker et al.
`......... .. 379/88.04
`
`
`
`OTHER PUBLICATIONS
`
`ISBN
`H. Newton, Newton’s Telecom Dictionary,
`1—57820—023—7, Mar. 1998, Flatiron Publishing, 14”“ ed.,
`pp. 72 and 73*
`
`(54) VOICE-ENABLED USER INTERFACE FOR
`VOICEMAIL SYSTEMS
`
`(75)
`
`Inventors: Darren L. Wesemann, North Salt
`Elke’ U.T (US)’ Makanl M‘%S°“> Salt
`ake Clty’ UT (USA J“ Wlflesen’
`W"5‘Va11‘°~Y Ci‘Y> UT (US); T3°'D°°k
`KW90I1> Sa1tLa1<€ C1tY>UT(US)é
`Dong-Kyun Nam, Murray, UT (US)
`
`(73) Assignee: Pumatech, Inc., San Jose, CA (US)
`
`( * ) Notice:
`
`Subject to any disclaimer, the term of this
`patent is extended or adjusted under
`USC. 154(b) by 43 days.
`
`(21) Appl. No.: 09/887,215
`
`(22) Filed;
`
`Jun_ 22, 2001
`
`(65)
`
`Prior Publication Data
`
`(60)
`
`Related US Application Data
`Provisional application No. 60/263,469, filed on Jan. 22,
`2001'
`Int. Cl.7 ................................................. H04M 1/64
`(51)
`(52) U.S. Cl.
`.............................. .. 379/88.16; 379/88.01;
`379/88.18; 704/231; 704/270.1; 455/563
`(58) Field of Search ........................... 379/88.01, 88.03,
`379/8804-8809’ 8813-8825’ 9101’ 9326-9328’
`21801’ 455/79’ 563’ 704/2701’ 277’ 231’
`246’ 270
`
`56
`
`(
`
`)
`
`C-t d
`R f
`1 e
`e erences
`U.S. PATENT DOCUMENTS
`5,208,848 A *
`5/1993 Pula ......................... .. 379/196
`5,353,336 A * 10/1994 Hou et 81.
`................ .. 370/354
`5,471,521 A * 11/1995 Minakami et 81.
`..... .. 379/88.01
`5,659,597 A 4
`8/1997 Bards et a1.
`.......... N 379/88.01
`5,737,393 A
`4/1993 w01f
`5,822,405 A * 10/1998 Astarabadi ................ .. 379/352
`5,864,605 A
`1/1999 Keshav
`
`Primary Examiner—Roland G. Foster
`(74) Atmr'16)’; Agent; 0" F1''m—W0rkman Nydegger
`(57)
`ABSTRACT
`b1.
`th d f
`bl d
`A .
`.
`t
`f
`d
`or ena mg a
`e .u.Ser In e,r ace ME me, 0
`V01Ce'e,na
`1flSCI' that ishproviding audio andtylotcal inpult to access datta
`drgftoengpmffi§fe”:;::Y?,?]3“;M1§,,)a;fgggjjizfjzjesz
`the voice-enabled 1i1ser inferface with a tegle hone device and
`provides input that is translated into a DTIVTJF translation that
`can be recognized by DTMF telephone service system. A
`template of the voice-enabled user interface maps menu
`states, prompts, and acceptable inputs of the DTMF tele-
`phone service system. The template also monitors the cur-
`Fem Sf.“ ffthe tf1’11)‘:’p1“.’“e,Serl"1".e Systfem at an “mes”
`“M” “E1 “ 5° °““ °” “ ”;°‘h° J”‘1“Ph‘°m 0”“ .m°“” 5 “ °
`‘O. 3”“ er men“ State .0
`t C “’ GP One Sewlce System
`Without having to enter input for each menu state between
`the first and the second menu states.
`
`28 Claims, 5 Drawing Sheets
`
`
`
`/100
`110
`
`VOICE-ENABLED
`
`USER INTERFACE
`160
`
`
`TELEPHONY
`NETWORK
`
`150
`
`VOICEMAIL
`DATA
`
`
`
`
`DATABASE
`
`
`
`IPRZO17-01039
`
`Unified EX1004 Page 1
`
`IPR2017-01039
`Unified EX1004 Page 1
`
`
`
`U.S. Patent
`
`May 4, 2004
`
`Sheet 1 of 5
`
`US 6,731,724 B2
`
`cm?
`
`=<sm2o>
`
`<4<a
`
`Qm4m<zwmoa>
`
`mo<_mmHz_¢mm=
`
`o©_
`
`°r_
`
`=<Emos>
`
`mH<Hm
`
`mme<z<s
`
`m_=aos
`
`xwm=
`
`mH<Hm
`
`mw¢<z<E
`
`w4=Qo=
`
`4.3
`
`mw<m<H<Q
`
`IPRZO17-01039
`
`Unified EX1004 Page 2
`
`IPR2017-01039
`Unified EX1004 Page 2
`
`
`
`
`
`
`
`U.S. Patent
`
`May 4, 2004
`
`Sheet 2 of 5
`
`US 6,731,724 B2
`
`MANAGER MODULE
`M
`
`330
`
`VOICEMAIL
`AUTHENTICATION
`
`KEEP
`
`ALIVE VOICEMAIL STATE
`
`MANAGE1R OMODULE
`L
`
`350
`
`IPRZO17-01039
`
`Unified EX1004 Page 3
`
`IPR2017-01039
`Unified EX1004 Page 3
`
`
`
`U.S. Patent
`
`May 4, 2004
`
`Sheet 3 of 5
`
`US 6,731,724 B2
`
`VMCE-ENABLED
`USERINTERFACE
`
`TELEPHONE SERWCE
`SYSTEM
`
`420
`
`PRoNPT USER
`FOR ACCOUNT
`
`INFORMATION
`
`AUTHENTICATE
`
`ACCOUNT
`
`INFORMATION
`
`
`
`ACCESS
`TELEPHONE
`
`SERVICE SYSTEM
`
`470
`
`IS ACCOUNT
`INFORMATION STORED
`
`
`
`RETRIEVE
`TRANSFER
`PROMPT TO
`ACCOUNT
`
`USER
`
`
`
`INFORMATION I
`INFORMATION
`SIGNAL? I TRANSMIT
`
`PROMPT FOR
`ACCOUNT
`
`I INFORMATION
`
`460
`
`AUTHENTICATE
`USER ACCOUNT
`INFORMATION
`
`492
`
`PRownE
`MENU
`PROMPT
`
`WAIT FOR
`USER|NPUT
`
`494
`
`496
`
`
`
`
`
`
`
`490
`
`FIG. 4
`
`DTMF ACCOUNT
`INFORMATION
`To TELEPHONE
`SERVICE SYSTEM
`
`
`
`IPRZO17-01039
`
`Unified EX1004 Page 4
`
`USER
`
`ACCESS
`vmcEENA3LEo
`USER INTERFACE
`
`ENTER
`
`.N?8°°R‘.%N
`RM
`
`410
`
`43°
`
`473
`
`ENTER
`ACCOUNT
`
`479
`
`AITNNE
`
`IPR2017-01039
`Unified EX1004 Page 4
`
`
`
`U.S. Patent
`
`May 4, 2004
`
`Sheet 4 of 5
`
`US 6,731,724 B2
`
`
`
`3;:§>§W235:$§Ez_$3a§z§_g$2.
`
`...E;E53E$.§_._5%m_
`
`5%as2:N.322222:5:2as
`:53523EE2;5%2
`
`E:as:
`
`E22;3
`
`
`
`:_§~_g_>s_._
`
`5%:_._z_
`
`5.28$5,.
`
`WES$253
`
`W25V85:
`
`32:;
`
`mE
`
`533,.
`
`E2=_,_mzs:5:220.325::5E;E32%2E2".2E5325:SEE552$53E22:
`s_>§gm5;5:2E32,:201%:E:222259E3E2.
`s_§§E2.2%6N252$52%E5:
`
`21%w._%¢=o
`
`E55s_>§
`
`§
`
`IPRZO17-01039
`
`Unified EX1004 Page 5
`
`IPR2017-01039
`Unified EX1004 Page 5
`
`
`
`
`
`
`U.S. Patent
`
`May 4, 2004
`
`Sheet 5 of 5
`
`US 6,731,724 B2
`
`35
`
`§\‘
`
`So
`
`22.222
`
`23
`
`E5
`
`E2;
`
`3%
`
`8:;
`
`myE
`
`§z_§
`
`3::
`
`$.23
`
`OS
`
`So
`
`IPRZO17-01039
`
`Unified EX1004 Page 6
`
`IPR2017-01039
`Unified EX1004 Page 6
`
`
`
`
`
`
`US 6,731,724 B2
`
`1
`VOICE-ENABLED USER INTERFACE FOR
`VOICEMAIL SYSTEMS
`
`RELATED APPLICATIONS
`
`This application claims the benefit of U.S. Provisional
`Patent Application Ser. No. 60/263,469, entitled “Universal
`Voicemail Gateway (UVG),” filed Jan. 22, 2001, which is
`incorporated herein by reference.
`BACKGROUND OF THE INVENTION
`
`1. The Field of the Invention
`
`invention relates to a voice-enabled user
`The present
`interface for accessing a telephone service system that
`responds to dial tones. More particularly, the present inven-
`tion relates to methods and systems for enabling a user, with
`voice commands, to access and navigate a telephone service
`system that only recognizes dual tone multi-frequency sig-
`nals.
`2. The Prior State of the Art
`
`Through the years, it has become commonplace to use a
`telephone, not only to communicate, but also to perform
`electronic business and financial transactions, and to obtain
`prerecorded information.
`It
`is possible to perform these
`activities over a telephone, in part, because of the develop-
`ment of corresponding telephone service systems, which are
`used in virtually every industry and business environment.
`Voicemail is perhaps the most common type of telephone
`service system. Voicemail,
`like an answering machine,
`enables an incoming telephone message to be recorded for
`review by an intended recipient at a later time. However,
`unlike an answering machine, voicemail can record an
`incoming message even when a corresponding telephone is
`“off the hook.” Accordingly, voicemail is particularly useful
`when a person cannot answer an incoming telephone call
`because that person is already engaged in another telephone
`call.
`
`The prior art also includes various other types of tele-
`phone service systems that facilitate business and financial
`transactions and enable users to obtain prerecorded infor-
`mation. Essentially, a telephone service system includes any
`automated system that provides a user with data over a
`telephone device. Virtually every industry uses a telephone
`service system of one type or another, even if the telephone
`service system consists of only a customized telephone
`directory of available business services or personnel.
`Some of the more useful telephone service systems enable
`users using a telephone device to interact with a business
`network to transact various activities, such as for performing
`home banking, purchasing consumer products, receiving
`customer support, and accessing news, entertainment,
`financial, and travel information. Some telephone service
`systems also enable a user to access information that is
`generated on-the-fly, such as when a telephone service
`system generates text-to-speech translations of dynamic
`information.
`
`5
`
`10
`
`15
`
`20
`
`25
`
`30
`
`35
`
`40
`
`45
`
`50
`
`55
`
`Telephone service systems are particularly useful from a
`user’s perspective because they enable a user to access
`information and to make business and financial transactions
`
`60
`
`when and where it is most convenient for the user. Telephone
`service systems are useful from a business’s perspective
`because they enable businesses to hire fewer employees to
`answer incoming calls and to perform other tasks that can be
`automated by telephone service systems.
`One problem with certain telephone service systems,
`however,
`is that
`they only recognize dual
`tone multi-
`
`65
`
`2
`
`frequency signals (“DTMF”) signals. A DTMF telephone
`service system is generally problematic because a user can
`only use the system by manually pressing the keys on a
`DTMF telephone keypad. This is a problem because it is
`easy for a user to inadvertently press an incorrect key,
`sometimes without realizing it. Pressing an incorrect key
`may result in an undesirable consequence. For example, by
`pressing an incorrect key a user may cause the telephone
`service system to transition from one menu state to an
`undesired menu state. This may also require a user to return
`to the prior menu state before a desired selection can be
`made. This is not only inconvenient, it is a waste of time. By
`pressing incorrect keys, a user may also cause a telephone
`service system to perform an act that is not desired. For
`example, by pressing an incorrect key, a telephone service
`system may terminate a phone call, call a wrong extension
`number, or in the case of home banking, transfer an incorrect
`amount of money between accounts or to a wrong account.
`It should be appreciated that these examples are given by
`way of illustration and not limitation.
`To overcome the problems that are associated with inad-
`vertently pressing a wrong key, many telephone service
`systems provide prompts that request a user to confirm
`certain entries before they are processed. This, however,
`does not resolve the underlying problem associated with
`DTMF telephone service systems, namely, requiring a user
`to enter input by pressing keys on a telephone keypad.
`Another problem associated with DTMF telephone ser-
`vice systems is that a user is required to move a handheld
`telephone back and forth between a position that enables the
`user to listen to the voicemail system and to a position that
`allows the user to press keys on the telephone keypad.
`Having to do this is inconvenient. It can also be dangerous,
`particularly for a person who is driving a car because it may
`require that person to take his eyes off the road and his hands
`off the steering wheel.
`To overcome the problems associated with DTMF tele-
`phone service systems, the industry has developed telephone
`service systems that recognize speech so that a user can
`speak commands and other input without having to press
`keys on a telephone keypad. There are various methods for
`performing speech recognition, which are well known in the
`art. Some examples include spectral analysis, dynamic time
`warping, neural networks, and recognition by discrete and
`continuous hidden Markov modeling.
`Although speech recognition telephone service systems
`provide a significant advantage over DTMF telephone ser-
`vice systems,
`they do not resolve all of the associated
`problems with the prior art telephone service systems. In
`particular, a user must still expend the time that is necessary
`to move systematically through a hierarchy of levels or
`menu states that are associated with the menu structure of a
`
`telephone service system, even when a user already knows
`what the final menu state will be. For example, from a main
`menu a user may be required to transition through several
`“in between” menu states before finally reaching a final
`menu state that contains the information that is desired by
`the user. Many telephone service systems also require a user
`to return to the main menu before navigating to a second
`desired menu state. Navigation through a menu hierarchy is
`particularly time consuming when the hierarchy is complex,
`or has many menu states. This type of navigation through a
`telephone service system can also be expensive, particularly
`when access to the telephone service system requires a user
`to incur costs associated with long-distance and/or a wireless
`telephone service plan.
`Another problem with speech recognition telephone ser-
`vice systems is that because of variations in speech it is
`
`IPRZO17-01039
`
`Unified EX1004 Page 7
`
`IPR2017-01039
`Unified EX1004 Page 7
`
`
`
`US 6,731,724 B2
`
`3
`impossible to guarantee that speech recognition software
`will recognize user input. It is also expensive for businesses
`to modify existing DTMF telephone service systems to
`recognize and process audio and vocal data with speech
`recognition software.
`Accordingly, it would be beneficial to provide a method
`for enabling a user to use an existing DTMF telephone
`service system by speaking into a telephone device without
`requiring modification to the DTMF telephone service sys-
`tem. It would also be beneficial to accomplish this while
`enabling a user to jump to any level or state in a menu
`hierarchy of a telephone service system without requiring
`the user to systematically navigate through numerous “in
`between” menu states.
`
`SUMMARY OF THE INVENTION
`
`The present invention is directed towards a voice-enabled
`user interface for enabling a user to speak over a telephone
`device to access data from a telephone service system that is
`only responsive to dual tone multi-frequency (“DTMF”)
`signals.
`In one embodiment, a user accesses the voice-enabled
`user interface with a telephone device and enters authenti-
`cation information. The voice-enabled user interface verifies
`
`the identity of the user and accesses a telephone service
`system that has information that is desired by the user, but
`which is only responsive to DTMF signals. The user
`responds to prompts generated by the telephone service
`system by speaking into the telephone device. The voice-
`enabled user interface translates the user’s input into a
`DTMF translation that is recognized by the telephone ser-
`vice system. The voice-enabled user interface maps the
`hierarchy of menu states and corresponding prompts of the
`telephone service system within a template. During use, the
`voice-enabled user interface uses the template to determine
`what menu state the telephone service system is in at all
`times. Auser can jump from one menu state to another menu
`state by providing input that the voice-enabled user interface
`associates with a corresponding menu state. The voice-
`enabled user interface generates output
`that causes the
`telephone service system to transition to the menu state that
`corresponds with the user input. Once the telephone service
`system is in the appropriate menu state, the voice-enabled
`user interface transmits a DTMF translation of the user input
`to the telephone service system so that it can be processed.
`One advantage of the present invention is that it enables
`users using voice commands to access data from existing
`DTMF telephone service systems without requiring any
`modification to the existing systems. The present invention
`also enables users to navigate through a hierarchy of menu
`states of a telephone service system without requiring the
`users to enter input to transition through “in between” menu
`states. Instead, the invention enables a user to jump over “in
`between” menu states, from a first menu state to a second
`menu state with only a single user input.
`These and other objects, features, and advantages of the
`present invention will become more fully apparent from the
`following description and appended claims, or may be
`learned by practicing the invention as set forth below.
`BRIEF DESCRIPTION OF THE DRAWINGS
`
`A more extensive description of the present invention,
`including the above-recited features and advantages will be
`rendered with reference to the specific embodiments that are
`illustrated in the appended drawings. Because these draw-
`ings depict only exemplary embodiments,
`the drawings
`
`10
`
`15
`
`20
`
`25
`
`30
`
`35
`
`40
`
`45
`
`50
`
`55
`
`60
`
`65
`
`4
`should not be construed as imposing any limitation on the
`present invention’s scope. As such, the present invention
`will be described and explained with additional specificity
`and detail through use of the accompanying drawings in
`which:
`
`FIG. 1 is a block diagram showing one suitable network
`environment for practicing the invention that includes a
`telephone device, a voice-enabled user interface, and a
`telephone service system;
`FIG. 2 is a block diagram showing increased detail of the
`components that make up the user state manager module of
`the voice-enabled user interface of the present invention;
`FIG. 3 is a block diagram showing increased detail of the
`components that make up the voicemail state manager
`module of the voice-enabled user interface of the present
`invention;
`FIG. 4 is a flow chart illustrating an embodiment for
`enabling a user to access a telephone service system with the
`voice-enabled user interface of the invention;
`FIG. 5 is a flow chart illustrating an embodiment of the
`invention for enabling a user to access data from a telephone
`service system with the voice-enabled user interface of the
`invention;
`FIG. 6 is a block diagram illustrating one embodiment of
`a hierarchal menu structure of a telephone service system
`that can be accessed and navigated using the voice-enabled
`user interface of the invention.
`
`DETAILED DESCRIPTION OF THE
`PREFERRED EMBODIMENTS
`
`The present invention is directed to a voice-enabled user
`interface and methods for enabling users to access and use
`an existing telephone service system that is only responsive
`to dial tone signals. More particularly, the present invention
`enables users to speak into a telephone device to access data
`from an existing telephone service system that
`is only
`responsive to dual tone multi-frequency (“DTMF”) signals.
`The term telephone service system, as used herein, should
`generally be construed to include any telephone service that
`is responsive to user input entered over a telephone device
`and is capable of providing data to a user. One type of
`telephone service system that can be accessed by using the
`present invention includes a voicemail system.
`Users can perform various functions with a voicemail
`system. These functions include, but are not
`limited to,
`listening to messages that are left by a third party, recording
`messages or greetings for a third party to listen to, forward-
`ing messages, deleting messages, and configuring voicemail
`answering and recording procedures. Users perform func-
`tions with a voicemail system by entering input
`into a
`telephone device that can be recognized and processed by
`the voicemail system. DTMF voicemail systems are one
`type of voicemail systems that are only responsive to DTMF
`signals. DTMF voicemail systems cannot recognize audio or
`vocal input.
`In one embodiment of the invention, a voice-enabled user
`interface enables users to perform functions with a DTMF
`voicemail system by only speaking into a telephone device.
`It should be appreciated, however,
`that although certain
`embodiments of the present
`invention are described as
`applying to DTMF voicemail systems, the invention is not
`limited to any particular type of telephone service system.
`Similarly, a user can access and use the voice-enabled user
`interface of the invention with any type of telephone device,
`including, but not limited to wire bound telephones, wireless
`
`IPRZO17-01039
`
`Unified EX1004 Page 8
`
`IPR2017-01039
`Unified EX1004 Page 8
`
`
`
`US 6,731,724 B2
`
`5
`telephones, and
`telephones, cellular telephones, digital
`PDA’s and computers having telephony capabilities.
`Embodiments of the invention, as described herein, may
`comprise a special purpose or general-purpose computer
`comprising various computer hardware. Embodiments may
`also include computer-readable media having computer-
`executable instructions or data structures stored thereon.
`
`Such computer-readable media can be any available media
`that can be accessed by a general-purpose or special-purpose
`computer. By way of example, and not limitation, such
`computer-readable media can comprise RAM, ROM,
`EEPROM, CD-ROM or other optical disk storage, magnetic
`disk storage or other magnetic storage devices, or any other
`medium which can be used to store the desired executable
`
`instructions or data structures and which can be accessed by
`a general-purpose or special-purpose computer.
`When information is transferred or provided over a net-
`work or other communications connection to a computer, the
`computer properly views the connection as a computer-
`readable medium. Thus, such a connection is also properly
`termed a computer-readable medium. Combinations of the
`above should also be included within the scope of computer-
`readable media. Computer-executable instructions
`comprise, for example, instructions and data which cause a
`general-purpose computer, special-purpose computer, or
`special-purpose processing device to perform a certain func-
`tion or group of functions. The computer-executable instruc-
`tions and associated data structures represent an example of
`program code means for executing the steps of the invention
`disclosed herein.
`
`The invention will be described in the general context of
`computer-executable instructions, such as program modules,
`being executed by a computer. Generally, program modules
`include routines, programs, objects, components, data
`structures, or the like that perform particular tasks or imple-
`ment particular abstract data types. Moreover, those skilled
`in the art will appreciate that the invention may be practiced
`with other computer system configurations, including hand-
`held devices, multi-processor systems, microprocessor-
`based or programmable consumer electronics, network PCs,
`minicomputers, mainframe computers, and the like. The
`invention may also be practiced in distributed computing
`environments where tasks are performed by remote process-
`ing devices that are linked through a communications net-
`work. In a distributed computing environment, program
`modules may be located in both local and remote memory
`storage devices.
`Turning now to FIG. 1, one suitable environment 100 for
`practicing the method of the invention for enabling a user to
`access data from a telephone service system is illustrated. As
`shown, a voice-enabled user interface 110 is in communi-
`cation with a telephone device 120 and a telephone service
`system through telephony networks 130. The invention may
`be practiced in various telephony network environments,
`including, but not limited to Public Switched Telephone
`Networks (PSTN),
`Integrated Services Digital Network
`(ISDN) and Fiber Distributed Digital Interface (FDDI). In
`the present embodiment, the telephone service system com-
`prises voicemail system 140 which can be accessed by a user
`using telephone device 120.
`Voicemail system 140 comprises voicemail data 150 that
`can include any type of data that can be transmitted to a user
`over a telephone device.
`In one preferred embodiment,
`voicemail data 150 comprises a prerecorded message, such
`as, for example, a menu prompt, a telephone phone message,
`a greeting, etc. Voicemail data 150 may also include data
`
`10
`
`15
`
`20
`
`25
`
`30
`
`35
`
`40
`
`45
`
`50
`
`55
`
`60
`
`65
`
`6
`that is dynamic or that is accessed from another network. For
`example, in one embodiment, voicemail system 140 com-
`prises a telephone service system of a financial institution
`and voicemail data 150 includes dynamic stock quote data
`that is obtained by the voicemail system 140 from another
`network. Voicemail data 150 may also include data that is
`generated on-the-fly. For example, stock quotes can be
`provided to voicemail system 140 in the form of text, such
`that during use voicemail system 140 generates a text-to-
`speech translation of the stock quotes on-the-fly.
`According to the present embodiment, voicemail system
`140 is responsive only to DTMF signals that are generated
`by pressing the keys on a keypad of a DTMF telephone
`device, such as telephone device 120. DTMF telephone
`signals are well known in the art of telephone communica-
`tions. According to this embodiment, voicemail system 140
`cannot recognize audio or vocal input that is generated by a
`user speaking into telephone device 120. However, voice-
`enabled user interface 110 of the invention, enables a user to
`access voicemail data 150 from the voicemail system 140 by
`speaking into a telephone device 120, as will be described
`herein.
`Voice-enabled user interface 110 contains various com-
`
`ponents that enable a user to access voicemail data 150 from
`voicemail system 140. These components are generally
`described in reference to a user state manager module 160,
`a voicemail state manager module 170, and a database 180.
`FIG. 2 shows some of the basic components that make up
`user state manager module 160. User authentication 210
`verifies the identity of a user accessing the voice-enabled
`user interface 110 by comparing the user account informa-
`tion to account information 220 that is stored in database
`
`180. The act of verifying the identity of the user may require
`the user to enter a password, personal identification number,
`an account number, or any other information that can be
`verified by user authentication 210.
`According to one embodiment, and upon authenticating
`the identity of the user, user authentication 210 connects the
`voice-enabled user interface to a telephone service system
`that contains data that
`is desired by the user, such as
`voicemail system 140. According to this embodiment, user
`account information 220 includes information that is used to
`
`connect to and access voicemail system 140. For example,
`user account information 220 can include a telephone num-
`ber for calling voicemail system 140, and account informa-
`tion and a password for accessing voicemail system 140.
`According to another embodiment, user account informa-
`tion 220 includes a telephone number for calling voicemail
`system 140, but does not include account information or a
`password that is necessary to access voicemail system 140.
`According to this embodiment, the user must provide the
`account information and password during use, after user
`authentication 210 has connected the voice-enabled user
`
`interface to the voicemail system 140.
`Speech recognition 230 interprets audio or voice data that
`is received from a telephone device, which is generated by
`a user speaking into the telephone device. Speech recogni-
`tion 230 can also interpret user input comprising telephone
`signals, such as DTMF signals. Techniques for implement-
`ing speech recognition 230 are well known in the respective
`art, and include, but are not limited to spectral analysis,
`dynamic time warping, neural networks, and recognition by
`discrete and continuous hidden Markov modeling.
`Translator 240 receives user input comprising audio or
`voice data and translates the data into DTMF signals that can
`be recognized by voicemail system 140. According to the
`
`IPRZO17-01039
`
`Unified EX1004 Page 9
`
`IPR2017-01039
`Unified EX1004 Page 9
`
`
`
`US 6,731,724 B2
`
`7
`present embodiment, voicemail system 140 does not recog-
`nize audio or voice data and is only responsive to DTMF
`signals. The accuracy of making a translation of user input
`into a DTMF signal is enhanced by comparing user input
`with anticipated user input according to data that is stored in
`template 232 of database 180. It should be noted that the
`translation of user input that is generated by translator 240
`is not a direct translation, but instead is a translation of the
`data stored in template 232 that most resembles the user
`input. Because template 232 has a finite number of accept-
`able inputs and responses, translator 240 is generally able to
`make an accurate translation of user input, which can vary
`from one user to another, by comparing the user input to the
`known data in template 232.
`A template, such as template 232, maps all of the menu
`states of a telephone menu system and all of the correspond-
`ing prompts and acceptable responses and inputs. Atechni-
`cian prepares only a single template for each telephone
`service system that is accessible to users using the voice-
`enabled user interface of the invention. Accordingly, a single
`template is used for making translations of user input from
`multiple users accessing the same telephone service system.
`Communication switch 250 controls the transmission of
`
`data through the voice-enabled user interface that is trans-
`mitted between a telephone device 120 and voicemail sys-
`tem 140. In one embodiment, two simplex channels 252, 254
`connect the voice-enabled user interface to the telephone
`device 120 and two simplex channels 256, 258 connect the
`voice-enabled user interface to the voicemail system 140,
`each set of simplex channels comprising an inbound and an
`outbound simplex channel. A simplex channel is simply a
`one-way communication channel. It should be appreciated
`that other types of communication channels can also be
`used, such as a half-duplex channel.
`According to this embodiment, communication switch
`250 synchronizes all communication over the simplex chan-
`nels 252-258. During use, the communication switch 250
`connects the inbound simplex channel 256, from the voice-
`mail system 140, to the outbound simplex channel 252 that
`is directed to the telephone device 120. This enables the user
`to hear transmissions of data directly from the voicemail
`system 140. The communication switch 250 also directs
`transmissions of data from inbound simplex channel 254 to
`speech recognition 230, while directing all outbound trans-
`missions of the voice-enabled user interface to voicemail
`
`system 140 through outbound simplex channel 258.
`The communication switch 250 also enables the voice-
`
`enabled user interface to interrupt the voicemail system 140,
`whenever appropriate, by disconnecting the inbound sim-
`plex channel 256 from the outbound simplex channel 252.
`One appropriate circumstance to interrupt
`the voicemail
`system 140 is to prevent the user from hearing prompts that
`the user does not need to respond to. For example, after user
`authentication 210 connects the voice-enabled user interface
`
`to voicemail system 140, the voicemail system 140 may
`transmit prompts for the user to enter a valid account number
`and password.
`the voice-enabled user interface
`In one embodiment,
`disconnects the inbound simplex channel 256 from the
`outbound simplex channel 252 while user authentication
`210 transmits the user’s account number and password to
`voicemail system 140 from database 180. According to this
`example,
`the communication switch 250 prevents the
`incoming transmission of data from the voicemail system
`140 from being transmitted over the outbound simplex
`channel 252 to the user.
`
`10
`
`15
`
`20
`
`25
`
`30
`
`35
`
`40
`
`45
`
`50
`
`55
`
`60
`
`65
`
`8
`Another appropriate circumstance for interrupting the
`voicemail system 140 is when the voice-enabled user inter-
`face is causing the voicemail system to jump over “in
`between” menu states, from a menu state to final menu state.
`This will be described later in reference to FIG. 6.
`
`State manager 260 executes user requests and performs
`various general control functions. State manager 260 also
`coordinates the connectivity and operation of other
`components, such as, for example enabling components 210,
`230, 240, and 250 to access database 180, and coordinating
`the transmission of data to and from components 210, 230,
`240, and 250 to simplex channels 252-258.
`FIG. 3 shows some of the basic components that make up
`voicemail state manager module 170, of FIG. 1. In one
`preferred embodiment, voicemail authentication 310 con-
`nects the voice-enabled user interface to voicemail system
`140 of FIGS. 1-2. According to this embodiment, voicemail
`authentication 310 retrieves account information 220 from
`database 180 and accesses voicemail system 140. Account
`information 220 can include a telephone number for calling
`voicemail system 140, and account voicemail information
`and password for accessing voicemail system 140.
`According to another embodiment, necessary account
`information 220 includes a telephone number for calling
`voicemail system 140, but does not include account infor-
`mation or a password for accessing voicemail system 140.
`According to this embodiment, a user must provide the
`account information and password duri