`(12) Patent Application Publication (10) Pub. No.: US 2003/0182132 A1
`(43) Pub. Date:
`Sep. 25, 2003
`Niemoeller
`
`US 20030182132A1
`
`(54) VOICE-CONTROLLED ARRANGEMENT
`AND METHOD FOR VOICE DATA ENTRY
`AND VOICE RECOGNITION
`
`Publication Classi?cation
`
`(76) Inventor: Meinrad Niemoeller, Holzkirchen (DE)
`
`(51) Int. Cl? ................................................... .. G10L 11/00
`(52) US. Cl. ............................................................ ..704/275
`
`Correspondence Address:
`BELL, BOYD & LLOYD, LLC
`P. O. BOX 1135
`CHICAGO, IL 60690-1135 (US)
`
`(21) Appl. No.:
`
`10/363,121
`
`(22) PCT Filed:
`
`Aug. 16,2001
`
`(86) PCT No.:
`
`PCT/EP01/09475
`
`(30)
`
`Foreign App?cation Priority Data
`
`Aug. 31, 2000 (EP) ...................................... .. 001188952
`
`(57)
`
`ABSTRACT
`
`The invention relates to a voice-controlled arrangement (1)
`comprising a plurality of devices to be controlled (3 to 9)
`and a mobile voice data entry unit (11) Which is connected
`to said devices by a Wireless communication link. At least
`some of the devices each have a device vocabulary memory
`_
`_
`_
`(3a to 9a) and a vocabulary transmission umt (3b to 9b), and
`the voice data entry unit has selection means for selecting
`the vocabularies to he loaded according to the route desti
`nation.
`
`J‘
`5
`
`0
`I:
`5G~W~ 5c
`c
`5a
`_5b
`
`3
`k
`I
`
`\ x S If
`1
`‘
`1
`I‘! “a /
`
`v.
`
`f
`
`7
`R
`
`3dvl,
`
`C30 12X 113a 11?,
`
`38%“
`
`Van
`
`_
`
`ooooooooo
`
`,
`~9
`
`“by
`
`-
`
`l». 1~11d
`
`14f "4
`\410
`
`9Cv~~
`
`9b _
`
`~-9d
`
`V9
`a
`
`GOOGLE EXHIBIT 1012
`
`Page 1 of 10
`
`
`
`H0..uaCHbUPH0.UaC.__II__MAtHBtaP
`
`Sep. 25, 2003 Sheet 1 of 2
`
`US 2003/0182132 A1
`
`Page 2 of 10
`
`
`
`Patent Application Publication Sep. 25, 2003 Sheet 2 0f 2
`
`US 2003/0182132 A1
`
`FIG 2
`
`1320
`L _133
`
`‘
`
`1
`NW 4V7
`
`15/" HI
`
`~11b
`
`[J
`110
`11a
`
`Page 3 of 10
`
`
`
`US 2003/0182132 A1
`
`Sep. 25, 2003
`
`VOICE-CONTROLLED ARRANGEMENT AND
`METHOD FOR VOICE DATA ENTRY AND VOICE
`RECOGNITION
`
`[0001] The invention relates to a voice-controlled arrange
`ment comprising a plurality of devices according to the
`preamble of claim 1, and to a method for inputting and
`recogniZing a voice, Which can be applied in such an
`arrangement.
`[0002] Since voice recognition systems have increasingly
`developed into a standard component in poWerful computers
`for professional and private use, including PCs and Note
`books in the medium and loWer price ranges, more and more
`Work is being carried out on the possibilities of applying
`such systems in devices Which are used in everyday life.
`Electronic devices such as mobile phones, cordless phones,
`PDAs and remote controls for audio systems and video
`systems etc. usually have an input keypad Which comprises
`at least one numerical input array and a series of functional
`keys.
`[0003] Some of these devices—in particular of course the
`various kinds of telephones, but also increasingly remote
`controls and other devices—are increasingly equipped With
`microphones and possibly also headphones for inputting and
`outputting voice. Devices of this type (for example some
`types of mobile phones) in Which a simple voice recognition
`procedure is implemented for control functions on the
`device itself are already knoWn. One example of this is the
`voice-controlled setting up of links by a voice input of a
`name into a mobile phone, said name being stored in an
`electronic telephone directory of the telephone. Further
`more, primitive to simple voice controls are also knoWn for
`other devices Which are used in everyday life, for example
`in remote controls for audio systems or lighting systems. All
`knoWn devices of this type each have a separate dedicated
`voice recognition system.
`
`[0004] It is possible to envisage a development Which Will
`entail an increasing number of technical devices and systems
`from everyday life, in particular in the domestic sphere and
`in motor vehicles, being equipped With their oWn respective
`voice recognition systems. As such systems are relatively
`complex in terms of hardWare and softWare, and thus
`expensive, if they are to provide an acceptable level of
`operator convenience and sufficient recognition reliability,
`this development is a fundamental factor Which drives costs
`higher and is thus Welcomed by consumers only to a limited
`degree. For this reason, the primary goal is to reduce the
`expenditure on hardWare and softWare further in order to be
`able to make available the most cost-effective solutions
`possible.
`[0005] Arrangements have already been proposed in
`Which a plurality of technical devices are assigned an
`individual voice input unit via Which various functions of
`these devices are controlled by voice control. The control
`information is preferably transmitted here in a Wire-free
`fashion to terminals (?xed or even mobile). HoWever, the
`technical problem arises here that the voice input unit has to
`store a very large vocabulary for the voice recognition in
`order to be able to control various terminals. HoWever,
`handling a large vocabulary involves adverse effects on the
`speed and precision of the recognition processes. In addi
`tion, such an arrangement has the disadvantage that it is not
`readily possible to make later updates With additional
`
`devices, Which may not have been envisaged When the voice
`input unit Was implemented. Last but not least, such a
`solution is still alWays very expensive, in particular due to
`the high memory requirements oWing to the very large
`vocabulary.
`
`[0006] In a German patent application Which Was not
`published before the priority date and Which originates from
`the applicant, a voice-controlled arrangement comprising a
`plurality of devices to be controlled and a mobile voice input
`unit Which is connected to the devices via an, in particular,
`Wire-free telecommunications link is disclosed in Which a
`device-speci?c vocabulary, but no processing means for the
`voice recognition, are respectively provided in the indi
`vidual devices of the arrangement. On the other hand, the
`processing components of a voice recognition system are
`implemented in the voice input unit (in addition to the voice
`input means).
`[0007] At least some of the devices each have a device
`vocabulary memory for storing a device-speci?c vocabulary
`and a vocabulary transmission unit for transmitting the
`stored vocabulary to the voice input unit. In contrast, the
`voice input unit comprises a vocabulary reception unit for
`receiving the vocabulary transmitted by a device or the
`vocabularies transmitted by devices. If the voice input unit
`is placed in the spatial vicinity of one or more devices, so
`that a telecommunications link is set up betWeen the voice
`input unit and devices, the devices transmit their vocabu
`laries to the voice input unit Which buffers them. As soon as
`the telecommunications link betWeen one or more devices
`and the voice input unit is broken, for example if the spatial
`distance becomes too large, the voice input unit can reject
`one or more buffered vocabularies again. The voice input
`unit accordingly administers the vocabularies of the termi
`nals in a dynamic fashion.
`
`[0008] The advantage of this arrangement is principally
`the fact that means With a relatively small storage capacity
`are sufficient to store the vocabularies in the voice input unit
`as, oWing to the spatial separation of the vocabularies from
`the actual voice recognition capacity, the vocabularies do not
`need to be continuously stored in the voice input unit. This
`also increases the recognition rate in the voice input unit as
`feWer vocabularies are to be processed. HoWever, When
`there is a plurality of spatially closely adjacent devices, in
`particular if their transmission ranges overlap, the voice
`input unit may nevertheless have to store and process a large
`number of vocabularies or may not be able to serve all the
`terminals given a limited storage capacity. Particularly the
`latter case is inconvenient for a user as he has no in?uence
`on Which vocabularies are loaded into the voice input unit by
`terminals and Which are rejected. Even if the transmission
`ranges of the terminals are comparatively small—for
`example have diameters of only a feW meters—it is possible,
`particularly given a concentration of a large number of
`different terminals in a small space as in the domestic sphere
`or in an office, for the user to be able to carry out voice
`control on only some of these terminals oWing to the
`abovementioned problems.
`
`[0009] The invention is therefore based on the object of
`proposing an arrangement of this type Which in particular
`avoids the abovementioned problems and especially devel
`ops the selection of the terminals to be controlled by voice.
`
`Page 4 of 10
`
`
`
`US 2003/0182132 A1
`
`Sep. 25, 2003
`
`The arrangement is also intended to be distinguished by loW
`costs and an ef?cient method for inputting and recognizing
`voice.
`
`[0010] This object is achieved by means of an arrange
`ment having the features of patent claim 1 and by means of
`a method having the features of patent claim 13.
`
`[0011] The invention develops the voice-controlled
`arrangement mentioned at the beginning having a plurality
`of devices and a mobile voice input unit connected to the
`devices via a Wire-free telecommunications link in particular
`by virtue of the fact that selection means for selecting
`vocabularies to be loaded into the voice input unit are
`provided in the voice input unit. For this purpose, the
`selection means evaluate a directional information item of
`received signals Which have been transmitted by the devices.
`The principle applied here originates from human commu
`nication: one person communicates With another by direct
`ing his attention at the person. Conversations in the sur
`roundings of the tWo communicating people are “blanked
`out”. Other people to Whom the communicating people do
`not direct their attention therefore also feel that they are not
`being addressed.
`
`[0012] The invention ensures that only speci?c vocabu
`laries are loaded by devices Which have been selected by the
`selection means. As a result, the recognition rate is signi?
`cantly improved With spatially closely adjacent terminals as,
`oWing to the directionally dependent selection, feWer
`vocabularies are loaded into the voice input unit, and there
`fore feWer vocabularies have to be processed. For example,
`radio or else infrared transmission links are possible as
`Wire-free transmission methods betWeen the devices and the
`voice input unit.
`
`[0013] The selection means preferably comprise a detec
`tor, in particular an antenna, With a directional characteristic.
`The directionally dependent selection takes place by orient
`ing the detector With the devices to be controlled as the level
`of a received signal of a device changes With the orientation
`of the detector With respect to a device transmitting the
`signal. In the case of an infrared transmission link, the
`selection means comprise an infrared detector Which has a
`limited detection range, for eXample by virtue of a lens
`placed in front of it, so that infrared signals outside the
`detection range do not cause a corresponding vocabulary to
`be loaded.
`
`[0014] In order to be able to evaluate the level of received
`signals, the voice input unit preferably has a level evaluation
`and control device. The latter determines the level of at least
`one received signal and controls, as a function thereof, the
`loading of a vocabulary into the vocabulary buffer or buffers
`by means of the vocabulary reception unit, said vocabulary
`being transmitted by means of the signal. The level evalu
`ation and control device is preferably designed in such a Way
`that it does not load a vocabulary transmitted by a received
`signal until a speci?c level is exceeded.
`
`[0015] In one preferred embodiment, a plurality of
`vocabularies of devices are loaded simultaneously into the
`voice input unit. The level evaluation and control device is
`eXpediently constructed in this embodiment in such a Way
`that the vocabulary of a further device is loaded into the
`voice input unit and replaces a vocabulary loaded there as
`soon as the received signal of the further device eXceeds a
`
`prede?ned level and/or the levels of the signal Which trans
`mits the vocabulary to be replaced and/or is assigned to it.
`Aplurality of vocabularies are thus stored in the voice input
`unit so that even a corresponding multiplicity of devices can
`be controlled. HoWever, this gives rise to a corresponding
`need for storage in the voice input unit.
`
`[0016] In one development, precisely one vocabulary of a
`device, Which is replaced by the vocabulary of another
`device, can then be loaded into the voice input unit as soon
`as a received signal of the other device eXceeds a prede?ned
`level and/or the level of the signal Which transmits the
`vocabulary to be replaced and/or is assigned thereto. There
`fore, as soon as the voice input unit is directed to another
`device so that its transmitted signal ful?ls the criteria for
`loading into the voice input unit, the vocabulary Which has
`already been loaded is replaced. The advantage of this
`embodiment is in particular the loW storage requirement in
`the voice input unit as only one vocabulary is ever loaded.
`
`[0017] In the preceding embodiment, the level evaluation
`and control device is eXpediently also designed to allocate
`different priorities to the vocabularies loaded into the voice
`input unit. If a neW vocabulary is loaded, the vocabulary to
`be replaced can be determined by reference to the priorities.
`A vocabulary to be loaded Will usually replace the loaded
`vocabulary With the loWest priority. The priorities can be
`allocated as a function of various criteria such as for
`eXample prioritiZation of the devices, the frequency of
`control of the devices, the time for Which the vocabularies
`remain in the voice input unit, etc. The prioritiZation Will
`appropriately be allocated as a function of the frequency
`With Which the devices are controlled, i.e. devices Which are
`controlled very often have a higher priority than devices
`Which, in comparison, are controlled rarely. HoWever, the
`assignment of priorities preferably takes place as a function
`of the conditions of the levels of the signals Which transmit
`the vocabularies and/or are assigned to them. A relatively
`high level brings about a higher priority than a relatively loW
`level here.
`
`[0018] In one particularly preferred embodiment, the level
`evaluation and control device generates at least one control
`signal Which can control or in?uence the recognition func
`tion of the voice recognition stage, speci?cally as a function
`of the evaluated level of a received signal. The in?uencing
`or control is advantageously carried out by raising or loW
`ering the probabilities of the occurrence of a Word or a
`plurality of Words and/or the probabilities of a boundary
`betWeen Words of a vocabulary Which is in particular
`proportional to the level.
`[0019] By in?uencing the probabilities during recognition,
`use is made of the fact that a plurality of terminals have the
`same instructions and, When such an instruction is input, the
`probability is used to decide Which device is to be con
`trolled. In other Words, various devices can be controlled
`With identical instructions, Which of the devices is addressed
`being determined by the user by the orientation of the voice
`input unit.
`
`[0020] The communication betWeen the voice input unit
`and the devices preferably takes place according to the
`Bluetooth standard. For this purpose, the vocabulary trans
`mission unit or vocabulary transmission units and vocabu
`lary reception unit are embodied as a radio transceiver unit
`according to the Bluetooth standard. The Bluetooth standard
`
`Page 5 of 10
`
`
`
`US 2003/0182132 A1
`
`Sep. 25, 2003
`
`is particularly suitable for this purpose as it is provided in
`particular for transmitting control instructions (for example
`betWeen a PC and a printer). Particularly in the present case,
`instructions or vocabularies are mainly exchanged betWeen
`the voice input unit and the devices. Higher level transmis
`sion protocols and description standards such as, for
`example, WAP or XML can also be used as standards for
`transmitting the vocabularies in the system. In an alternative
`preferred embodiment, the vocabulary transmission unit or
`vocabulary transmission units and vocabulary reception unit
`may be embodied as an infrared transceiver unit.
`
`[0021] A typical embodiment of the voice-controlled
`arrangement functions in such a Way that, in order to carry
`out a directionally dependent selection of signals Which are
`transmitted by devices, the detector is directed at speci?c
`devices so that only the signals of these devices are received.
`Then, the levels of the received signals are determined in the
`voice input unit by means of the level evaluation and control
`device. Depending on hoW the voice input unit—in the case
`of a radio link, the antenna With a directional characteris
`tic—is oriented With respect to the devices, some of the
`received signals have a greater ?eld strength and thus a
`higher level than the other signals. By reference to the
`speci?c levels of the received signals, the level evaluation
`and control device controls the vocabulary reception unit in
`such a Way that only vocabularies of devices Whose signals
`have been determined by the level evaluation and control
`device to be suf?cient, ie in particular are above a pre
`de?ned threshold level, are received. Even if the voice input
`unit, to be more precise the detector, is located in the
`transmission or radio range of a plurality of devices, as a
`result of this only the vocabularies of some of the devices are
`loaded. The recognition rate in the voice input unit therefore
`does not drop if the voice input unit is in the transmission or
`radio range of a large number of devices and accordingly a
`large number of vocabularies Would be loaded if there Were
`no directionally dependent selection according to the inven
`tion.
`
`[0022] Avocabulary contains instruction Words or phrases
`in orthographic or phonetic transcription and possibly addi
`tional information for the voice recognition. The vocabulary
`is loaded into the voice recognition system on the voice
`input unit after suitable conversion, speci?cally advanta
`geously into a vocabulary buffer of said system, Which
`buffer is preferably connected betWeen the vocabulary
`reception unit and the voice recognition stage. The magni
`tude of the vocabulary buffer, Which is preferably embodied
`as a volatile memory (for example DRAM, SRAM, etc.), is
`expediently adapted to the number of vocabularies to be
`processed or the number of devices to be controlled simul
`taneously. In order to make available a cheap voice input
`unit, a saving can be made in terms of the vocabulary buffer
`by con?guring the selection means for evaluating and con
`trolling levels in such a Way that, for example, at most tWo
`vocabularies for controlling tWo devices can be loaded
`simultaneously into the voice input unit. It Would also be
`conceivable to have a programmable embodiment of the
`selection means for evaluating levels, Which means can be
`correspondingly set to control a plurality of devices When
`the vocabulary buffer is enlarged.
`
`[0023] The selection means can have in particular an
`arithmetic unit Which, from the level of a received signal,
`calculates the distance of a device transmitting the signal
`
`from the voice input unit. In addition, a threshold value
`corresponding to a prede?ned distance is stored in a thresh
`old value memory. The calculated distance is then compared
`With the stored threshold value by means of a comparison
`device. Depending on the comparison result, in particular
`the vocabulary reception unit and the voice recognition stage
`are enabled or disabled. For this purpose, the comparison
`device generates a disable/enable signal. The criteria for
`enabling and disabling can be prede?ned by means of the
`threshold value Which, for example, can also be adapted by
`the user by means of programming or setting operations. For
`example, the user could prede?ne that only devices at a
`distance of 2 m are enabled for the voice input unit. In
`contrast, devices further aWay should be disabled.
`
`[0024] In summary, the voice-controlled arrangement
`according to the invention provides the advantages that
`
`[0025] the recognition in the case of spatially close
`devices Which compete With one another is
`improved,
`
`[0026] the vocabulary to be processed in the voice
`input unit is optimiZed not only in terms of its siZe,
`but also in terms of probabilities,
`
`[0027] the vocabularies of the various devices do not
`have to be matched to one another, i.e. may contain
`identical instructions, and
`
`[0028] a user can control different devices With the
`same instructions, and merely by the orientation of
`the voice input unit a user can determine Which of the
`devices is to be addressed.
`
`[0029] By using directionally dependent information of
`received signals, the overall vocabulary Which is to be stored
`in the voice input unit can be kept at a loW level overall. As
`a result, the voice modeling of the voice recognition stage
`can also be optimiZed. At the same time, the problem of the
`possible overlapping of vocabularies is solved. The arrange
`ment according to the invention can advantageously be used
`in Wire-free telecommunications links With a short range, for
`example in Bluetooth systems or else infrared systems.
`[0030] Advantages and expedient aspects of the invention
`also emerge from the dependent claims and the folloWing
`description of a preferred exemplary embodiment by refer
`ence to the draWing, in Which
`
`[0031] FIG. 1 shoWs a sketch-like functional block dia
`gram of a device con?guration composed of a plurality of
`voice-controlled devices, and
`
`[0032] FIG. 2 shoWs a functional block diagram of an
`exemplary embodiment of a voice input unit.
`
`[0033] The device con?guration 1 shoWn in FIG. 1 in a
`sketch-like functional block diagram comprises a plurality
`of voice-controlled devices, speci?cally a television set 3, an
`audio system 5, a lighting unit 7 and a cooker hob 9 With a
`voice input unit 11 (referred to beloW as mobile voice
`control terminal).
`[0034] The devices 3 to 9 to be controlled each have a
`device vocabulary memory 3a to 9a, a vocabulary transmis
`sion unit 3b to 9b operating according to the Bluetooth
`standard, a control instruction reception unit 3c to 9c and a
`microcontroller 3a' to 9c.
`
`Page 6 of 10
`
`
`
`US 2003/0182132 A1
`
`Sep. 25, 2003
`
`[0035] The mobile voice control terminal 11 has a voice
`transmitter 11a, a display unit 11b, a voice recognition stage
`11c Which is connected to the voice transmitter 11a and to
`Which a vocabulary buffer 11a' is assigned, a vocabulary
`reception unit 116, a control instruction transmission unit
`11a, an antenna 12 With directional characteristics and a
`level evaluation and control device 13.
`
`[0036] The various transmission and reception units of the
`devices 3 to 9 and of the voice control terminal 11 are
`embodied—in a manner knoWn per se—such that their range
`is matched to the character of the device and to the custom
`ary spatial relations betWeen the device and user—for
`example the range of the vocabulary transmission unit 9b of
`the cooker hob 9 is signi?cantly smaller than that of the
`vocabulary transmission unit 7b of the illumination control
`unit 7.
`
`[0037] In the vocabulary buffer 11d of the voice control
`terminal 11, it is possible to implement a basic vocabulary
`of control instructions and additional terms Which ensures
`that the entire system and speci?c emergency or protection
`functions are activated in every situation of use. The device
`vocabulary memories contain special vocabularies for con
`trolling the respective device. After their transmission, the
`voice recognition stage 11c can access them and the user can
`utter control instructions for the respective device. These
`instructions are transmitted by the control instruction trans
`mission unit 11f of the voice control terminal 11 to the
`control instruction reception units 3c to 9c and converted
`into control signals by the respective microcontroller 3a' to
`9d of the devices 3 to 9.
`
`[0038] If the voice control terminal 11 is located in the
`radio area of the devices 3 to 9, ie there are Wire-free
`telecommunications links betWeen the voice control termi
`nal 11 and the devices 3 to 9, the devices 3a' to 9d transmit
`their vocabularies from the respective device vocabulary
`memories 3a to 9a to the voice control terminal 11. The
`latter receives the corresponding signals via its antenna 12
`Which has a directional characteristic so that the ?eld
`strength of the signals transmitted by the devices 3 and 5,
`toWard Which the voice control terminal 11, in particular its
`antenna 12, is directed, is greater than the ?eld strength of
`the signals transmitted by the devices 7 and 9.
`
`[0039] The level evaluation and control device 13 deter
`mines the level from the ?eld strength of all the received
`signals by means of an amplitude measurement of the output
`signals corresponding to the received signals at an antenna
`booster connected doWnstream of the antenna 12. The
`corresponding digitiZed output signals can then be further
`processed by means of a microcontroller in the voice control
`terminal 11. Which of the vocabularies corresponding to the
`signals are to be loaded into the vocabulary buffer 11d via
`the vocabulary reception unit 116 is calculated by an arith
`metic unit 13a of the level evaluation and control device
`from the output signals of the antenna booster.
`
`[0040] In the present case, the arithmetic unit 13a deter
`mines that the ?eld strength of the signals received by the
`devices 3 and 5 is greater than the ?eld strength of the
`signals received by the devices 7 and 9, and consequently
`controls the vocabulary reception unit 116 and the vocabu
`lary buffer 11a' in such a Way that the vocabularies of the
`devices 3 and 5 are received and loaded. In addition, the
`level evaluation and control device 13 controls the voice
`
`recognition stage 11c so that the latter interprets the received
`vocabularies. The ?eld strength of the received signals of the
`devices 3 to 9 is continuously measured. By reference to the
`measurement results, the arithmetic unit 13a of the level
`evaluation and control device 13 determines a control signal
`14 Which is transmitted to the voice recognition stage 11c
`and raises the probabilities of the occurrence of one Word or
`a plurality of Words and/or probabilities of boundaries
`betWeen Words of the respective vocabulary (if the ?eld
`strength of the received signal increases) in proportion to the
`measured ?eld strength of a reception signal, or reduces
`them (if the ?eld strength of the received signal decreases).
`The voice recognition rate is thus in?uenced by means of the
`control signal 14 through the orientation of the voice control
`terminal 11 With respect to the devices 3 to 9.
`
`[0041] If the voice control terminal 11 is directed at the
`cooker hob 9, the level evaluation and control device 13
`determines an increase in the ?eld strength of the signal
`Which has been transmitted by the cooker hob 9, and it
`decides ?rstly Whether the vocabulary of the cooker hob 9 is
`received and loaded into the vocabulary buffer 11d via the
`vocabulary reception unit 116. At the same time, the level
`evaluation and control device 13 decides Which of the
`vocabularies already stored in the vocabulary buffer 11a' is
`to be rejected. This is usually the vocabulary of the device
`Which transmits the signal With the loWest ?eld strength or
`Whose signal is no longer received at all.
`
`[0042] FIG. 2 shoWs, by means of a functional block
`circuit diagram, the internal structure of the voice control
`terminal 11 and in particular the Wiring of the essential
`function blocks.
`
`[0043] A signal Which is received via the antenna 12 With
`a directional characteristic is fed to a transceiver 16, doWn
`stream of Which on the one hand a reception ampli?er 17 and
`on the other hand the vocabulary reception unit 116 are
`connected. Asignal Which is received via the antenna 12 and
`conditioned by the transceiver 16 is fed to the level evalu
`ation and control device 13. OWing to the directional char
`acteristic of the antenna, only signals which 116 in the
`“directed” reception region of the antenna are received. A
`subset of signals Which lie in the reception range of the
`antenna is thus selected from a multiplicity of signals by
`means of the antenna. The level evaluation and control
`device 13 comprises the arithmetic unit 13a, a comparison
`device 13c as Well as a threshold value memory 13b. From
`the ?eld strength of the received signal, the arithmetic unit
`13a calculates the distance from a device transmitting the
`signal. The supplied signal is then compared, by means of
`the comparison device 13c, With a (threshold) value Which
`is stored in the threshold value memory 13b and corresponds
`to a prede?ned distance. As a result, the signals Which are
`received via the antenna are selected once more as a function
`of the distance of their sources.
`
`[0044] Depending on the comparison, at least one disable/
`enable signal 15 is formed Which is fed to the vocabulary
`reception unit 116, to the vocabulary buffer 11d and to the
`voice recognition stage 11c and disables or enables it. It is
`enabled if the signal fed to the level evaluation and control
`device 13 is above the value stored in the threshold value
`memory 13b, and otherWise disabling takes place. If the
`abovementioned units are disabled, the vocabulary of the
`device Which has sent the signal cannot be loaded. In this
`
`Page 7 of 10
`
`
`
`US 2003/0182132 A1
`
`Sep. 25, 2003
`
`case, the device is outside the range for voice control or the
`reception range covered by the antenna 12.
`[0045] The arithmetic unit 13a is also used to generate the
`threshold value. For this purpose, the signal at the output of
`the reception ampli?er 11 is fed to the arithmetic unit 13a.
`The latter can compare the supplied signal internally With
`the calculated and current threshold value, and if appropriate
`form a neW threshold value from the signal and store said
`threshold value in the threshold value memory 13b. The
`direct feeding of the signal also serves to generate a control
`signal 14 Which is used by the voice recognition stage for
`setting the voice recognition. Depending on the ?eld
`strength of a received signal, the arithmetic unit 13a calcu
`lates hoW the probabilities of the occurrence of a Word or a
`plurality of Words and/or probabilities of boundaries
`betWeen Words are to be in?uenced.
`[0046] The folloWing description of a typical constellation
`Will serve for explanatory purposes: a subscriber moves
`aWay from a device Which is to be controlled and Whose
`vocabulary is loaded into the voice control terminal 11, or
`sWivels the voice control terminal 11 in such a Way that the
`signal transmitted by the device is received more Weakly by
`the antenna With a directional characteristic. As a Whole, the
`reception ?eld strength of the signal Which is output by the
`device is reduced at the voice control terminal 11. The signal
`is hoWever still received via the antenna 12 and fed to the
`arithmetic unit 13a via the transceiver 16 and the reception
`ampli?er 17. Said arithmetic unit 13a calculates, for
`example, the ?eld strength from the signal level and detects
`that said ?eld strength is Weaker than before (but larger than
`the threshold value as otherWise the corresponding vocabu
`lary Would be removed from the vocabulary buffer in favor
`of another vocabulary). From the difference betWeen the
`current ?eld strength and the previous ?eld strength, the
`arithmetic unit 13a then calculates the control signal 14
`Which reduces, in the voice recognition stage, the probabili
`ties of the occurrence of a Word or a plurality of Words
`and/or probabilities of boundaries betWeen Words of the
`vocabulary of the device in proportion to the difference
`(conversely there can also be a rise if the ?eld strength has
`become greater).
`[0047] Aparticularly advantageous implementation of the
`voice control terminal takes the form of a mobile phone
`Whose voice input facility and-computing poWer can be
`used, at least in modern devices, perfectly Well for the voice
`control of other devices. In a mobile phone, there are usually
`