Voice command triggered speech enhancement

US10319372B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10319372-B2
Application numberUS-201715688380-A
CountryUS
Kind codeB2
Filing dateAug 28, 2017
Priority dateDec 18, 2013
Publication dateJun 11, 2019
Grant dateJun 11, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Received data representing speech is stored, and a trigger detection block detects a presence of data representing a trigger phrase in the received data. In response, a first part of the stored data representing at least a part of the trigger phrase is supplied to an adaptive speech enhancement block, which is trained on the first part of the stored data to derive adapted parameters for the speech enhancement block. A second part of the stored data, overlapping with the first part of the stored data, is supplied to the adaptive speech enhancement block operating with said adapted parameters, to form enhanced stored data. A second trigger phrase detection block detects the presence of data representing the trigger phrase in the enhanced stored data. In response, enhanced speech data are output from the speech enhancement block for further processing, such as speech recognition.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method of processing received data representing speech, comprising: on detecting that a segment of the received data represents a trigger phrase: selecting a part of the received data representing the trigger phrase, wherein the selected part of the received data contains speech data; training an adaptive speech enhancement block on he selected part of the received data to derive parameters for the adaptive speech enhancement block; passing said segment of received data through said adaptive speech enhancement block using said derived parameters to generate an enhanced segment of data; and attempting to detect said trigger phrase in the enhanced segment of data. 2. A method as claimed in claim 1 , comprising, if unsuccessful in attempting to detect said trigger phrase in the enhanced segment of data, resetting a detector for detecting that a segment of the received data represents a trigger phrase. 3. A method as claimed in claim 1 , comprising, if successful in attempting to detect said trigger phrase in the enhanced segment of data, passing further received data through the adaptive speech enhancement block for further downstream signal processing. 4. A method as claimed in claim 1 , comprising attempting to detect said trigger phrase in the enhanced segment of data using different detection criteria from those used in detecting whether the segment of the received data represents a trigger phrase. 5. A method as claimed in claim 4 , comprising attempting to detect said trigger phrase in the enhanced segment of data using more rigorous detection criteria from those used in detecting whether the segment of the received data represents a trigger phrase. 6. A method as claimed in claim 1 , comprising: receiving and storing data from multiple microphones; detecting that the segment of the received data represents a trigger phrase, based on data received from a subset of said microphones; training the adaptive speech enhancement block on at least a part of said segment of the received data received from said multiple microphones to derive said adapted parameters for the speech enhancement block; and passing said segment of the received data received from said multiple microphones through said adaptive speech enhancement block using said derived parameters to generate the enhanced segment of data. 7. A method as claimed in claim 6 , wherein the speech enhancement block is a beamformer. 8. A method as claimed in claim 1 , comprising passing said segment of received data through said adaptive speech enhancement block and generating the enhanced segment of data at a higher rate than real time. 9. A speech processor, comprising: an input, for receiving data representing speech; and a speech processing block, wherein the speech processing block is configured to perform a method comprising: on detecting that a segment of the received data represents a trigger phrase: selecting a part of the received data representing the trigger phrase, wherein the selected part of the received data contains speech data; training an adaptive speech enhancement block on the selected part of the received data to derive parameters for the adaptive speech enhancement block; passing said segment of received data through said adaptive speech enhancement block using said derived parameters to generate an enhanced segment of data; and attempting to detect said trigger phrase in the enhanced segment of data. 10. A mobile device, comprising a speech processor as claimed in claim 9 . 11. A speech processor, comprising: an input, for receiving data representing speech; and an output, for connection to a speech processing block, wherein the speech processing block is configured to perform a method comprising: on detecting that a segment of the received data represents a trigger phrase: selecting a part of the received data representing the trigger phrase, wherein the selected part of the received data contains speech data; training an adaptive speech enhancement block on the selected part of the received data to derive parameters for the adaptive speech enhancement block; passing said segment of received data through said adaptive speech enhancement block using said derived parameters to generate an enhanced segment of data; and attempting to detect said trigger phrase in the enhanced segment of data. 12. A mobile device, comprising a speech processor as claimed in claim 11 . 13. A computer program product, comprising computer readable code embodied in non-transitory computer-readable media, for causing a processing device to perform a method comprising: selecting a part of the received data representing the trigger phrase, wherein the selected part of the received data contains speech data; training an adaptive speech enhancement block on the selected part of the received data to derive parameters for the adaptive speech enhancement block; passing said segment of received data through said adaptive speech enhancement block using said derived parameters to generate an enhanced segment of data; and attempting to detect said trigger phrase in the enhanced segment of data.

Assignees

Inventors

Classifications

  • Microphone arrays; Beamforming · CPC title

  • Noise filtering · CPC title

  • G10L21/02Primary

    Speech enhancement, e.g. noise reduction or echo cancellation (reducing echo effects in line transmission systems H04B3/20; echo suppression in hands-free telephones H04M9/08) · CPC title

  • the user being prompted to utter a password or a predefined phrase · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10319372B2 cover?
Received data representing speech is stored, and a trigger detection block detects a presence of data representing a trigger phrase in the received data. In response, a first part of the stored data representing at least a part of the trigger phrase is supplied to an adaptive speech enhancement block, which is trained on the first part of the stored data to derive adapted parameters for the spe…
Who is the assignee on this patent?
Cirrus Logic Int Semiconductor Ltd, Cirrus Logic Inc
What technology area does this patent fall under?
Primary CPC classification G10L21/0208. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 11 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).