Who is the assignee on this patent?

Cirrus Logic Int Semiconductor Ltd

What technology area does this patent fall under?

Primary CPC classification G10L21/0208. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Oct 03 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Voice command triggered speech enhancement

US9779726B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9779726-B2
Application number	US-201415105882-A
Country	US
Kind code	B2
Filing date	Dec 17, 2014
Priority date	Dec 18, 2013
Publication date	Oct 3, 2017
Grant date	Oct 3, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Received data representing speech is stored, and a trigger detection block detects a presence of data representing a trigger phrase in the received data. In response, a first part of the stored data representing at least a part of the trigger phrase is supplied to an adaptive speech enhancement block, which is trained on the first part of the stored data to derive adapted parameters for the speech enhancement block. A second part of the stored data, overlapping with the first part of the stored data, is supplied to the adaptive speech enhancement block operating with said adapted parameters, to form enhanced stored data. A second trigger phrase detection block detects the presence of data representing the trigger phrase in the enhanced stored data. In response, enhanced speech data are output from the speech enhancement block for further processing, such as speech recognition.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method of processing received data representing speech, comprising: storing the received data; detecting a presence of data representing a trigger phrase in the received data; in response to said detecting, supplying a first part of the stored data representing at least a part of the trigger phrase to an adaptive speech enhancement block; training the speech enhancement block on the first part of the stored data to derive adapted parameters for the speech enhancement block; supplying a second part of the stored data to the adaptive speech enhancement block operating with said adapted parameters, to form enhanced stored data, wherein the second part of the stored data overlaps with the first part of the stored data; detecting the presence of data representing the trigger phrase in the enhanced stored data; and outputting enhanced speech data from the speech enhancement block for further processing, in response to detecting the presence of data representing the trigger phrase in the enhanced stored data; wherein the detecting the presence of data representing the trigger phrase in the received data is carried out by means of a first trigger phrase detection block; and wherein the detecting the presence of data representing the trigger phrase in the enhanced stored data is carried out by means of a second trigger phrase detection block, and wherein the second trigger phrase detection block operates with different detection criteria from the first trigger phrase detection block. 2. A method as claimed in claim 1 , comprising, in response to failing to detect the presence of data representing the trigger phrase in the enhanced stored data, resetting the first trigger phrase detection block. 3. A method as claimed in claim 1 , wherein the second trigger phrase detection block operates with more rigorous detection criteria than the first trigger phrase detection block. 4. A method as claimed in claim 1 , comprising: receiving and storing data from multiple microphones; supplying data received from a subset of said microphones to the first trigger phrase detection block for detecting the presence of data representing the trigger phrase in the data received from the subset of said microphones; in response to said detecting, supplying the first part of the stored data from said multiple microphones, representing at least a part of the trigger phrase, to the adaptive speech enhancement block; training the speech enhancement block on the first part of the stored data from said multiple microphones to derive adapted parameters for the speech enhancement block; and supplying the second part of the stored data from said multiple microphones to the adaptive speech enhancement block operating with said adapted parameters, to form said enhanced stored data. 5. A method as claimed in claim 4 , wherein the speech enhancement block is a beamformer. 6. A method as claimed in claim 1 , wherein the first part of the stored data is the data stored from a first defined starting point. 7. A method as claimed in claim 6 , wherein the second part of the stored data is the data stored from a second defined starting point, and the second defined starting point is later than the first defined starting point. 8. A method as claimed in claim 1 , comprising supplying the second part of the stored data to the speech enhancement block and outputting the enhanced speech data from the speech enhancement block at a higher rate than real time. 9. A method as claimed in claim 8 , comprising supplying the second part of the stored data to the speech enhancement block and outputting the enhanced speech data from the speech enhancement block at a higher rate than real time until the data being supplied is substantially time aligned with the data being stored. 10. A speech processor, comprising: an input, for receiving data representing speech; and a speech processing block, wherein the speech processing block is configured to perform a method of processing received data representing speech, comprising: storing the received data; detecting a presence of data representing a trigger phrase in the received data; in response to said detecting, supplying a first part of the stored data representing at least a part of the trigger phrase to an adaptive speech enhancement block; training the speech enhancement block on the first part of the stored data to derive adapted parameters for the speech enhancement block; supplying a second part of the stored data to the adaptive speech enhancement block operating with said adapted parameters, to form enhanced stored data, wherein the second part of the stored data overlaps with the first part of the stored data; detecting the presence of data representing the trigger phrase in the enhanced stored data; and outputting enhanced speech data from the speech enhancement block for further processing, in response to detecting the presence of data representing the trigger phrase in the enhanced stored data; wherein the detecting the presence of data representing the trigger phrase in the received data is carried out by means of a first trigger phrase detection block; and wherein the detecting the presence of data representing the trigger phrase in the enhanced stored data is carried out by means of a second trigger phrase detection block, and wherein the second trigger phrase detection block operates with different detection criteria from the first trigger phrase detection block. 11. A speech processor, comprising: an input, for receiving data representing speech; and an output, for connection to a speech processing block, wherein the processing block is configured to perform a method of processing received data representing speech, comprising: storing the received data; detecting a presence of data representing a trigger phrase in the received data; in response to said detecting, supplying a first part of the stored data representing at least a part of the trigger phrase to an adaptive speech enhancement block; training the speech enhancement block on the first part of the stored data to derive adapted parameters for the speech enhancement block; supplying a second part of the stored data to the adaptive speech enhancement block operating with said adapted parameters, to form enhanced stored data, wherein the second part of the stored data overlaps with the first part of the stored data; detecting the presence of data representing the trigger phrase in the enhanced stored data; and outputting enhanced speech data from the speech enhancement block for further processing, in response to detecting the presence of data representing the trigger phrase in the enhanced stored data; wherein the detecting the presence of data representing the trigger phrase in the received data is carried out by means of a first trigger phrase detection block; and wherein the detecting the presence of data representing the trigger phrase in the enhanced stored data is carried out by means of a second trigger phrase detection block, and wherein the second trigger phrase detection block operates with different detection criteria from the first trigger phrase detection block. 12. A mobile device, comprising a speech processor, wherein the speech processor is configured to perform a method of processing received data representing speech, comprising: storing received data; detecting a presence of data representing a trigger phrase in the received data; in response to said detecting, supplying a first part of the stored data representing at least a part of the trigger phrase to an adaptive speech enhancement block; training the speech enhance

Assignees

Cirrus Logic Int Semiconductor Ltd

Inventors

Classifications

G10L2021/02166
Microphone arrays; Beamforming · CPC title
G10L21/0208Primary
Noise filtering · CPC title
G10L21/02Primary
Speech enhancement, e.g. noise reduction or echo cancellation (reducing echo effects in line transmission systems H04B3/20; echo suppression in hands-free telephones H04M9/08) · CPC title
G10L17/24
the user being prompted to utter a password or a predefined phrase · CPC title
G10L15/22Primary
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

Patent family

Related publications grouped by family.

View patent family 50070964

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9779726B2 cover?: Received data representing speech is stored, and a trigger detection block detects a presence of data representing a trigger phrase in the received data. In response, a first part of the stored data representing at least a part of the trigger phrase is supplied to an adaptive speech enhancement block, which is trained on the first part of the stored data to derive adapted parameters for the spe…
Who is the assignee on this patent?: Cirrus Logic Int Semiconductor Ltd
What technology area does this patent fall under?: Primary CPC classification G10L21/0208. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Oct 03 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).