System and method for speech enhancement in multisource environments

US10482878B2 · US · B2

Patent metadata
Publication number: US-10482878-B2
Application number: US-201715831808-A
Country: US
Kind code: B2
Filing date: Dec 5, 2017
Priority date: Nov 29, 2017
Publication date: Nov 19, 2019
Grant date: Nov 19, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, computer program product, and computer system for receiving, by a computing device, a first signal emitted from one or more sources. A second signal may be received emitted from the one or more sources. A first confidence level that the wake-up-word is included in the first signal may be determined. A second confidence level that the wake-up-word is included in the second signal may be determined. It may be identified that the wake-up-word originated from a first source of the one or more sources based upon, at least in part, the first and second confidence levels. The first source may be enabled to participate in a dialog phase.
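
The decision step in the abstract can be illustrated with a minimal sketch. It assumes that each received signal corresponds to one candidate source (for example, one beamformer output per talker position), which the abstract does not state, and that a wake-up-word detector has already produced a confidence in [0, 1] for each signal. The function name `pick_wake_word_source`, the threshold value, and the highest-confidence rule are hypothetical choices for illustration, not the patented method.

```python
from typing import Optional, Sequence


def pick_wake_word_source(confidences: Sequence[float],
                          threshold: float = 0.5) -> Optional[int]:
    """Return the index of the signal most likely to contain the wake-up-word.

    Assumption: signal i is associated with candidate source i.
    Returns None when no confidence reaches the (hypothetical) threshold.
    """
    if not confidences:
        return None
    best = max(range(len(confidences)), key=lambda i: confidences[i])
    return best if confidences[best] >= threshold else None


# First and second confidence levels, one per received signal (made-up values).
first_confidence, second_confidence = 0.91, 0.34
source = pick_wake_word_source([first_confidence, second_confidence])

# Only the identified source is enabled for the dialogue phase.
dialogue_enabled = {i: (i == source) for i in range(2)}
```

In this sketch the source behind the first signal is enabled for the dialogue phase and the other is not. A real system would likely combine the confidences with spatial information rather than compare them directly.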

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method comprising: receiving, by a computing device, a first signal emitted from one or more sources; receiving a second signal emitted from the one or more sources; determining a first confidence level that a wake-up-word is included in the first signal; determining a second confidence level that the wake-up-word is included in the second signal; identifying that the wake-up-word originated from a first source of the one or more sources based upon, at least in part, the first and second confidence levels; and enabling the first source to participate in a dialogue phase.
2. The computer-implemented method of claim 1 further comprising excluding at least a second source of the one or more sources from participating in the dialogue phase based upon, at least in part, historical information of the one or more sources.
3. The computer-implemented method of claim 1 wherein a Bayes decision process is used, at least in part, to identify the one or more sources.
4. The computer-implemented method of claim 1 wherein a wrapped Gaussian mixture model is used, at least in part, to identify the one or more sources.
5. The computer-implemented method of claim 1 further comprising tracking movement of the first source with at least one of one or more core localizers.
6. The computer-implemented method of claim 1 wherein the first signal and the second signal are received at a microphone group.
7. The computer-implemented method of claim 1 further comprising determining one or more angles at a given frequency expected to exhibit a maximum grating lobe based upon, at least in part, a latest model state.
8. The computer-implemented method of claim 1 further comprising extracting, by a beamformer, any source of the one or more sources except sources of the one or more sources deemed as interference.
9. A computer program product residing on a non-transitory computer readable storage medium having a plurality of instructions stored thereon which, when executed across one or more processors, causes at least a portion of the one or more processors to perform operations comprising: receiving a first signal emitted from one or more sources; receiving a second signal emitted from the one or more sources; determining a first confidence level that a wake-up-word is included in the first signal; determining a second confidence level that the wake-up-word is included in the second signal; identifying that the wake-up-word originated from a first source of the one or more sources based upon, at least in part, the first and second confidence levels; and enabling the first source to participate in a dialogue phase.
10. The computer program product of claim 9 further comprising excluding at least a second source of the one or more sources from participating in the dialogue phase based upon, at least in part, historical information of the one or more sources.
11. The computer program product of claim 9 wherein a Bayes decision process is used, at least in part, to identify the one or more sources.
12. The computer program product of claim 9 wherein a wrapped Gaussian mixture model is used, at least in part, to identify the one or more sources.
13. The computer program product of claim 9 wherein the operations further comprise tracking movement of the first source with at least one of one or more core localizers.
14. The computer program product of claim 9 wherein the operations further comprise extracting, by a beamformer, any source of the one or more sources except sources of the one or more sources deemed as interference.
15. The computer program product of claim 9 wherein the operations further comprise determining one or more angles at a given frequency expected to exhibit a maximum grating lobe based upon, at least in part, a latest model state.
16. A computing system including one or more processors and one or more computer readable media having a plurality of instructions stored thereon which, when executed across the one or more processors, causes at least a portion of the one or more processors to perform operations comprising: receiving a first signal emitted from one or more sources; receiving a second signal emitted from the one or more sources; determining a first confidence level that a wake-up-word is included in the first signal; determining a second confidence level that the wake-up-word is included in the second signal; identifying that the wake-up-word originated from a first source of the one or more sources based upon, at least in part, the first and second confidence levels; and enabling the first source to participate in a dialogue phase.
17. The computing system of claim 16 wherein the operations further comprise excluding at least a second source of the one or more sources from participating in the dialogue phase based upon, at least in part, historical information of the one or more sources.
18. The computing system of claim 16 wherein a Bayes decision process is used, at least in part, to identify the one or more sources.
19. The computing system of claim 16 wherein a wrapped Gaussian mixture model is used, at least in part, to identify the one or more sources.
20. The computing system of claim 16 wherein the operations further comprise at least one of extracting, by a beamformer, any source of the one or more sources except sources of the one or more sources deemed as interference and determining one or more angles at a given frequency expected to exhibit a maximum grating lobe based upon, at least in part, a latest model state.
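
The claims name a Bayes decision process and a wrapped Gaussian mixture model without giving formulas on this page. As a rough illustration of how the two can fit together, the sketch below assumes each source is modeled by one wrapped Gaussian over direction of arrival (in radians) and assigns an observed angle to the source with the highest posterior probability. The parameter values, the truncation to a few wraps, and the function names are all hypothetical, and the patent's actual model may differ.

```python
import math
from typing import List, Tuple


def wrapped_gaussian_pdf(theta: float, mu: float, sigma: float,
                         wraps: int = 3) -> float:
    """Gaussian density wrapped onto the circle, truncated to 2*wraps+1 terms."""
    norm = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return sum(
        norm * math.exp(-0.5 * ((theta - mu + 2.0 * math.pi * k) / sigma) ** 2)
        for k in range(-wraps, wraps + 1)
    )


def bayes_assign(theta: float,
                 components: List[Tuple[float, float, float]]
                 ) -> Tuple[int, List[float]]:
    """Pick the source with the highest posterior for an observed angle.

    components holds one (prior weight, mean angle, std dev) tuple per source.
    """
    joint = [w * wrapped_gaussian_pdf(theta, mu, s) for w, mu, s in components]
    total = sum(joint)
    posteriors = [j / total for j in joint]
    best = max(range(len(posteriors)), key=posteriors.__getitem__)
    return best, posteriors


# Two hypothetical sources at -170 and +30 degrees with equal priors.
sources = [(0.5, math.radians(-170.0), 0.3), (0.5, math.radians(30.0), 0.3)]

# An observation at +175 degrees is only 15 degrees from the first source
# once the angle wraps around the circle.
label, post = bayes_assign(math.radians(175.0), sources)
```

The wrapping matters here: an unwrapped Gaussian would treat +175 degrees and -170 degrees as 345 degrees apart, while the wrapped density treats them as neighbors.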

Assignees

Nuance Communications Inc

Inventors

Classifications

  • Multi-channel systems specially adapted for direction-finding, i.e. having a single aerial system capable of giving simultaneous indications of the directions of different signals

  • for combining the signals of two or more microphones (specially adapted for hearing aids H04R25/407)

  • Applications of wireless loudspeakers or wireless microphones

  • microphones

  • Word spotting

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10482878B2 cover?
A method, computer program product, and computer system for receiving, by a computing device, a first signal emitted from one or more sources. A second signal may be received emitted from the one or more sources. A first confidence level that the wake-up-word is included in the first signal may be determined. A second confidence level that the wake-up-word is included in the second signal may b…
Who is the assignee on this patent?
Nuance Communications Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Nov 19, 2019 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).