Filtering audio-based interference from voice commands using interference information

US10832678B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10832678-B2
Application numberUS-201816004234-A
CountryUS
Kind codeB2
Filing dateJun 8, 2018
Priority dateJun 8, 2018
Publication dateNov 10, 2020
Grant dateNov 10, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method, according to one embodiment, includes: receiving a complex audio signal which includes an intended audio signal and at least one interfering audio signal. Moreover, the intended audio signal is a voice-based command originating from a user. Information which corresponds to the at least one interfering audio signal is also received. The received information is used to identify portions of the complex audio signal as being the at least one interfering audio signal. Furthermore, the identified portion of the complex audio signal is removed from the complex audio signal, and a remaining portion of the complex audio signal is output.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: receiving a complex audio signal, wherein the complex audio signal includes an intended audio signal and at least one interfering audio signal, wherein the intended audio signal is a voice-based command originating from a user; receiving information which corresponds to the at least one interfering audio signal, wherein the received information includes network activity experienced by a network router which corresponds to the user; using the received information to identify portions of the complex audio signal as being the at least one interfering audio signal; removing the identified portion of the complex audio signal from the complex audio signal; and outputting a remaining portion of the complex audio signal, wherein using the received information to identify portions of the complex audio signal as being the at least one interfering audio signal includes: identifying audio signals included in the network activity; comparing the audio signals with the complex audio signal; and identifying matches between the audio signals and the complex audio signal as portions of the at least one interfering audio signal. 2. The computer-implemented method of claim 1 , wherein the received information includes: a timestamp which identifies a time that the voice-based command was originated; content which corresponds to the at least one interfering audio signal; and an offset which identifies a portion of the content which produced the at least one interfering audio signal at the time that the voice-based command was originated. 3. The computer-implemented method of claim 2 , wherein the content includes an actual copy of the at least one interfering audio signal, or a live stream of the at least one interfering audio signal. 4. The computer-implemented method of claim 1 , wherein the received information includes: a timestamp which identifies a time that the voice-based command was originated. 5. The computer-implemented method of claim 1 , wherein the received information includes: a timestamp which identifies a time that the voice-based command was originated; and audio samples collected from one or more other users at about the time that the voice-based command was originated, the one or more other users being located in a same area as the user which originated the voice-based command, wherein using the received information to identify portions of the complex audio signal as being the at least one interfering audio signal includes: comparing the audio samples with the complex audio signal; and identifying matches between the audio samples and the complex audio signal as portions of the at least one interfering audio signal. 6. The computer-implemented method of claim 1 , wherein using the received information to identify portions of the complex audio signal as being the at least one interfering audio signal includes: converting the complex audio signal into a plurality of fingerprints; comparing the plurality of fingerprints to a database of fingerprints which represent known audio signals; and using matches between the plurality of fingerprints and the database of fingerprints to determine whether the respective portions of the complex audio signal correspond to the interfering audio signal. 7. The computer-implemented method of claim 6 , comprising: using the received information which corresponds to the at least one interfering audio signal to reduce a size of the database compared to the plurality of fingerprints, wherein the received information includes contextual information which corresponds to the user and/or user integrated devices in a same environment as the user. 8. The computer-implemented method of claim 1 , wherein the received information includes a second complex audio signal, wherein the second complex audio signal includes the intended audio signal and at least a second interfering audio signal, wherein using the received information to identify portions of the complex audio signal as being the at least one interfering audio signal includes: comparing the complex audio signal with the second complex audio signal; identifying matches between the complex audio signal and the second complex audio signal as portions of the intended audio signal; and identifying the remaining portions of the complex audio signal and the second complex audio signal which do not match as portions of the at least one interfering audio signal. 9. A computer-implemented method, comprising: receiving a complex audio signal, wherein the complex audio signal includes an intended audio signal and at least one interfering audio signal, wherein the intended audio signal is a voice-based command originating from a user; converting the complex audio signal into text which represents a plurality of words included in the complex audio signal; receiving information which corresponds to the at least one interfering audio signal, wherein the received information includes audio samples collected from one or more other users at about a time that the voice-based command was originated, the one or more other users being located in a same area as the user which originated the voice-based command, wherein the audio samples collected from the one or more other users includes the at least one interfering audio signal; using the received information to identify at least some of the text as representing words which correspond to the at least one interfering audio signal; discarding the identified text; and outputting a remaining portion of the text, wherein the received information includes a second complex audio signal, wherein the second complex audio signal includes the intended audio signal and at least a second interfering audio signal, wherein using the received information to identify portions of the complex audio signal as being the at least one interfering audio signal includes: converting the second complex audio signal into supplemental text which represents a plurality of words included in the second complex audio signal, comparing the text with the supplemental text, identifying matches between the text and the supplemental text as representing words which correspond to the intended audio signal, and identifying the remaining portions of the text and the supplemental text which do not match as representing words which correspond to the at least one interfering audio signal. 10. The computer-implemented method of claim 9 , wherein the received information includes: a timestamp which identifies the time that the voice-based command was originated; content which corresponds to the at least one interfering audio signal; and an offset which identifies a portion of the content which produced the at least one interfering audio signal at the time that the voice-based command was originated. 11. The computer-implemented method of claim 10 , wherein the content includes an actual copy of the at least one interfering audio signal, or a live stream of the at least one interfering audio signal. 12. The computer-implemented method of claim 9 , wherein the received information includes: a timestamp which identifies the time that the voice-based command was originated; and network activity experienced by a network router which corresponds to the user, wherein using the received information to identify at least some of the text as representing words which correspond to the at least one interfering audio signal includes: identifying audio signals included in the network activity; converting the identified audio signals into supplemental text which represents a plurality of words includ

Assignees

Inventors

Classifications

  • Noise filtering · CPC title

  • for comparison or discrimination · CPC title

  • Execution procedure of a spoken command · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Query processing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10832678B2 cover?
A computer-implemented method, according to one embodiment, includes: receiving a complex audio signal which includes an intended audio signal and at least one interfering audio signal. Moreover, the intended audio signal is a voice-based command originating from a user. Information which corresponds to the at least one interfering audio signal is also received. The received information is used…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G10L21/0208. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 10 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).