Voice recognition with timing information for noise cancellation

US12165667B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12165667-B2
Application numberUS-202318352931-A
CountryUS
Kind codeB2
Filing dateJul 14, 2023
Priority dateJan 25, 2019
Publication dateDec 10, 2024
Grant dateDec 10, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, devices, and methods are described for reducing degradation of a voice recognition input. An always listening device may always be listening for voice commands via a microphone and may experience interference from unwanted audio such as from the output audio of television speakers. The always listening device may receive data associated with the output audio over a first communications channel. The always listening device may also receive, on a second communications channel, timing information associated with data. The always listening device may adjust admission of the audio received by the microphone to enable it to arrive at approximately the same time as the data received via the first communications channel. The unwanted output audio included in the audio received via the microphone may then be determined and may be removed so that a voice command in the audio received by the microphone may be processed.

First claim

Opening claim text (preview).

What is claimed: 1. A method comprising: receiving first data indicative of content being rendered by a device; receiving, via a user input, second data; synchronizing, based on timing data associated with the first data, the second data with the first data; determining, based on the synchronized first data and second data, whether the second data comprises a portion of the content; and processing, based on filtering the portion of the content from the second data, a voice command indicated by the filtered second data. 2. The method of claim 1 , further comprising: determining, based on the timing data, a delay associated with receiving the first data. 3. The method of claim 2 , wherein the first data comprises a plurality of first timestamps, wherein the timing data comprises a plurality of second timestamps corresponding with the plurality of first timestamps, wherein the delay indicates a difference between a first arrival time, of a first timestamp of the plurality of first timestamps, and a second arrival time, of a second timestamp of the plurality of second timestamps that corresponds with the first timestamp. 4. The method of claim 2 , wherein the determining, based on the timing data, the delay further comprises: receiving the timing data via a Radio Frequency for Consumer Electronics (RF4CE) channel. 5. The method of claim 2 , wherein the delay indicates that the first data is delayed with respect to time in comparison to the second data, or wherein the delay indicates that the first data is ahead with respect to time in comparison to the second data. 6. The method of claim 1 , wherein the first data is received via a local area network (LAN) interface. 7. The method of claim 1 , wherein the device comprises at least one of: a speaker, a television, or a computing device. 8. The method of claim 1 , wherein the second data is received via a user device, wherein the user device comprises one of: an always-on listening assistant device, a remote control with a microphone, a speakerphone, a smartphone, a tablet, or a computing device. 9. A device comprising: one or more processors; and memory storing instructions that, when executed by the one or more processors, cause the device to: receive first data indicative of content being rendered by an output device; receive, via a user input, second data; synchronize, based on timing data associated with the first data, the second data with the first data; determine, based on the synchronized first data and second data, whether the second data comprises a portion of the content; and process, based on filtering the portion of the content from the second data, a voice command indicated by the filtered second data. 10. The device of claim 9 , wherein the instructions, when executed, further cause the device to: determine, based on the timing data, a delay associated with receiving the first data. 11. The device of claim 10 , wherein the first data comprises a plurality of first timestamps, wherein the timing data comprises a plurality of second timestamps corresponding with the plurality of first timestamps, wherein the delay indicates a difference between a first arrival time, of a first timestamp of the plurality of first timestamps, and a second arrival time, of a second timestamp of the plurality of second timestamps that corresponds with the first timestamp. 12. The device of claim 10 , wherein the determining, based on the timing data, the delay further comprises: receiving the timing data via a Radio Frequency for Consumer Electronics (RF4CE) channel. 13. The device of claim 10 , wherein the delay indicates that the first data is delayed with respect to time in comparison to the second data, or wherein the delay indicates that the first data is ahead with respect to time in comparison to the second data. 14. The device of claim 9 , wherein the first data is received via a local area network (LAN) interface. 15. The device of claim 9 , wherein the output device comprises at least one of: a speaker, a television, or a computing device. 16. The device of claim 9 , wherein the second data is received via a user device, wherein the user device comprises one of: an always-on listening assistant device, a remote control with a microphone, a speakerphone, a smartphone, a tablet, or a computing device. 17. A non-transitory computer-readable storage medium storing computer-readable instructions that, when executed by a processor, cause: receiving first data indicative of content being rendered by a device; receiving, via a user input, second data; synchronizing, based on timing data associated with the first data, the second data with the first data; determining, based on the synchronized first data and second data, whether the second data comprises a portion of the content; and processing, based on filtering the portion of the content from the second data, a voice command indicated by the filtered second data. 18. The non-transitory computer-readable storage medium of claim 17 , wherein the instructions, when executed, further cause: determining, based on the timing data, a delay associated with receiving the first data. 19. The non-transitory computer-readable storage medium of claim 18 , wherein the first data comprises a plurality of first timestamps, wherein the timing data comprises a plurality of second timestamps corresponding with the plurality of first timestamps, wherein the delay indicates a difference between a first arrival time, of a first timestamp of the plurality of first timestamps, and a second arrival time, of a second timestamp of the plurality of second timestamps that corresponds with the first timestamp. 20. The non-transitory computer-readable storage medium of claim 18 , wherein the delay indicates that the first data is delayed with respect to time in comparison to the second data, or wherein the delay indicates that the first data is ahead with respect to time in comparison to the second data.

Assignees

Inventors

Classifications

  • Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Execution procedure of a spoken command · CPC title

  • the noise being echo, reverberation of the speech · CPC title

  • for synchronising with other signals, e.g. video signals · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12165667B2 cover?
Systems, devices, and methods are described for reducing degradation of a voice recognition input. An always listening device may always be listening for voice commands via a microphone and may experience interference from unwanted audio such as from the output audio of television speakers. The always listening device may receive data associated with the output audio over a first communications…
Who is the assignee on this patent?
Comcast Cable Comm Llc
What technology area does this patent fall under?
Primary CPC classification G10L21/0216. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 10 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).