Devices, systems, and methods for distributed voice processing

US10867604B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10867604-B2
Application numberUS-201916271560-A
CountryUS
Kind codeB2
Filing dateFeb 8, 2019
Priority dateFeb 8, 2019
Publication dateDec 15, 2020
Grant dateDec 15, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for distributed voice processing are disclosed herein. In one example, the method includes detecting sound via a microphone array of a first playback device and analyzing, via a first wake-word engine of the first playback device, the detected sound. The first playback device may transmit data associated with the detected sound to a second playback device over a local area network. A second wake-word engine of the second playback device may analyze the transmitted data associated with the detected sound. The method may further include identifying that the detected sound contains either a first wake word or a second wake word based on the analysis via the first and second wake-word engines, respectively. Based on the identification, sound data corresponding to the detected sound may be transmitted over a wide area network to a remote computing device associated with a particular voice assistant service.

First claim

Opening claim text (preview).

We claim: 1. A method comprising: detecting sound via a microphone array of a first playback device; transmitting data associated with the detected sound from the first playback device to a second playback device over a local area network; analyzing, via a wake word engine of the second playback device, the transmitted data associated with the detected sound for identification of a wake word; identifying that the detected sound contains the wake word based on the analysis via the wake word engine; based on the identification, transmitting sound data corresponding to the detected sound from the second playback device to a remote computing device over a wide area network, wherein the remote computing device is associated with a particular voice assistant service; receiving via the second playback device a response from the remote computing device, wherein the response is based on the detected sound; transmitting a message from the second playback device to the first playback device over the local area network, wherein the message is based on the response from the remote computing device and includes instructions to perform an action; and performing the action via the first playback device. 2. The method of claim 1 , wherein the action is a first action and the method further comprises performing a second action via the second playback device, wherein the second action is based on the response from the remote computing device. 3. The method of claim 1 , further comprising disabling a wake word engine of the first playback device in response to the identification of the wake word via the wake word engine of the second playback device. 4. The method of claim 3 , further comprising enabling a wake word engine of the first playback device after the second playback device receives the response from the remote computing device. 5. The method of claim 4 , wherein the wake word is a second wake word, and wherein the wake word engine of the first playback device is configured to detect a first wake word that is different than the second wake word. 6. The method of claim 1 , wherein the first playback device is configured to communicate with the remote computing device associated with the particular voice assistant service. 7. The method of claim 1 , wherein the remote computing device is a first remote computing device and the voice assistant service is a first voice assistant service, and wherein the first playback device is configured to detect a wake word associated with a second voice assistant service different than the first voice assistant service. 8. A first playback device comprising: one or more processors; a computer-readable medium storing instructions that, when executed by the one or more processors, cause the first playback device to perform operations comprising: receiving, from a second playback device over a local area network, data associated with sound detected via a microphone array of the second playback device; analyzing, via a wake word engine of the first playback device, the data associated with the detected sound for identification of a wake word; identifying that the detected sound contains the wake word based on the analysis via the wake word engine; based on the identification, transmitting sound data corresponding to the detected sound to a remote computing device over a wide area network, wherein the remote computing device is associated with a particular voice assistant service; receiving a response from the remote computing device, wherein the response is based on the detected sound; and transmitting a message to the second playback device over the local area network, wherein the message is based on the response from the remote computing device and includes instructions for the second playback device to perform an action. 9. The first playback device of claim 8 , wherein the action is a first action and the operations further comprise performing a second action via the first playback device, wherein the second action is based on the response from the remote computing device. 10. The first playback device of claim 8 , wherein the operations further comprise disabling a wake word engine of the second playback device in response to the identification of the wake word via the wake word engine of the first playback device. 11. The first playback device of claim 10 , wherein the operations further comprise enabling the wake word engine of the second playback device after the first playback device receives the response from the remote computing device. 12. The first playback device of claim 11 , wherein the wake word is a first wake word, and wherein the wake word engine of the second playback device is configured to detect a second wake word that is different than the first wake word. 13. The first playback device of claim 8 , wherein the second playback device is configured to communicate with the remote computing device associated with the particular voice assistant service. 14. The first playback device of claim 8 , wherein the remote computing device is a first remote computing device and the voice assistant service is a first voice assistant service, and wherein the second playback device is configured to detect a wake word associated with a second voice assistant service different than the first voice assistant service. 15. A system, comprising: a first playback device comprising: one or more processors; a microphone array; and a first computer-readable medium storing instructions that, when executed by the one or more processors, cause the first playback device to perform first operations, the first operations comprising: detecting sound via the microphone array; transmitting data associated with the detected sound to a second playback device over a local area network; the second playback device comprising: one or more processors; and a second computer-readable medium storing instructions that, when executed by the one or more processors, cause the second playback device to perform second operations, the second operations comprising: analyzing, via a wake word engine of the second playback device, the transmitted data associated with the detected sound from the first playback device for identification of a wake word; identifying that the detected sound contains the wake word based on the analysis via the wake word engine; based on the identification, transmitting sound data corresponding to the detected sound to a remote computing device over a wide area network, wherein the remote computing device is associated with a particular voice assistant service; receiving a response from the remote computing device, wherein the response is based on the detected sound; and transmitting a message to the first playback device over the local area network, wherein the message is based on the response from the remote computing device and includes instructions to perform an action, wherein the first computer-readable medium of the first playback device causes the first playback device to perform the action from the instructions received from the second playback device. 16. The system of claim 15 , wherein the action is a first action and the second operations further comprise performing a second action via the second playback device, wherein the second action is based on the response from the remote computing device. 17. The system of claim 15 , wherein the second operations further comprise disabling a wake word engine of the first playback device in response to the identification of the wake word via the wake word engine of the

Assignees

Inventors

Classifications

  • Word spotting · CPC title

  • Public address systems (circuits for preventing acoustic reaction H04R3/02; circuits for distributing signals to loudspeakers H04R3/12; {monitoring or testing arrangements for public address systems H04R29/007}; amplifiers H03F) · CPC title

  • for combining the signals of two or more microphones (specially adapted for hearing aids H04R25/407) · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Audio distribution systems for home, i.e. multi-room use · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10867604B2 cover?
Systems and methods for distributed voice processing are disclosed herein. In one example, the method includes detecting sound via a microphone array of a first playback device and analyzing, via a first wake-word engine of the first playback device, the detected sound. The first playback device may transmit data associated with the detected sound to a second playback device over a local area n…
Who is the assignee on this patent?
Sonos Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 15 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).