Voice control of remote device by disabling wakeword detection

US10074371B1 · US · B1

Patent metadata
Publication number: US-10074371-B1
Application number: US-201715458628-A
Country: US
Kind code: B1
Filing date: Mar 14, 2017
Priority date: Mar 14, 2017
Publication date: Sep 11, 2018
Grant date: Sep 11, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system configured to enable remote control to allow a first user to provide assistance to a second user. The system may receive a command from the second user granting remote control to the first user, enabling the first user to initiate a voice command on behalf of the second user. In some examples, the system may enable the remote control by treating a voice command originating from the first user as though it originated from the second user instead. For example, the system may receive the voice command from a first device associated with the first user but may route the voice command as though it was received by a second device associated with the second user. To enable this functionality, during a remote control session the first device may disable wakeword detection so that the voice command is correctly routed to the second device.
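The routing idea in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the `Device`, `RemoteControlSession`, and `Router` names, the profile strings, and the choice to model "routing" as a profile lookup are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Device:
    device_id: str
    user_profile: str              # profile the device is registered to
    wakeword_enabled: bool = True


@dataclass
class RemoteControlSession:
    helper: Device     # "first device": the user giving assistance
    assisted: Device   # "second device": the user who granted control


class Router:
    """Hypothetical server-side router; names are illustrative only."""

    def __init__(self) -> None:
        self.session: Optional[RemoteControlSession] = None

    def grant_remote_control(self, assisted: Device, helper: Device) -> None:
        # The second user grants control. The helper's device stops reacting
        # to the wakeword so its commands are not captured locally.
        self.session = RemoteControlSession(helper=helper, assisted=assisted)
        helper.wakeword_enabled = False

    def end_remote_control(self) -> None:
        # Restore normal wakeword handling when the session ends.
        if self.session is not None:
            self.session.helper.wakeword_enabled = True
            self.session = None

    def effective_profile(self, source: Device) -> str:
        # During a session, a command arriving from the helper's device is
        # treated as though it came from the assisted user's device.
        if self.session is not None and source is self.session.helper:
            return self.session.assisted.user_profile
        return source.user_profile


first = Device("dev-1", "first-user")
second = Device("dev-2", "second-user")
router = Router()
router.grant_remote_control(assisted=second, helper=first)
print(router.effective_profile(first))   # attributed to the second user
```

Outside a session the same lookup returns the device's own profile, so normal commands from the first device are unaffected.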

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method, comprising: generating, by a first speech-controlled device in a first environment, first audio data using one or more microphones; sending at least a portion of the first audio data to a second speech-controlled device in a second environment physically remote from the first environment, the first audio data corresponding to a communication from the first speech-controlled device to the second speech-controlled device; detecting that a wakeword is represented in the first audio data; generating second audio data including at least a portion of the first audio data following the wakeword, the second audio data corresponding to a first command to disable wakeword detection; sending the second audio data to a remote server; receiving, from the remote server in response to the first command, an instruction to disable wakeword detection; disabling the wakeword detection; generating, while the wakeword detection is disabled, third audio data corresponding to the communication from the first speech-controlled device to the second speech-controlled device and including a second command to perform a second action; and sending at least a portion of the third audio data to the second speech-controlled device.

2. The computer-implemented method of claim 1, further comprising: detecting, using a wakeword detection component, that the wakeword is represented in the third audio data; determining that the wakeword detection is disabled; determining to ignore the wakeword; detecting, using the wakeword detection component, that a second wakeword is represented in the third audio data, the second wakeword corresponding to a command to enable the wakeword detection; and enabling the wakeword detection.

3. The computer-implemented method of claim 1, further comprising, by the first speech-controlled device prior to receiving the instruction to disable the wakeword detection: detecting, using the wakeword detection, that a second wakeword is represented in the first audio data; generating fourth audio data including at least a portion of the first audio data following the second wakeword; and sending the fourth audio data to the remote server, the fourth audio data corresponding to a third command to perform a third action.

4. The computer-implemented method of claim 1, further comprising: receiving fourth audio data originating from the second speech-controlled device; outputting first audio corresponding to a first portion of the fourth audio data, the first audio having a first volume level; generating fifth audio data using the one or more microphones; detecting that the wakeword is represented in the fifth audio data; outputting second audio corresponding to a second portion of the fourth audio data, the second audio having a second volume level that is less than the first volume level; receiving a notification that the second speech-controlled device is granted remote control of the first speech-controlled device; receiving sixth audio data originating from the second speech-controlled device; outputting third audio corresponding to a first portion of the sixth audio data, the third audio having the first volume level; generating, using the one or more microphones, seventh audio data corresponding to at least a portion of the third audio; detecting that the wakeword is represented in the seventh audio data; and outputting fourth audio corresponding to a second portion of the sixth audio data, the fourth audio having the first volume level.

5. A computer-implemented method, comprising: generating, by a first device in a first environment, first audio data; sending at least a portion of the first audio data to a second device in a second environment physically remote from the first environment, the first audio data corresponding to a communication from the first device to the second device; detecting that a wakeword is represented in the first audio data; generating second audio data including at least a portion of the first audio data following the wakeword, the second audio data corresponding to a first command to disable wakeword detection; sending at least the second audio data to a remote server; receiving, from the remote server in response to the first command, an instruction to disable wakeword detection; disabling the wakeword detection; generating, while the wakeword detection is disabled, third audio data corresponding to the communication from the first device to the second device and including a second command to perform an action; and sending at least a portion of the third audio data to the second device.

6. The computer-implemented method of claim 5, further comprising, by the first device prior to receiving the instruction to disable the wakeword detection: detecting that the wakeword is represented in the first audio data; generating fourth audio data including at least a portion of the first audio data following the wakeword; and sending the fourth audio data to the remote server, the fourth audio data corresponding to a third command to perform a second action associated with a first user profile corresponding to the first device.

7. The computer-implemented method of claim 6, further comprising, by the first device prior to receiving the instruction to disable the wakeword detection: detecting that a second wakeword is represented in the first audio data, the second wakeword being different from the first wakeword; generating fifth audio data including at least a portion of the first audio data following the second wakeword; and sending the fifth audio data to the remote server, the fifth audio data corresponding to a fourth command to perform a third action associated with a second user profile corresponding to the second device.

8. The computer-implemented method of claim 5, further comprising: receiving, from the remote server, a second instruction to enable the wakeword detection; and enabling the wakeword detection.

9. The computer-implemented method of claim 5, further comprising: detecting, using a wakeword detection component, that the wakeword is represented in the third audio data; determining that the wakeword detection is disabled; and determining to ignore the wakeword.

10. The computer-implemented method of claim 5, further comprising: detecting, using a wakeword detection component, that a second wakeword is represented in the third audio data, the second wakeword different than the first wakeword and corresponding to a third command to enable the wakeword detection; and enabling the wakeword detection.

11. The computer-implemented method of claim 5, further comprising, by the first device prior to receiving the instruction to disable the wakeword detection: detecting that wakeword is represented in the first audio data; generating fourth audio data including at least a portion of the first audio data following the wakeword; and sending the fourth audio data to the remote server, the fourth audio data corresponding to a third command to perform a second action associated with a user profile corresponding to the second device.

12. The computer-implemented method of claim 5, further comprising: receiving fourth audio data originating from the second device; outputting first audio corresponding to a first portion of the fourth audio data, the first audio having a first volume level; generating fifth audio data; detecting that the wakeword is represented in the fifth audio data; outputting second audio corresponding to a second portion of the fourth audio data, the second audio having a second volume level
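The wakeword handling recited in claims 1, 2, and 8 through 10 can be read as a small state machine on the first device. The Python sketch below is one hedged interpretation, not the claimed implementation: it substitutes text utterances for audio data, and the `FirstDevice` class, the wakeword strings `"computer"` and `"resume"`, and the instruction names are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FirstDevice:
    wakeword: str = "computer"        # hypothetical primary wakeword
    enable_wakeword: str = "resume"   # hypothetical second wakeword
    detection_enabled: bool = True
    sent_to_server: List[str] = field(default_factory=list)
    sent_to_peer: List[str] = field(default_factory=list)

    def handle_utterance(self, utterance: str) -> None:
        words = utterance.lower().split()
        # Audio belonging to the ongoing communication is always forwarded
        # to the second (peer) device.
        self.sent_to_peer.append(utterance)

        if not self.detection_enabled:
            # Claims 2 and 10: only the second wakeword is acted on here,
            # and it turns detection back on.
            if self.enable_wakeword in words:
                self.detection_enabled = True
            # Claim 9: the primary wakeword is detected but ignored.
            return

        if self.wakeword in words:
            # Claim 1: the audio following the wakeword goes to the server.
            start = words.index(self.wakeword) + 1
            self.sent_to_server.append(" ".join(words[start:]))

    def on_server_instruction(self, instruction: str) -> None:
        # Claim 1 (disable) and claim 8 (enable) arrive as server instructions.
        if instruction == "disable_wakeword_detection":
            self.detection_enabled = False
        elif instruction == "enable_wakeword_detection":
            self.detection_enabled = True


device = FirstDevice()
device.handle_utterance("computer disable wakeword detection")
device.on_server_instruction("disable_wakeword_detection")
device.handle_utterance("computer turn on the lights")  # forwarded to peer only
print(device.sent_to_server)
print(device.sent_to_peer)
```

While detection is disabled, the second command reaches only the peer device, which matches the abstract's point that the command should be handled as though it was received by the second device.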

Assignees

Amazon Tech Inc

Inventors

Classifications

  • Telecontrol · CPC title

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Speech classification or search · CPC title

  • G10L15/22 · Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Execution procedure of a spoken command · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10074371B1 cover?
A system configured to enable remote control to allow a first user to provide assistance to a second user. The system may receive a command from the second user granting remote control to the first user, enabling the first user to initiate a voice command on behalf of the second user. In some examples, the system may enable the remote control by treating a voice command originating from the fir…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Sep 11, 2018 (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).