Speech-processing system

US11790902B2 · US · B2

Patent metadata

Field               Value
Publication number  US-11790902-B2
Application number  US-202016781773-A
Country             US
Kind code           B2
Filing date         Feb 4, 2020
Priority date       Feb 4, 2020
Publication date    Oct 17, 2023
Grant date          Oct 17, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system may include first and second speech-processing systems. The first speech-processing system may process received audio data and determine that a command represented therein is associated with a second speech-processing system. The first speech-processing system may send command data to the second speech-processing system and receive response data in return. The first speech-processing system may then process the response data to determine second response data that includes an indication of the second speech-processing system and cause output of audio corresponding to the second response data.
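The flow in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented implementation: the class names `FirstSystem`, `SecondSystem`, and `Response`, the assistant names, and the set of delegated commands are all hypothetical, and plain strings stand in for audio and response data.

```python
from dataclasses import dataclass


@dataclass
class Response:
    text: str
    source: str  # name of the system that produced the text


class SecondSystem:
    """Hypothetical stand-in for the second speech-processing system."""

    name = "SecondAssistant"

    def handle(self, command: str) -> Response:
        # Produce response data for the delegated command.
        return Response(text=f"Done: {command}", source=self.name)


class FirstSystem:
    """Hypothetical stand-in for the first speech-processing system."""

    name = "FirstAssistant"

    def __init__(self, delegate: SecondSystem, delegated_commands: set):
        self.delegate = delegate
        # Assumption: a fixed set decides which commands belong to the delegate.
        self.delegated_commands = delegated_commands

    def process(self, command: str) -> Response:
        if command not in self.delegated_commands:
            return Response(text=f"Handled locally: {command}", source=self.name)
        # Send command data to the second system and receive response data.
        first_response = self.delegate.handle(command)
        # Build second response data that includes an indication of the
        # second system, but is output by the first system.
        second_text = f"{first_response.source} says: {first_response.text}"
        return Response(text=second_text, source=self.name)


first = FirstSystem(SecondSystem(), {"roll down the window"})
reply = first.process("roll down the window")
print(reply.source, "->", reply.text)
```

The point of the sketch is that the user hears a single voice: the reply is attributed to the first system even though the second system did the work, and the reply text still names the second system.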

First claim

Opening claim text (preview).

What is claimed is:

1. A method for processing data representing a spoken command, the method comprising: receiving, from a voice-controlled device, first audio data representing an utterance; determining that the first audio data includes: a first portion of audio data representing a wakeword corresponding to a first speech-processing system having a first style of synthesized speech, and a second portion of audio data representing a command; determining a domain associated with the command; determining that the domain is associated with a second speech-processing system having a second style of synthesized speech; determining an indication that the second speech-processing system is capable of processing command data including the command; sending, from the first speech-processing system to the second speech-processing system, the command data; receiving, from the second speech-processing system, a first response to the command, the first response including a representation of first natural language corresponding to the second style of synthesized speech; determining, using the first speech-processing system and based at least in part on the first response, a second response to the command, the second response including a representation of second natural language corresponding to the first style of synthesized speech; and sending second audio data to the voice-controlled device corresponding to the second response associated with the first style of synthesized speech.

2. The method of claim 1, further comprising: determining that the command corresponds to an action set to occur at a later time; and sending, from the second speech-processing system to the voice-controlled device, a second command to execute the action, wherein the second audio data includes a representation of a name of the second speech-processing system.

3. The method of claim 1, further comprising: receiving, from the voice-controlled device, third audio data representing a second utterance; determining that the third audio data corresponds to a second command; determining that the second command corresponds to a user device proximate a user; and sending, to the user device via an application programming interface, an instruction corresponding to the second command.

4. A computer-implemented method comprising: receiving input data corresponding to a command; determining, using a first natural-understanding system, that the command is associated with a second natural-understanding system; determining a first indication that the second natural-understanding system is capable of processing first data corresponding to the command; sending, from the first natural-understanding system to the second natural-understanding system, the first data; receiving, from the second natural-understanding system, second data corresponding to a first response to the command, the second data including a second indication of the second natural-understanding system; determining, by the first natural-understanding system and based at least in part on the second data, third data corresponding to a second response to the command, the third data including a third indication of the first natural-understanding system; and causing output corresponding to the third data associated with the first natural-understanding system.

5. The computer-implemented method of claim 4, further comprising: prior to causing the output, determining that the second natural-understanding system is associated with a request for permission; determining to cause second output corresponding to the request for permission before sending the first data; and receiving fourth input data corresponding to the request for permission, wherein the third data further includes a representation of the request for permission.

6. The computer-implemented method of claim 4, further comprising: determining that the command corresponds to a second output of the second natural-understanding system, the second output occurring after the output, wherein the third data includes a representation of a name of the second natural-understanding system.

7. The computer-implemented method of claim 4, further comprising: determining that receiving the input data is associated with a first device; and determining that causing the output is associated with a second device, wherein the third data includes a representation of a name of the second natural-understanding system.

8. The computer-implemented method of claim 4, further comprising: determining that receiving the input data is associated with a first device; determining a user account associated with the first device; and determining that the user account includes a fourth indication of the second natural-understanding system, wherein the third data includes a representation of a name of the second natural-understanding system.

9. The computer-implemented method of claim 4, further comprising: determining that the input data includes a representation of a wakeword associated with the first natural-understanding system.

10. The computer-implemented method of claim 4, wherein determining that the command is associated with the second natural-understanding system comprises: determining a domain corresponding to the input data; and determining that the second natural-understanding system corresponds to the domain.

11. The computer-implemented method of claim 4, further comprising: receiving second audio data corresponding to a second command; determining that the second command is associated with the second natural-understanding system; and sending, to the second natural-understanding system, a third command to process third audio data.

12. The computer-implemented method of claim 4, further comprising: determining that the second data lacks first information; sending, from the first natural-understanding system to the second natural-understanding system, fourth data corresponding to a request for the first information; and receiving, from the second natural-understanding system, fifth data corresponding to the first information.

13. A computing device comprising: at least one processor; and at least one memory including instructions that, when executed by the at least one processor, cause the computing device to: receive input data corresponding to a command; determine, using a first natural-understanding system, that the command is associated with a second natural-understanding system; determine a first indication that the second natural-understanding system is capable of processing first data corresponding to the command; send, from the first natural-understanding system to the second natural-understanding system, the first data; receive, from the second natural-understanding system, second data corresponding to a first response to the command, the second data including a second indication of the second natural-understanding system; determine, by the first natural-understanding system and based at least in part on the second data, third data corresponding to a second response to the command, the third data including a third indication of the first natural-understanding system; and cause output corresponding to the third data associated with the first natural-understanding system.

14. The computing device of claim 13, wherein the at least one memory further includes instructions that, when executed by the at least one processor, further cause the computing device to: prior to causing the output, determine that the second natural-understanding system is associated with a request for permission
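The routing steps shared by claims 1, 4, and 10 (determine a domain, find the system associated with that domain, and confirm that system can process the command) could look roughly like the sketch below. Everything named here is an assumption made for illustration: the keyword table stands in for real natural-language understanding, and the registry, system names, and capability flags are hypothetical, not taken from the patent.

```python
from typing import Optional

# Hypothetical keyword-to-domain table standing in for real NLU.
DOMAIN_KEYWORDS = {
    "window": "automotive",
    "song": "music",
}

# Hypothetical registry: domain -> (system name, capability indication).
DOMAIN_REGISTRY = {
    "automotive": ("CarAssistant", True),
    "music": ("MusicAssistant", False),
}


def determine_domain(command: str) -> Optional[str]:
    """Return the first domain whose keyword appears in the command."""
    lowered = command.lower()
    for keyword, domain in DOMAIN_KEYWORDS.items():
        if keyword in lowered:
            return domain
    return None


def select_system(command: str, default: str = "FirstAssistant") -> str:
    """Pick the system that should process the command.

    Falls back to the default (first) system when no domain matches or
    when the associated system is not marked as capable.
    """
    domain = determine_domain(command)
    if domain is None:
        return default
    system, capable = DOMAIN_REGISTRY.get(domain, (default, True))
    return system if capable else default


print(select_system("Roll down the window"))
print(select_system("Play a song"))
```

Keeping the capability check separate from the domain lookup mirrors the claim language, which treats "associated with the domain" and "capable of processing the command data" as two distinct determinations.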

Assignees

Amazon Tech Inc

Inventors

Classifications

  • G10L15/30 · Primary

    Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title

  • of application context · CPC title

  • Semantic analysis · CPC title

  • Named entity recognition · CPC title

  • Recognition of textual entities · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11790902B2 cover?
A system may include first and second speech-processing systems. The first speech-processing system may process received audio data and determine that a command represented therein is associated with a second speech-processing system. The first speech-processing system may send command data to the second speech-processing system and receive response data in return. The first speech-processing s…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Oct 17, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).