Text-to-speech processing using input voice characteristic data
US-11373633-B2 · Jun 28, 2022 · US
US11790902B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11790902-B2 |
| Application number | US-202016781773-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 4, 2020 |
| Priority date | Feb 4, 2020 |
| Publication date | Oct 17, 2023 |
| Grant date | Oct 17, 2023 |
Official abstract text for this publication.
A system may include first and second speech-processing systems. The first speech-processing system may process received audio data and determine that a command represented therein is associated with a second speech-processing system. The first speech-processing system may send command data to the second speech-processing system and receive response data in return. The first speech-processing system may then process the response data to determine second response data that includes an indication of the second speech-processing system and cause output of audio corresponding to the second response data.
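The flow in the abstract can be illustrated with a minimal sketch. This is not the patented implementation. The class names, the `delegated_commands` set, and the "X says: ..." phrasing are hypothetical choices made only for illustration. The first system decides whether a command belongs to the second system, forwards it, and then produces its own response that includes an indication of the second system.

```python
from dataclasses import dataclass
from typing import Set


@dataclass
class Response:
    text: str
    source: str  # name of the system that produced this response


class SecondSystem:
    """Hypothetical stand-in for the second speech-processing system."""

    name = "SecondAssistant"

    def handle(self, command: str) -> Response:
        # Produce response data for the forwarded command.
        return Response(text=f"Done: {command}", source=self.name)


class FirstSystem:
    """Hypothetical stand-in for the first speech-processing system."""

    name = "FirstAssistant"

    def __init__(self, delegate: SecondSystem, delegated_commands: Set[str]):
        self.delegate = delegate
        # Assumption: association is a simple set lookup; the source does
        # not specify how the association is determined.
        self.delegated_commands = delegated_commands

    def process(self, command: str) -> Response:
        if command not in self.delegated_commands:
            return Response(text=f"Handled locally: {command}", source=self.name)
        # Send command data to the second system and receive response data.
        first_response = self.delegate.handle(command)
        # Build second response data that includes an indication of the
        # second system, then output it from the first system.
        second_text = f"{first_response.source} says: {first_response.text}"
        return Response(text=second_text, source=self.name)


first = FirstSystem(SecondSystem(), {"open the sunroof"})
result = first.process("open the sunroof")
print(result.text)
```

In this sketch the output always comes from the first system, while the text itself names the second system, which mirrors the "indication of the second speech-processing system" described in the abstract.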
Opening claim text (preview).
What is claimed is:

1. A method for processing data representing a spoken command, the method comprising: receiving, from a voice-controlled device, first audio data representing an utterance; determining that the first audio data includes: a first portion of audio data representing a wakeword corresponding to a first speech-processing system having a first style of synthesized speech, and a second portion of audio data representing a command; determining a domain associated with the command; determining that the domain is associated with a second speech-processing system having a second style of synthesized speech; determining an indication that the second speech-processing system is capable of processing command data including the command; sending, from the first speech-processing system to the second speech-processing system, the command data; receiving, from the second speech-processing system, a first response to the command, the first response including a representation of first natural language corresponding to the second style of synthesized speech; determining, using the first speech-processing system and based at least in part on the first response, a second response to the command, the second response including a representation of second natural language corresponding to the first style of synthesized speech; and sending second audio data to the voice-controlled device corresponding to the second response associated with the first style of synthesized speech.

2. The method of claim 1, further comprising: determining that the command corresponds to an action set to occur at a later time; and sending, from the second speech-processing system to the voice-controlled device, a second command to execute the action, wherein the second audio data includes a representation of a name of the second speech-processing system.

3. The method of claim 1, further comprising: receiving, from the voice-controlled device, third audio data representing a second utterance; determining that the third audio data corresponds to a second command; determining that the second command corresponds to a user device proximate a user; and sending, to the user device via an application programming interface, an instruction corresponding to the second command.

4. A computer-implemented method comprising: receiving input data corresponding to a command; determining, using a first natural-understanding system, that the command is associated with a second natural-understanding system; determining a first indication that the second natural-understanding system is capable of processing first data corresponding to the command; sending, from the first natural-understanding system to the second natural-understanding system, the first data; receiving, from the second natural-understanding system, second data corresponding to a first response to the command, the second data including a second indication of the second natural-understanding system; determining, by the first natural-understanding system and based at least in part on the second data, third data corresponding to a second response to the command, the third data including a third indication of the first natural-understanding system; and causing output corresponding to the third data associated with the first natural-understanding system.

5. The computer-implemented method of claim 4, further comprising: prior to causing the output, determining that the second natural-understanding system is associated with a request for permission; determining to cause second output corresponding to the request for permission before sending the first data; and receiving fourth input data corresponding to the request for permission, wherein the third data further includes a representation of the request for permission.

6. The computer-implemented method of claim 4, further comprising: determining that the command corresponds to a second output of the second natural-understanding system, the second output occurring after the output, wherein the third data includes a representation of a name of the second natural-understanding system.

7. The computer-implemented method of claim 4, further comprising: determining that receiving the input data is associated with a first device; and determining that causing the output is associated with a second device, wherein the third data includes a representation of a name of the second natural-understanding system.

8. The computer-implemented method of claim 4, further comprising: determining that receiving the input data is associated with a first device; determining a user account associated with the first device; and determining that the user account includes a fourth indication of the second natural-understanding system, wherein the third data includes a representation of a name of the second natural-understanding system.

9. The computer-implemented method of claim 4, further comprising: determining that the input data includes a representation of a wakeword associated with the first natural-understanding system.

10. The computer-implemented method of claim 4, wherein determining that the command is associated with the second natural-understanding system comprises: determining a domain corresponding to the input data; and determining that the second natural-understanding system corresponds to the domain.

11. The computer-implemented method of claim 4, further comprising: receiving second audio data corresponding to a second command; determining that the second command is associated with the second natural-understanding system; and sending, to the second natural-understanding system, a third command to process third audio data.

12. The computer-implemented method of claim 4, further comprising: determining that the second data lacks first information; sending, from the first natural-understanding system to the second natural-understanding system, fourth data corresponding to a request for the first information; and receiving, from the second natural-understanding system, fifth data corresponding to the first information.

13. A computing device comprising: at least one processor; and at least one memory including instructions that, when executed by the at least one processor, cause the computing device to: receive input data corresponding to a command; determine, using a first natural-understanding system, that the command is associated with a second natural-understanding system; determine a first indication that the second natural-understanding system is capable of processing first data corresponding to the command; send, from the first natural-understanding system to the second natural-understanding system, the first data; receive, from the second natural-understanding system, second data corresponding to a first response to the command, the second data including a second indication of the second natural-understanding system; determine, by the first natural-understanding system and based at least in part on the second data, third data corresponding to a second response to the command, the third data including a third indication of the first natural-understanding system; and cause output corresponding to the third data associated with the first natural-understanding system.

14. The computing device of claim 13, wherein the at least one memory further includes instructions that, when executed by the at least one processor, further cause the computing device to: prior to causing the output, determine that the second natural-understanding system is associated with a request for permission
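Two of the dependent claims lend themselves to a short illustration: claim 10 (associating a command with a system through its domain) and claim 12 (requesting information that the returned data lacks). The following is a rough sketch under stated assumptions, not the claimed implementation. The keyword and domain tables, the system names, and the `request_info` callback are all hypothetical; a real system would presumably use a trained natural-language-understanding component rather than keyword matching.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical lookup tables, invented for this example.
KEYWORD_TO_DOMAIN: Dict[str, str] = {"sunroof": "automotive", "order": "shopping"}
DOMAIN_TO_SYSTEM: Dict[str, str] = {
    "automotive": "CarAssistant",
    "shopping": "HomeAssistant",
}


def determine_domain(command: str) -> Optional[str]:
    """Return the first domain whose keyword appears in the command, if any."""
    for word in command.lower().split():
        if word in KEYWORD_TO_DOMAIN:
            return KEYWORD_TO_DOMAIN[word]
    return None


def select_system(command: str, default: str = "HomeAssistant") -> str:
    """Claim 10 style routing: determine a domain, then the matching system."""
    domain = determine_domain(command)
    if domain is None:
        return default
    return DOMAIN_TO_SYSTEM.get(domain, default)


def complete_response(
    second_data: Dict[str, str],
    required: List[str],
    request_info: Callable[[str], str],
) -> Dict[str, str]:
    """Claim 12 style follow-up: request each required field that is missing."""
    completed = dict(second_data)  # copy so the caller's data is not mutated
    for field in required:
        if field not in completed:
            completed[field] = request_info(field)
    return completed


print(select_system("open the sunroof"))
print(complete_response({"status": "ok"}, ["status", "eta"], lambda f: "5 minutes"))
```

Here `request_info` stands in for the round trip to the second system; passing it as a callback keeps the sketch free of any assumed network or API details.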
Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title
of application context · CPC title
Semantic analysis · CPC title
Named entity recognition · CPC title
Recognition of textual entities · CPC title