Multi-assistant natural language input processing

US11636851B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11636851-B2
Application number: US-202117387157-A
Country: US
Kind code: B2
Filing date: Jul 28, 2021
Priority date: Sep 24, 2019
Publication date: Apr 25, 2023
Grant date: Apr 25, 2023

Abstract

Official abstract text for this publication.

Techniques for a natural language processing (NLP) system to implement more than one assistant are described. The NLP system may receive a natural language input corresponding to more than one user command. The NLP system may respond to a first command, of the natural language input, using a TTS voice of a first NLP system assistant. The NLP system may respond to a second command, of the natural language input, using a TTS voice of a second NLP system assistant.
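As an illustration only, the behavior summarized in the abstract can be sketched as a small routing step: one utterance is split into commands, and each command is answered in the voice of the assistant it addresses. The wakewords, voice identifiers, and the `split_commands` and `respond` helpers below are hypothetical and do not come from the patent; the text responses stand in for real speech recognition and TTS output.

```python
from __future__ import annotations

from dataclasses import dataclass

# Hypothetical wakeword -> TTS voice mapping (assumption, not from the patent).
WAKEWORD_TO_VOICE = {"alpha": "voice_alpha", "beta": "voice_beta"}


@dataclass
class Response:
    voice: str  # identifier of the assistant voice used for the reply
    text: str   # reply text that a TTS engine would render in that voice


def split_commands(utterance: str) -> list[tuple[str, str]]:
    """Split a transcribed utterance into (wakeword, command) pairs."""
    pairs: list[tuple[str, str]] = []
    current_wakeword = None
    words: list[str] = []
    for token in utterance.lower().replace(",", " ").split():
        if token in WAKEWORD_TO_VOICE:
            # A new wakeword closes the command collected so far.
            if current_wakeword is not None and words:
                pairs.append((current_wakeword, " ".join(words)))
            current_wakeword = token
            words = []
        elif current_wakeword is not None:
            words.append(token)
    if current_wakeword is not None and words:
        pairs.append((current_wakeword, " ".join(words)))
    return pairs


def respond(utterance: str) -> list[Response]:
    """Answer each command in the voice tied to its wakeword."""
    return [
        Response(voice=WAKEWORD_TO_VOICE[wakeword], text=f"OK: {command}")
        for wakeword, command in split_commands(utterance)
    ]
```

For example, `respond("Alpha, play jazz, Beta, set a timer")` yields two responses, the first tagged with `voice_alpha` and the second with `voice_beta`. Keying on wakewords is only one possible signal; the claims also allow the voice to depend on the command itself.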

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method, comprising: receiving, by a system corresponding to at least a first assistant voice and a second assistant voice, first audio data representing a first spoken natural language input; performing speech processing using the first audio data to determine a first portion corresponding to a first command; determining first output data responsive to the first spoken natural language input; based at least in part on the first command, selecting first data configured to generate synthesized speech in the first assistant voice; performing speech synthesis processing using the first data and the first output data to generate second audio data; and causing the second audio data to be output.

2. The computer-implemented method of claim 1, wherein the first command corresponds to a first wakeword associated with the first assistant voice.

3. The computer-implemented method of claim 1, wherein the first assistant voice corresponds to a first virtual assistant and wherein the method further comprises: determining at least a portion of the first output data based at least in part on the first virtual assistant.

4. The computer-implemented method of claim 1, further comprising: receiving third audio data representing a second spoken natural language input corresponding to a same dialog as the first spoken natural language input; performing speech processing using the third audio data to determine a second portion corresponding to a second command; determining second output data responsive to the second spoken natural language input; based at least in part on the second command, selecting second data configured to generate synthesized speech in the second assistant voice; performing speech synthesis processing using the second data and the second output data to generate fourth audio data; and causing the fourth audio data to be output.

5. The computer-implemented method of claim 1, further comprising: determining second output data responsive to a second portion of the first spoken natural language input; based at least in part on the second portion, selecting second data configured to generate synthesized speech in the second assistant voice; performing speech synthesis processing using the second data and the second output data to generate fourth audio data; and causing the fourth audio data to be output.

6. The computer-implemented method of claim 5, further comprising: including, in the first output data, an identifier corresponding to the second assistant voice, wherein the second audio data includes a representation of the identifier.

7. The computer-implemented method of claim 1, further comprising: processing the first audio data to determine a user identifier corresponding to the first spoken natural language input, wherein selection of the first assistant voice is further based at least in part on the user identifier.

8. The computer-implemented method of claim 1, further comprising: determining use of the first assistant voice is enabled with respect to the first spoken natural language input.

9. The computer-implemented method of claim 1, wherein the first audio data is received from a first device and the method further comprises: after selecting the first data, associating an assistant identifier with the first device, wherein the assistant identifier corresponds to the first assistant voice.

10. The computer-implemented method of claim 1, wherein the first audio data is received from a first device and the method further comprises: after causing the second audio data to be output, deleting an association between an assistant identifier and the first device, wherein the assistant identifier corresponds to the first assistant voice.

11. A system comprising: at least one processor; and at least one memory comprising instructions that, when executed by the at least one processor, cause the system to: receive, by a system corresponding to at least a first assistant voice and a second assistant voice, first audio data representing a first spoken natural language input; perform speech processing using the first audio data to determine a first portion corresponding to a first command; determine first output data responsive to the first spoken natural language input; based at least in part on the first command, select first data configured to generate synthesized speech in the first assistant voice; perform speech synthesis processing using the first data and the first output data to generate second audio data; and cause the second audio data to be output.

12. The system of claim 11, wherein the first command corresponds to a first wakeword associated with the first assistant voice.

13. The system of claim 11, wherein the first assistant voice corresponds to a first virtual assistant and wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine at least a portion of the first output data based at least in part on the first virtual assistant.

14. The system of claim 11, wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: receive third audio data representing a second spoken natural language input corresponding to a same dialog as the first spoken natural language input; perform speech processing using the third audio data to determine a second portion corresponding to a second command; determine second output data responsive to the second spoken natural language input; based at least in part on the second command, select second data configured to generate synthesized speech in the second assistant voice; perform speech synthesis processing using the second data and the second output data to generate fourth audio data; and cause the fourth audio data to be output.

15. The system of claim 11, wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine second output data responsive to a second portion of the first spoken natural language input; based at least in part on the second portion, select second data configured to generate synthesized speech in the second assistant voice; perform speech synthesis processing using the second data and the second output data to generate fourth audio data; and cause the fourth audio data to be output.

16. The system of claim 15, wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: include, in the first output data, an identifier corresponding to the second assistant voice, wherein the second audio data includes a representation of the identifier.

17. The system of claim 11, wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: process the first audio data to determine a user identifier corresponding to the first spoken natural language input, wherein selection of the first assistant voice is further based at least in part on the user identifier.

18. The system of claim 11, wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine use of the first assistant voice is enabled with respect to the first spoken natural language input.
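The dependent claims add bookkeeping around voice selection: a per-user enablement check (claims 7 and 8) and an assistant identifier that is associated with the device after selection and deleted after output (claims 9 and 10). A minimal sketch of that lifecycle follows, assuming simple in-memory dictionaries. The `AssistantSession` class, its field names, and the `synthesize` stand-in are hypothetical; the patent does not prescribe any particular data structure or TTS interface.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class AssistantSession:
    # All three mappings are illustrative assumptions, not taken from the patent.
    command_to_assistant: dict[str, str]   # command -> assistant identifier
    enabled_for_user: dict[str, set[str]]  # user identifier -> enabled assistants
    device_to_assistant: dict[str, str] = field(default_factory=dict)

    def select_voice(self, device_id: str, user_id: str, command: str) -> Optional[str]:
        """Pick an assistant voice from the command, gated by the user's settings."""
        assistant = self.command_to_assistant.get(command)
        if assistant is None:
            return None
        # Claims 7-8: selection also depends on the user and on enablement.
        if assistant not in self.enabled_for_user.get(user_id, set()):
            return None
        # Claim 9: after selection, associate the assistant with the device.
        self.device_to_assistant[device_id] = assistant
        return assistant

    def finish_output(self, device_id: str) -> None:
        """Claim 10: after the audio is output, delete the device association."""
        self.device_to_assistant.pop(device_id, None)


def synthesize(assistant_id: str, text: str) -> bytes:
    """Stand-in for TTS: tags the text with the voice instead of producing audio."""
    return f"[{assistant_id}] {text}".encode("utf-8")
```

A caller would run `select_voice`, pass the result and the response text to `synthesize`, output the audio, and then call `finish_output`. Returning `None` when the assistant is not enabled leaves the fallback behavior to the caller, which the claims do not specify.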

Assignees

Amazon Tech Inc

Inventors

Classifications

  • G10L13/033 (Primary)

    Voice editing, e.g. manipulating the voice of the synthesiser · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Parsing for meaning understanding · CPC title

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Speech synthesis; Text to speech systems · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11636851B2 cover?
Techniques for a natural language processing (NLP) system to implement more than one assistant are described. The NLP system may receive a natural language input corresponding to more than one user command. The NLP system may respond to a first command, of the natural language input, using a TTS voice of a first NLP system assistant. The NLP system may respond to a second command, of the natura…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L13/033. Mapped technology areas include Physics.
When was this patent published?
Publication date: Apr 25, 2023 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).