Multi-mode voice triggering for audio devices

US11922948B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11922948-B2
Application numberUS-202318137958-A
CountryUS
Kind codeB2
Filing dateApr 21, 2023
Priority dateMar 11, 2021
Publication dateMar 5, 2024
Grant dateMar 5, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Implementations of the subject technology provide systems and methods for multi-mode voice triggering for audio devices. An audio device may store multiple voice recognition models, each trained to detect a single corresponding trigger phrase. So that the audio device can detect a specific one of the multiple trigger phrases without consuming the processing and/or power resources to run a voice recognition model that can differentiate between different trigger phrases, the audio device pre-loads a selected one of the voice recognition models for an expected trigger phrase into a processor of the audio device. The audio device may select the one of the voice recognition models for the expected trigger phrase based on a type of a companion device that is communicatively coupled to the audio device.

First claim

Opening claim text (preview).

What is claimed is: 1. An electronic device, comprising: a memory; and at least one processor configured to: provide, to an audio output device, an indicator of a particular virtual assistant at the electronic device; receive a trigger signal from the audio output device prior to receiving audio input data from the audio output device, wherein the trigger signal corresponds to the particular virtual assistant; and activate, responsive to the trigger signal, the particular virtual assistant for receiving the audio input data from the audio output device. 2. The electronic device of claim 1 , wherein the particular virtual assistant corresponds to a manufacturer of the electronic device that is different from a manufacturer of the audio output device. 3. The electronic device of claim 1 , wherein the at least one processor is further configured to provide audio content to the audio output device for output by the audio output device. 4. The electronic device of claim 1 , wherein the particular virtual assistant is one of multiple virtual assistants at the electronic device. 5. The electronic device of claim 1 , wherein the at least one processor is configured to provide the indicator of the particular virtual assistant at the electronic device to the audio output device while establishing a connection with the audio output device. 6. The electronic device of claim 5 , wherein the at least one processor is configured to provide the indicator of the particular virtual assistant at the electronic device to the audio output device while establishing the connection with the audio output device by including the indicator in connection information associated with establishing the connection. 7. The electronic device of claim 1 , wherein at least one processor is further configured to perform, with the activated particular virtual assistant, an active listening operation in cooperation with the audio output device based on the audio input data from the audio output device. 8. The electronic device of claim 7 , wherein the at least one processor is configured to perform the active listening operation, at least in part, by: receiving voice data that is based on voice input captured by a microphone of the audio output device, from the audio output device; and providing the voice data to one or more voice recognition models at the electronic device that are trained for identifying words and phrases for voice control of the particular virtual assistant. 9. The electronic device of claim 1 , wherein the trigger signal has been generated by the audio output device responsive to an output from a voice recognition model at the audio output device, the voice recognition model having been selected by the audio output device based on the indicator provided by the electronic device. 10. A method, comprising: providing, to an audio output device from an electronic device, an indicator of a particular virtual assistant at the electronic device; receiving a trigger signal from the audio output device at the electronic device prior to receiving audio input data from the audio output device, wherein the trigger signal corresponds to the particular virtual assistant; and activating, by the electronic device responsive to the trigger signal, the particular virtual assistant for receiving the audio input data from the audio output device. 11. The method of claim 10 , wherein the particular virtual assistant corresponds to a manufacturer of the electronic device that is different from a manufacturer of the audio output device. 12. The method of claim 10 , further comprising providing audio content to the audio output device for output by the audio output device. 13. The method of claim 10 , wherein the particular virtual assistant is one of multiple virtual assistants at the electronic device. 14. The method of claim 10 , wherein providing the indicator of the particular virtual assistant at the electronic device to the audio output device comprises providing the indicator to the audio output device while establishing a connection with the audio output device. 15. The method of claim 14 , wherein providing the indicator of the particular virtual assistant at the electronic device to the audio output device while establishing the connection with the audio output device comprises including the indicator in connection information associated with establishing the connection. 16. The method of claim 10 , further comprising performing, with the activated particular virtual assistant, an active listening operation in cooperation with the audio output device based on the audio input data from the audio output device. 17. The method of claim 16 , wherein the active listening operation comprises: receiving voice data that is based on voice input captured by a microphone of the audio output device, from the audio output device; and providing the voice data to one or more voice recognition models at the electronic device that are trained for identifying words and phrases for voice control of the particular virtual assistant. 18. The method of claim 10 , wherein the trigger signal has been generated by the audio output device responsive to an output from a voice recognition model at the audio output device, the voice recognition model having been selected by the audio output device based on the indicator provided by the electronic device. 19. A processor configured to: provide, to an audio output device that does not include the processor, an indicator of a particular virtual assistant accessible by the processor; receive a trigger signal from the audio output device prior to receiving audio input data from the audio output device, wherein the trigger signal corresponds to the particular virtual assistant; and activate, responsive to the trigger signal, the particular virtual assistant for receiving the audio input data from the audio output device. 20. The processor of claim 19 , wherein the particular virtual assistant is one of multiple virtual assistants accessible by the processor.

Assignees

Inventors

Classifications

  • G10L15/32Primary

    Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems · CPC title

  • Recognition networks (G10L15/142, G10L15/16 take precedence) · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title

  • Word spotting · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11922948B2 cover?
Implementations of the subject technology provide systems and methods for multi-mode voice triggering for audio devices. An audio device may store multiple voice recognition models, each trained to detect a single corresponding trigger phrase. So that the audio device can detect a specific one of the multiple trigger phrases without consuming the processing and/or power resources to run a voice…
Who is the assignee on this patent?
Apple Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/32. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 05 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).