VAS toggle based on device orientation

US11862161B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11862161-B2
Application number: US-202117536572-A
Country: US
Kind code: B2
Filing date: Nov 29, 2021
Priority date: Oct 22, 2019
Publication date: Jan 2, 2024
Grant date: Jan 2, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

As noted above, example techniques relate to toggling a cloud-based VAS between enabled and disabled modes. An example implementation involves a NMD detecting that the housing is in a first orientation and enabling a first mode. Enabling the first mode includes disabling voice input processing via a cloud-based VAS and enabling local voice input processing. In the first mode, the NMD captures sound data associated with a first voice input and detects, via a local natural language unit, that the first voice input comprises sound data matching one or more keywords. The NMD determines an intent of the first voice input and performs a first command according to the determined intent. The NMD may detect that the housing is in a second orientation and enables the second mode. Enabling the second mode includes enabling voice input processing via the cloud-based VAS.
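The mode switch described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class `Nmd`, the orientation labels `"first"` and `"second"`, and the `KEYWORD_LIBRARY` contents are all hypothetical, and a text transcript stands in for the captured sound data that a real local natural language unit would match against.

```python
from dataclasses import dataclass, field
from enum import Enum


class Mode(Enum):
    LOCAL_ONLY = "first mode"   # cloud VAS disabled, local processing enabled
    CLOUD_VAS = "second mode"   # cloud VAS enabled


# Hypothetical local keyword library: keyword -> intent name.
KEYWORD_LIBRARY = {
    "play": "start_playback",
    "pause": "pause_playback",
    "lights": "toggle_lights",
}


@dataclass
class Nmd:
    mode: Mode = Mode.CLOUD_VAS
    performed: list = field(default_factory=list)

    def on_orientation(self, orientation: str) -> None:
        # Assumption: "first"/"second" stand in for whatever sensor
        # reading identifies each housing orientation.
        if orientation == "first":
            self.mode = Mode.LOCAL_ONLY
        elif orientation == "second":
            self.mode = Mode.CLOUD_VAS

    def on_voice_input(self, transcript: str):
        # Local keyword matching only runs while the first mode is enabled.
        if self.mode is not Mode.LOCAL_ONLY:
            return None
        for word in transcript.lower().split():
            intent = KEYWORD_LIBRARY.get(word)
            if intent is not None:
                # "Perform" the command by recording the determined intent.
                self.performed.append(intent)
                return intent
        return None
```

In this sketch, calling `on_orientation("first")` and then `on_voice_input("please pause the music")` records the `pause_playback` intent without involving any cloud service, while the same input in the second mode is left for the cloud-based path, which is not modeled here.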

First claim

Opening claim text (preview).

The invention claimed is:

1. A network microphone device comprising: at least one physical control; one or more microphones; a network interface; at least one processor; and a housing carrying the one or more microphones, the network interface, the at least one processor, and data storage including instructions that are executable by the at least one processor such that the network microphone device is configured to: detect a first input to the at least one physical control; after detection of the first input to the at least one physical control, enable a first mode, wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the first mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) disable voice input processing via a cloud-based voice assistant service and (b) enable voice input processing via a local natural language unit; while the first mode is enabled: (i) capture sound data associated with a first voice input via the one or more microphones and (ii) detect, via the local natural language unit, that the first voice input comprises sound data matching one or more keywords from a local natural language unit library of the local natural language unit; determine, via the local natural language unit, an intent of the first voice input based on at least one of the one or more keywords; perform a first command according to the determined intent of the first voice input; detect a second input to the at least one physical control; and after detection of the second input to the at least one physical control, enable a second mode, wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the second mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) enable voice input processing via the cloud-based voice assistant service and (b) disable voice input processing via the local natural language unit.

2. The network microphone device of claim 1, wherein the at least one physical control comprises a physical toggle control, and wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to detect the second input to the at least one physical control comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: detect that the second input toggled the physical toggle control from a first position associated with the first mode to a second position associated with the second mode.

3. The network microphone device of claim 1, wherein the network microphone device is paired to a network internet-of-things (IoT) device, and wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to perform the first command comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: transmit, via the network interface over a local area network, an instruction to the network IoT device.

4. The network microphone device of claim 3, wherein the network IoT device comprises a smart illumination device, and wherein the first command comprises a command to toggle an illumination state of the smart illumination device.

5. The network microphone device of claim 3, wherein the network microphone device is paired to a playback device carried in a separate housing from the network IoT device, and wherein the playback device is configured to process playback commands transmitted to the playback device from one or more remote computing devices of the cloud-based voice assistant service.

6. The network microphone device of claim 1, wherein the instructions are executable by the at least one processor such that the network microphone device is further configured to: while the second mode is enabled, (i) capture additional sound data associated with a second voice input via the one or more microphones; (ii) detect a wake-word in the additional sound data; and (iii) after detection of the wake-word, transmit at least a portion of the additional sound data to one or more remote computing devices of the cloud-based voice assistant service for processing of the second voice input.

7. The network microphone device of claim 1, wherein the one or more microphones comprise at least one first microphone and at least one second microphone, wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the first mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) enable the at least one first microphone and (b) disable the at least one second microphone, and wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the second mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) disable the at least one first microphone and (b) enable the at least one second microphone.

8. A system comprising: a network microphone device, the network microphone device comprising: at least one physical control; one or more microphones; a network interface; at least one processor; and a housing carrying the one or more microphones, the network interface, the at least one processor, and data storage including instructions that are executable by the at least one processor such that the network microphone device is configured to: detect a first input to the at least one physical control; after detection of the first input to the at least one physical control, enable a first mode, wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the first mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) disable voice input processing via a cloud-based voice assistant service and (b) enable voice input processing via a local natural language unit; while the first mode is enabled: (i) capture sound data associated with a first voice input via the one or more microphones and (ii) detect, via the local natural language unit, that the first voice input comprises sound data matching one or more keywords from a local natural language unit library of the local natural language unit; determine, via the local natural language unit, an intent of the first voice input based on at least one of the one or more keywords; perform a first command according to the determined intent of the first voice input; detect a second input to the at least one physical control; and after detection of the second input to the at least one physical control, enable a second mode, wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the second mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) enable voice input processing via the cloud-based voice assistant service and (b) disable voice input processing via the local natural language unit.

9. The system of claim 8, wherein the at least one physical control comprises a physical toggle control, and wherein the i
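As a rough illustration of how the physical-control toggle in claim 1 might combine with the microphone switching of claim 7 and the wake-word forwarding of claim 6, consider the sketch below. Everything named here is an assumption made for illustration: `ModeState`, `ToggleNmd`, the position labels, and the `WAKE_WORD` string do not come from the patent, and text transcripts again stand in for sound data.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class ModeState:
    cloud_vas_enabled: bool
    local_nlu_enabled: bool
    first_mics_enabled: bool
    second_mics_enabled: bool


# First mode: cloud off, local NLU on, first microphone set on.
FIRST_MODE = ModeState(False, True, True, False)
# Second mode: cloud on, local NLU off, second microphone set on.
SECOND_MODE = ModeState(True, False, False, True)

WAKE_WORD = "hey assistant"  # hypothetical wake-word


@dataclass
class ToggleNmd:
    state: ModeState = SECOND_MODE
    sent_to_cloud: list = field(default_factory=list)

    def on_toggle(self, position: str) -> None:
        # Assumption: the physical toggle reports "first" or "second".
        self.state = FIRST_MODE if position == "first" else SECOND_MODE

    def on_sound(self, transcript: str) -> str:
        """Return where the input is routed: 'cloud', 'local', or 'ignored'."""
        text = transcript.lower()
        if self.state.cloud_vas_enabled and text.startswith(WAKE_WORD):
            # Forward only the portion after the wake-word.
            self.sent_to_cloud.append(text[len(WAKE_WORD):].strip())
            return "cloud"
        if self.state.local_nlu_enabled:
            return "local"
        return "ignored"
```

Modeling each mode as one immutable record keeps the four enable/disable flags consistent with each other, so a toggle cannot leave the device with, for example, both the cloud service and the local unit enabled.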

Assignees

Sonos Inc

Inventors

Classifications

  • G10L15/22 (Primary)

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Interaction with lists of selectable items, e.g. menus · CPC title

  • Selection of displayed objects or displayed text elements (G06F3/0482 takes precedence) · CPC title

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11862161B2 cover?
As noted above, example techniques relate to toggling a cloud-based VAS between enabled and disabled modes. An example implementation involves a NMD detecting that the housing is in a first orientation and enabling a first mode. Enabling the first mode includes disabling voice input processing via a cloud-based VAS and enabling local voice input processing. In the first mode, the NMD captures s…
Who is the assignee on this patent?
Sonos Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jan 2, 2024 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).