Voice controlled media playback system

US11556306B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11556306-B2
Application number: US-202016989805-A
Country: US
Kind code: B2
Filing date: Aug 10, 2020
Priority date: Feb 22, 2016
Publication date: Jan 17, 2023
Grant date: Jan 17, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection. Read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are systems and methods for receiving a voice command and determining an appropriate action for the media playback system to execute based on user identification. The systems and methods receive a voice command for a media playback system, and determine whether the voice command was received from a registered user of the media playback system. In response to determining that the voice command was received from a registered user, the systems and methods configure an instruction for the media playback system based on content from the voice command and information in a user profile for the registered user.
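
The flow in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the names UserProfile, REGISTERED_USERS and configure_instruction, the profile fields, and the choice to return None for an unregistered speaker are all assumptions made for illustration. The abstract does not say how the speaker is identified or what happens when the speaker is not registered.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class UserProfile:
    # Hypothetical profile fields; the abstract only says "information in a user profile".
    name: str
    music_account: str
    preferred_volume: int


# Hypothetical registry of registered users, keyed by an already-resolved speaker id.
REGISTERED_USERS: Dict[str, UserProfile] = {
    "alice": UserProfile("alice", "alice-music-account", 30),
}


def configure_instruction(speaker_id: Optional[str], command: str) -> Optional[dict]:
    """Build an instruction from the command content plus the speaker's profile.

    Returns None when the speaker is not registered (an assumption; the
    abstract leaves this case open).
    """
    profile = REGISTERED_USERS.get(speaker_id) if speaker_id is not None else None
    if profile is None:
        return None
    return {
        "command": command,
        "account": profile.music_account,
        "volume": profile.preferred_volume,
    }
```

For example, configure_instruction("alice", "play jazz") yields an instruction that carries Alice's account and preferred volume, while an unknown speaker id yields None.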

First claim

Opening claim text (preview).

The invention claimed is:

1. Tangible, non-transitory computer-readable media having instructions encoded thereon, wherein the instructions, when executed by one or more processors, cause a computing device to perform operations comprising: associating a first profile of a media playback system with (i) a first wakeup word and (ii) a first music service account; associating a second profile of the media playback system with (i) a second wakeup word and (ii) a second music service account; receiving, via a network microphone device of the media playback system, a first voice input comprising (i) the first wakeup word and (ii) a first voice command, wherein the first wakeup word triggers a time period for the media playback system to receive additional voice commands; after receiving the first voice input, identifying the first profile based on the first wakeup word, configuring a first instruction based on (i) the first voice command and (ii) the first music service account, and transmitting the first instruction to a remote computing device configured to access content via the first music service account; after the time period has expired, receiving, via the network microphone device, a second voice input comprising (i) the second wakeup word and (ii) a second voice command; and after receiving the second voice input, identifying the second profile based on the second wakeup word, configuring a second instruction based on (i) the second voice command and (ii) the second music service account, and transmitting the second instruction to a remote computing device configured to access content via the second music service account.

2. The tangible, non-transitory computer-readable media according to claim 1, wherein the first profile specifies first voice identifying information associated with a first user's voice and the second profile specifies second voice identifying information associated with a second user's voice, wherein: identifying the first profile after receiving the first voice input further comprises determining that the first voice input is associated with the first voice identifying information; and identifying the second profile after receiving the second voice input further comprises determining that the second voice input is associated with the second voice identifying information.

3. The tangible, non-transitory computer-readable media according to claim 2, wherein: determining that the first voice input is associated with the first voice identifying information further comprises determining that a frequency and tone associated with the first voice input correlates with a frequency and tone specified in the first voice identifying information; and determining that the second voice input is associated with the second voice identifying information further comprises determining that a frequency and tone associated with the second voice input correlates with a frequency and tone specified in the second voice identifying information.

4. The tangible, non-transitory computer-readable media according to claim 1, wherein the second profile is associated with one or more restrictions, wherein the operations comprise: determining whether the second voice command is in conflict with the restriction, wherein transmitting the second instruction to the remote computing device occurs after first determining that the second voice command is not in conflict with the one or more restrictions.

5. The tangible, non-transitory computer-readable media according to claim 4, wherein the one or more restrictions specify a maximum volume, wherein transmitting a second instruction to the remote computing device to increase a volume of the remote computing device occurs after first determining that the volume of the remote computing device is below the maximum volume.

6. The tangible, non-transitory computer-readable media according to claim 4, wherein the one or more restrictions specify one or more restricted audio content types, wherein transmitting a second instruction to the remote computing device to play a particular type of audio content occurs after first determining that the particular type of audio content does not correspond to one of the one or more restricted audio content types.

7. The tangible, non-transitory computer-readable media according to claim 1, wherein configuring the second instruction further comprises: specifying, within the second instruction, one or more of: a preferred playback volume and a preferred audio equalization setting associated with the second profile.

8. The tangible, non-transitory computer-readable media according to claim 1, wherein different combinations of audio content types are associated with different profiles, wherein identifying the second profile based on the second wakeup word, further comprises: determining the second profile based on a type of audio content requested via the second voice input.

9. The tangible, non-transitory computer-readable media according to claim 1, wherein different combinations of rooms types are associated with different profiles and wherein the network microphone device is one of a plurality of network microphones devices that are provided in different rooms of a dwelling, wherein identifying the second profile based on the second wakeup word, further comprises: determining the second profile based on a room associated with a microphone device through which the second voice input is received.

10. The tangible, non-transitory computer-readable media according to claim 1, further comprising: specifying in the first profile and the second profile, information indicative of one or more of: an age range, one or more room locations, one or more playback settings, one or more playlists, one or more audio content types, one or more restrictions, voice identifying information, a gender, or an audio content playback history.

11. A system comprising: one or more processors; and a memory in communication with the one or more processors, wherein the memory stores instruction code that when executed by the one or more processors, causes the system to perform operations comprising: associating a first profile of a media playback system with (i) a first wakeup word and (ii) a first music service account; associating a second profile of the media playback system with (i) a second wakeup word and (ii) a second music service account; receiving, via a network microphone device of the media playback system, a first voice input comprising (i) the first wakeup word and (ii) a first voice command, wherein the first wakeup word triggers a time period for the media playback system to receive additional voice commands; after receiving the first voice input, identifying the first profile based on the first wakeup word, configuring a first instruction based on (i) the first voice command and (ii) the first music service account, and transmitting the first instruction to a remote computing device configured to access content via the first music service account; after the time period has expired, receiving, via the network microphone device, a second voice input comprising (i) the second wakeup word and (ii) a second voice command; and after receiving the second voice input, identifying the second profile based on the second wakeup word, configuring a second instruction based on (i) the second voice command and (ii) the second music service account, and transmitting the second instruction to a remote computing device configured to access content via the second music service account.

12. The system according to claim 11, wherein the first profile specifies first voice identifying information associated wit
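
The wakeup-word routing recited in claim 1, together with the maximum-volume restriction of claim 5, can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the claimed implementation: Profile, VoiceRouter, permits_volume_increase, the 8-second default window, and the text-based wakeup-word matching are all hypothetical. The sketch also assumes that a command spoken without a wakeup word inside the time period is attributed to the most recently identified profile, which is one possible reading of "additional voice commands".

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass(frozen=True)
class Profile:
    wake_word: str
    music_account: str
    max_volume: Optional[int] = None  # optional restriction, in the spirit of claim 5


class VoiceRouter:
    """Maps a wakeup word to a profile and opens a follow-up listening window."""

    def __init__(self, profiles: Iterable[Profile], window_seconds: float = 8.0) -> None:
        self._by_wake_word: Dict[str, Profile] = {p.wake_word.lower(): p for p in profiles}
        self._window = window_seconds          # hypothetical window length
        self._active: Optional[Profile] = None
        self._window_ends = float("-inf")

    def handle(self, utterance: str, now: float) -> Optional[dict]:
        text = utterance.strip()
        lowered = text.lower()
        for wake, profile in self._by_wake_word.items():
            if lowered == wake or lowered.startswith(wake + " "):
                # A wakeup word identifies the profile and (re)starts the window.
                self._active = profile
                self._window_ends = now + self._window
                command = text[len(wake):].strip()
                return self._build(profile, command) if command else None
        # No wakeup word: accept the command only while the window is open (assumption).
        if text and self._active is not None and now <= self._window_ends:
            return self._build(self._active, text)
        return None

    @staticmethod
    def _build(profile: Profile, command: str) -> dict:
        # The instruction combines the voice command with the profile's music service account.
        return {"command": command, "account": profile.music_account}


def permits_volume_increase(profile: Profile, current_volume: int) -> bool:
    """Allow a volume increase only while the current volume is below the profile's maximum."""
    return profile.max_volume is None or current_volume < profile.max_volume
```

In this sketch, a router built with two profiles sends "hey alpha play jazz" to the first account, ignores a bare "next track" once the window has expired, and sends a later "hey beta play rock" to the second account.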

Assignees

Sonos Inc

Inventors

Classifications

  • G06F3/167 (Primary)

    Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Management of the audio stream, e.g. setting of volume, audio stream path · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • G10L17/22 (Primary)

    Interactive procedures; Man-machine interfaces · CPC title

  • Speaker identification or verification techniques · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11556306B2 cover?
Disclosed herein are systems and methods for receiving a voice command and determining an appropriate action for the media playback system to execute based on user identification. The systems and methods receive a voice command for a media playback system, and determine whether the voice command was received from a registered user of the media playback system. In response to determining that t…
Who is the assignee on this patent?
Sonos Inc
What technology area does this patent fall under?
Primary CPC classification G06F3/167. Mapped technology areas include Physics.
When was this patent published?
Publication date Jan 17, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).