Voice recognition system for use with a personal media streaming appliance

US11308947B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11308947-B2
Application number  US-201815973240-A
Country             US
Kind code           B2
Filing date         May 7, 2018
Priority date       May 7, 2018
Publication date    Apr 19, 2022
Grant date          Apr 19, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.
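
The flow in the abstract (receive a voice command, convert it to text, transmit the text to the playback device, execute it) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: `PlaybackDevice`, `handle_voice_command`, and `fake_speech_to_text` are hypothetical names, and the recognizer is a placeholder that treats the "audio" as UTF-8 bytes of the spoken words.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class PlaybackDevice:
    """Hypothetical stand-in for the media playback device."""
    library: Dict[str, List[str]]  # maps a set name to its audio tracks
    now_playing: List[str] = field(default_factory=list)

    def execute(self, text_command: str) -> bool:
        # Only a "play <set name>" command is handled in this sketch.
        verb, _, target = text_command.partition(" ")
        if verb == "play" and target in self.library:
            self.now_playing = list(self.library[target])
            return True
        return False


def handle_voice_command(audio: bytes,
                         speech_to_text: Callable[[bytes], str],
                         device: PlaybackDevice) -> bool:
    """Receive audio, convert it to text, and send the text to the device."""
    text_command = speech_to_text(audio).strip().lower()
    return device.execute(text_command)


def fake_speech_to_text(audio: bytes) -> str:
    # Placeholder recognizer (assumption): the bytes already spell the words.
    return audio.decode("utf-8")


device = PlaybackDevice(library={"road trip": ["Track A", "Track B"]})
ok = handle_voice_command(b"Play road trip", fake_speech_to_text, device)
```

A real system would replace `fake_speech_to_text` with an actual speech recognition service; the callable parameter is there so that swap does not touch the rest of the flow.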

First claim

Opening claim text (preview).

What is claimed is:

1. A method for playing a media content item via a voice command, the method comprising: receiving an audio clip of an instruction, the instruction comprising: an activation trigger portion identifying a wake signal, a command portion identifying intent information, and a parameter portion identifying a shortcut; identifying the instruction by converting the audio clip to a text version and determining the intent information and the shortcut from the text version; identifying a plurality of media content items associated with the shortcut; associating the shortcut with a physical button located on a media playback device; transmitting the media content items to the media playback device for playback; obtaining a second audio clip of a save instruction during playback of the media content items, the save instruction having a second command portion and a second parameter portion identifying a shortcut alias, and the second command portion including a save request; identifying the save instruction by converting the second audio clip to a second text version and determining the second command portion and the shortcut alias from the second text version; and associating the plurality of media content items with the shortcut alias of the media playback device; and associating the shortcut alias with the physical button located on the medial playback device.

2. The method according to claim 1, further comprising: prior to identifying the media content item, transmitting the shortcut to the media playback device; and receiving a shortcut signal from the shortcut of the media playback device, wherein the media content item is identified based on the received shortcut.

3. The method according to claim 1, wherein the second parameter portion of the save instruction includes a shortcut number predetermined for the second shortcut alias.

4. The method according to claim 1, wherein the second parameter portion of the save instruction includes a set of one or more words given by a user who provides the save instruction.

5. The method according to claim 1, wherein the save shortcut command portion includes a set of one or more words automatically generated by at least one computing device.

6. The method according to claim 1, wherein when the instruction does not comprise the wake signal, the command portion and the parameter portion are not converted to a text version.

7. The method according to claim 1, wherein the save instruction further comprises an activation trigger portion, the activation portion identifying a wake phrase.

8. The method according to claim 7, wherein when the save instruction does not comprise the wake signal, the save shortcut command is not converted to a text version.

9. A system for operating a voice command interface configured to control a media playback device, the system comprising: a speech recognition engine configured to: receive an audio clip of an instruction, the instruction comprising: an activation trigger portion identifying a wake signal, a command portion identifying intent information, and the parameter portion identifying a shortcut; and a speech analysis engine configured to: identify the instruction by converting the audio clip to a text version and determining the intent information and the shortcut from the text version; identify a plurality of media content items associated with the shortcut; associate the shortcut with a physical button located on a media playback device; transmit the media content items to the media playback device to perform the command; obtain a second audio clip of a save instruction during playback of the media content items, the save instruction having a second command portion and a second parameter portion identifying a shortcut alias, and the second command portion including a save request; identify the save instruction by converting the second audio clip to a second text version and determining the second command portion and the shortcut alias from the second text version; and associate the plurality of media content items with the shortcut alias of the media playback device; and associate the shortcut alias with the physical button located on the medial playback device.

10. The system according to claim 9, wherein the second command portion includes a shortcut number predetermined for the shortcut alias.

11. The system according to claim 9, wherein the second command portion includes a set of one or more words given by a user who provides the save instruction.

12. The system according to claim 9, wherein the second command portion includes a set of one or more words automatically generated by at least one computing device.

13. The system according to claim 9, wherein the save instruction further comprises an activation trigger portion, the activation portion identifying a wake phrase.

14. The system according to claim 13, wherein when the save instruction does not comprise the wake signal, the save shortcut command is not converted to a text version.

15. The system according to claim 9, wherein when the instruction does not comprise the wake signal, the command portion and the parameter portion are not converted to a text version.
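
To make the structure of claim 1 easier to follow, here is a minimal sketch of its steps: split an instruction into wake signal, command, and parameter; look up the media content items for a shortcut; bind the shortcut to a physical button; then handle a save instruction that stores the playing items under a shortcut alias. All names (`WAKE_SIGNAL`, `INTENTS`, `parse_instruction`, `ShortcutStore`) and the wake phrase are assumptions made for illustration. As a simplification, the sketch checks for the wake signal on text, whereas claims 6 and 8 describe skipping the text conversion entirely when the wake signal is absent.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

WAKE_SIGNAL = "hey player"   # hypothetical wake phrase
INTENTS = ("play", "save")   # hypothetical command vocabulary


@dataclass
class Instruction:
    intent: str      # command portion
    parameter: str   # parameter portion (shortcut or shortcut alias)


def parse_instruction(text: str) -> Optional[Instruction]:
    """Return intent and parameter, or None if the wake signal is missing."""
    text = text.strip().lower()
    if not text.startswith(WAKE_SIGNAL):
        return None
    rest = text[len(WAKE_SIGNAL):].strip(" ,")
    intent, _, parameter = rest.partition(" ")
    if intent not in INTENTS or not parameter:
        return None
    return Instruction(intent, parameter.strip())


class ShortcutStore:
    """Hypothetical store mapping shortcuts to items and buttons to shortcuts."""

    def __init__(self, shortcuts: Dict[str, List[str]]) -> None:
        self.shortcuts = dict(shortcuts)
        self.buttons: Dict[int, str] = {}
        self.current: List[str] = []   # items currently playing

    def handle(self, text: str, button: int) -> bool:
        instruction = parse_instruction(text)
        if instruction is None:
            return False
        if instruction.intent == "play":
            items = self.shortcuts.get(instruction.parameter)
            if items is None:
                return False
            self.buttons[button] = instruction.parameter
            self.current = list(items)
            return True
        # Save request: only meaningful while something is playing.
        if not self.current:
            return False
        self.shortcuts[instruction.parameter] = list(self.current)
        self.buttons[button] = instruction.parameter
        return True


store = ShortcutStore({"workout": ["Song 1", "Song 2"]})
played = store.handle("Hey player, play workout", button=1)
saved = store.handle("Hey player, save morning run", button=1)
```

After both calls, button 1 points at the alias `morning run`, which holds the same items that were playing from `workout`.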

Assignees

Spotify AB

Inventors

Classifications

  • G06F3/167 (Primary)

    Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • Execution procedure of a spoken command · CPC title

  • Management of the audio stream, e.g. setting of volume, audio stream path · CPC title

  • G10L15/22 (Primary)

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11308947B2 cover?
A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the s…
Who is the assignee on this patent?
Spotify AB
What technology area does this patent fall under?
Primary CPC classification G06F3/167. Mapped technology areas include Physics.
When was this patent published?
Publication date: Apr 19, 2022 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).