Virtual assistant for media playback

US11010127B2 · US · B2

Patent metadata
Publication number: US-11010127-B2
Application number: US-201916360695-A
Country: US
Kind code: B2
Filing date: Mar 21, 2019
Priority date: Jun 29, 2015
Publication date: May 18, 2021
Grant date: May 18, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection. Read this to see what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An exemplary method for identifying media may include receiving user input associated with a request for media, where that user input includes unstructured natural language speech including one or more words; identifying at least one context associated with the user input; causing a search for the media based on the at least one context and the user input; determining, based on the at least one context and the user input, at least one media item that satisfies the request; and in accordance with a determination that the at least one media item satisfies the request, obtaining the at least one media item.
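The steps in the abstract (receive input, identify a context, search on both, pick an item that satisfies the request) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the `MediaItem` type, the in-memory `CATALOG`, the keyword matching, and the choice of "genres already in the user's library" as the context. Speech recognition is also left out, so the input is a text transcript.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MediaItem:
    title: str
    artist: str
    genre: str


# Hypothetical catalog standing in for a real media service.
CATALOG = [
    MediaItem("Blue Train", "John Coltrane", "jazz"),
    MediaItem("So What", "Miles Davis", "jazz"),
    MediaItem("Hey Jude", "The Beatles", "rock"),
]


def identify_context(user_library):
    """Toy user context: the genres the user already owns."""
    return {"preferred_genres": {item.genre for item in user_library}}


def search(words, context):
    """Return catalog items matching any word, preferred genres first."""
    wanted = {w.lower() for w in words}
    hits = [
        m for m in CATALOG
        if wanted & set(f"{m.title} {m.artist} {m.genre}".lower().split())
    ]
    # sorted() is stable, so catalog order breaks ties.
    return sorted(hits, key=lambda m: m.genre not in context["preferred_genres"])


def handle_request(transcript, user_library):
    """Search on the input plus context and return the best item, or None."""
    context = identify_context(user_library)
    results = search(transcript.split(), context)
    return results[0] if results else None
```

In this sketch the same transcript can resolve to different items depending on the library passed in, which is the role the abstract gives to context.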

First claim

Opening claim text (preview).

What is claimed is:

1. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to: receive a first natural language speech input indicative of a media request; determine a first search criteria included in the first natural language speech input; obtain a first media item satisfying the media request, wherein obtaining the first media item includes: causing a first search for a first plurality of media items to obtain the first media item, the first search including the first search criteria and a first context; while causing the first search, causing a second search including the first search criteria and a second context to obtain a candidate media item; and determining that the first media item from the first plurality of media items matches the candidate media item; provide playback of a first media item from the first plurality of media items; while providing playback of the first media item, receive a second natural language speech input; determine, based on the second natural language speech input, a user intent of obtaining a media item different from the first media item; and in accordance with determining the user intent of obtaining a media item different from the first media item: obtain a second media item different from the first media item based on the first search criteria and the second natural language speech input, wherein obtaining the second media item includes: initiating a third search for a second plurality of media items, the third search including the previously determined first search criteria included in the first natural language speech input; and provide the second media item from the second plurality of media items.

2. The non-transitory computer-readable storage medium of claim 1, wherein the first context is based on a context of the first natural language speech input.

3. The non-transitory computer-readable storage medium of claim 1, wherein determining the user intent of obtaining a media item different from the first media item includes determining the user intent of obtaining a media item different from the first media item based on a context of the second natural language speech input.

4. The non-transitory computer-readable storage medium of claim 1, wherein initiating the third search for the second plurality of media items further includes: searching for the second media item based on a context of the second natural language speech input.

5. The non-transitory computer-readable storage medium of claim 4, wherein the context of the second natural language speech input includes device context.

6. The non-transitory computer-readable storage medium of claim 5, wherein the device context includes a content of the first media item.

7. The non-transitory computer-readable storage medium of claim 5, wherein the device context includes data associated with media stored on the electronic device.

8. The non-transitory computer-readable storage medium of claim 4, wherein the context of the second natural language speech input includes a user context.

9. The non-transitory computer-readable storage medium of claim 8, wherein the user context includes a content of the second natural language speech input.

10. The non-transitory computer-readable storage medium of claim 8, wherein the user context includes media associated with a user.

11. The non-transitory computer-readable storage medium of claim 8, wherein the user context includes information relating to a musical preference of a user.

12. The non-transitory computer-readable storage medium of claim 4, wherein the context of the second natural language input includes social context.

13. The non-transitory computer-readable storage medium of claim 12, wherein the social context includes an access frequency of a particular media item across a plurality of users.

14. The non-transitory computer-readable storage medium of claim 12, wherein the social context includes a number of references to a media item in a social media database.

15. The non-transitory computer-readable storage medium of claim 1, wherein initiating the third search for the second plurality of media items includes: searching for a set of media items based on the first search criteria; and discarding, from the set of media items, one or more media items matching the first media item.

16. The non-transitory computer-readable storage medium of claim 1, wherein providing the second media item includes: terminating playback of the first media item; and providing playback of the second media item.

17. The non-transitory computer-readable storage medium of claim 1, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the electronic device to: present, to a user, an option to add the second media item to a library associated with the user; receive from the user, a user selection of the option; and in response to receiving the user selection of the option, add the second media item to the library associated with the user.

18. The non-transitory computer-readable storage medium of claim 1, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the electronic device to: while providing the second media item, receive a third natural speech input including a request for alternate media; and in accordance with receiving the third natural language speech input, provide a third media item different from the second media item.

19. The non-transitory computer-readable storage medium of claim 18, wherein the third media item includes a different version of the second media item.

20. A method for operating a digital assistant comprising: at an electronic device with one or more processors and memory: receiving a first natural language speech input indicative of a media request; determining a first search criteria included in the first natural language speech input; obtaining a first media item satisfying the media request, wherein obtaining the first media item includes: causing a first search for a plurality of media items to obtain the first media item, the first search including the first search criteria and a first context; while causing the first search, causing a second search including the first search criteria and a second context to obtain a candidate media item; and determining that the first media item from the first plurality of media items matches the candidate media item; providing playback of a first media item from the first plurality of media items; while providing playback of the first media item, receiving a second natural language speech input; determining, based on the second natural language speech input, a user intent of obtaining a media item different from the first media item; and in accordance with determining the user intent of obtaining a media item different from the first media item: obtaining a second media item different from the first media item based on the first search criteria and the second natural language speech input, wherein obtaining the second media item includes: initiating a third search for a second plurality of media items, the third search including the previously determined first search criteria included in the first natural language speech input; and providing the sec…
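The search flow recited in claim 1, with the discard step of claim 15, can be sketched roughly as follows. This is an illustrative reading and not the patented implementation: the dictionary-based `CATALOG`, the two ranking functions standing in for the "first context" and "second context", the use of a thread pool to run the first and second searches concurrently, and the `None` return when no match is found are all assumptions. The play-count ranking loosely mirrors the access-frequency social context of claim 13.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical catalog; "plays" stands in for access frequency across users.
CATALOG = [
    {"id": 1, "title": "Yesterday", "version": "studio", "plays": 900},
    {"id": 2, "title": "Yesterday", "version": "live", "plays": 300},
    {"id": 3, "title": "Help!", "version": "studio", "plays": 500},
]

# Each context is modelled as a ranking key (an assumption of this sketch).
USER_CONTEXT = {"rank": lambda m: m["version"] != "studio"}  # prefers studio
SOCIAL_CONTEXT = {"rank": lambda m: -m["plays"]}  # most played first


def search(criteria, context):
    """Return items whose title matches the criteria, ranked by the context."""
    hits = [m for m in CATALOG if m["title"].lower() == criteria.lower()]
    return sorted(hits, key=context["rank"])


def obtain_first_item(criteria):
    """Run the first and second searches concurrently, then match results."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        first = pool.submit(search, criteria, USER_CONTEXT)
        second = pool.submit(search, criteria, SOCIAL_CONTEXT)
        first_results, second_results = first.result(), second.result()
    candidate = second_results[0] if second_results else None
    for item in first_results:
        if candidate is not None and item["id"] == candidate["id"]:
            return item  # item from the first search matches the candidate
    return None


def obtain_different_item(criteria, first_item):
    """Third search: reuse the earlier criteria, discard the first item."""
    results = search(criteria, SOCIAL_CONTEXT)
    remaining = [m for m in results if m["id"] != first_item["id"]]
    return remaining[0] if remaining else None
```

A follow-up request such as "play a different one" would, in this sketch, call `obtain_different_item` with the criteria saved from the first request, so the user does not have to repeat the title.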

Assignees

Apple Inc

Inventors

Classifications

  • G06F16/48 · Primary

    Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually · CPC title

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • G06F3/167 · Primary

    Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Annotation, e.g. comment data or footnotes · CPC title

  • Semantic analysis · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11010127B2 cover?
An exemplary method for identifying media may include receiving user input associated with a request for media, where that user input includes unstructured natural language speech including one or more words; identifying at least one context associated with the user input; causing a search for the media based on the at least one context and the user input; determining, based on the at least one…
Who is the assignee on this patent?
Apple Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/48. Mapped technology areas include Physics.
When was this patent published?
Publication date: May 18, 2021 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).