Intelligent automated assistant for media exploration

US10049663B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10049663-B2
Application numberUS-201615266956-A
CountryUS
Kind codeB2
Filing dateSep 15, 2016
Priority dateJun 8, 2016
Publication dateAug 14, 2018
Grant dateAug 14, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and processes for operating an intelligent automated assistant to explore media items are provided. In one example process, a speech input representing a request for one or more media items is received from a user. The process determines whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items. In response to determining that the speech input corresponds to a user intent of obtaining personalized recommendations for media items, at least one media item is obtained from a user-specific corpus of media items. The user-specific corpus of media items is generate based on data associated with the user. The at least one media item is provided.

First claim

Opening claim text (preview).

What is claimed is: 1. A system for operating a digital assistant to explore media items, the system comprising: one or more processors; and memory having instructions stored thereon, the instructions, when executed by the one or more processors, cause the one or more processors to: receive, from a user, speech input representing a request for one or more media items; evaluate a text representation of the speech input against a set of rules to determine whether or not the speech input corresponds to a user intent of obtaining personalized recommendations for media items, wherein the evaluating includes determining an actionable intent node by analyzing words in the text representation against words of a vocabulary index associated with a plurality of actionable intent nodes, and wherein the set of rules includes a first rule that the actionable intent node corresponds to an actionable intent of obtaining personalized recommendations for media items and a second rule that one or more words in the text representation refers to the user; in accordance with a determination that the text representation satisfies the set of rules: obtain at least one media item from a user-specific corpus of media items, the user-specific corpus of media items generated according to inferred media preferences of the user; and provide the at least one media item from the user-specific corpus of media items; and in accordance with a determination that the text representation does not satisfy the set of rules: obtain at least one media item from a general corpus of media items, the general corpus of media items generated according to inferred media preferences of a plurality of users; and provide the at least one media item from the general corpus of media items. 2. The system of claim 1 , wherein evaluating the text representation of the speech input against the set of rules further comprises determining whether a number of parameters defined in the text representation is less than a threshold number. 3. The system of claim 1 , wherein evaluating the text representation of the speech input against the set of rules further comprises determining whether the text representation includes one of a plurality of phrases corresponding to the user intent of obtaining personalized recommendations for media items. 4. The system of claim 1 , wherein the instructions further cause the one or more processors to: in accordance with the determination that the text representation satisfies the set of rules: rank a plurality of candidate media items from the user-specific corpus of media items using a user-specific media ranking model, the user-specific media ranking model generated based on a plurality of previous media-related requests from the user, wherein obtaining the at least one media item from the user-specific corpus of media items includes selecting the at least one media item from the plurality of candidate media items based on the ranking. 5. The system of claim 4 , wherein the instructions further cause the one or more processors to: in accordance with the determination that the text representation satisfies the set of rules: receive, from the user, a second speech input; determine whether the second speech input corresponds to a rejection of the at least one media item from the user-specific corpus of media items; and in response to determining that the second speech input corresponds to a rejection of the at least one media item from the user-specific corpus of media items: update the user-specific media ranking model in accordance with the rejection. 6. The system of claim 5 , wherein the instructions further cause the one or more processors to: re-rank the plurality of candidate media items from the user-specific corpus of media items based on the rejection of the at least one media item; and select at least one second media item from the plurality of candidate media items based on the re-ranking. 7. The system of claim 1 , wherein each media item in the user-specific corpus of media items includes metadata that indicates an activity associated with the media item, and wherein the activity is associated with the media item based on a music tempo of the media item. 8. The system of claim 1 , wherein each media item in the user-specific corpus of media items includes metadata that indicates a mood associated with the media item, and wherein the mood is associated with the media item based on a music key of the media item. 9. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines an occasion associated with a time period; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an occasion associated with a time period, obtain the at least one media item from the user-specific corpus of media items based the occasion, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the occasion. 10. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines an editorial list associated with a media establishment; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an editorial list associated with a media establishment, obtain the at least one media item from the user-specific corpus of media items based on the editorial list associated with the media establishment, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the editorial list associated with the media establishment. 11. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines a mood; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines a mood, obtain the at least one media item from the user-specific corpus of media items based on the mood, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the mood. 12. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines an activity; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an activity, obtain the at least one media item from the user-specific corpus of media items based on the activity, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the activity. 13. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines a time period; in response to determining that the text representation defines a time period, determine whether the text representation defines a genre in association with the time period; and in response to determining that the text representation defines a genre in association with the time period, determine a subgenre based on the time period and the genre, wherein in accordance with the determination that the text representation satisfies the set of rules, the at least one media item is obtained from the user-specific corpus of media items based on the subgenre, and wherein the at least one media item from the user-specific corpus of media items in

Assignees

Inventors

Classifications

  • G06F16/00Primary

    Information retrieval; Database structures therefor; File system structures therefor · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Speaker identification or verification techniques · CPC title

  • Natural language query formulation or dialogue systems · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10049663B2 cover?
Systems and processes for operating an intelligent automated assistant to explore media items are provided. In one example process, a speech input representing a request for one or more media items is received from a user. The process determines whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items. In response to determining that the sp…
Who is the assignee on this patent?
Apple Inc, Apple Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 14 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).