Intelligent automated assistant for media exploration

US11069347B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11069347-B2
Application numberUS-201816035169-A
CountryUS
Kind codeB2
Filing dateJul 13, 2018
Priority dateJun 8, 2016
Publication dateJul 20, 2021
Grant dateJul 20, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and processes for operating an intelligent automated assistant to explore media items are provided. In one example process, a speech input representing a request for one or more media items is received from a user. The process determines whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items. In response to determining that the speech input corresponds to a user intent of obtaining personalized recommendations for media items, at least one media item is obtained from a user-specific corpus of media items. The user-specific corpus of media items is generate based on data associated with the user. The at least one media item is provided.

First claim

Opening claim text (preview).

What is claimed is: 1. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to: determine at least one media item frequently requested by a user, wherein the at least one media item frequently requested by the user is associated with at least one first parameter; determine at least one media item frequently rejected by the user, wherein the at least one media item frequently rejected by the user is associated with at least one second parameter; generate a user-specific corpus of media items by identifying media items including the at least one first parameter and not including the at least one second parameter; receive, from a user, speech input representing a request for one or more media items; determine whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items by evaluating a text representation of the speech input against a set of rules associated with one or more actionable intent nodes; in accordance with a determination that the text representation satisfies the set of rules: obtain at least one media item from the user-specific corpus of media items; and provide the at least one media item from the user-specific corpus of media items. 2. The non-transitory computer-readable storage medium of claim 1 , wherein evaluating the text representation of the speech input against the set of rules further comprises determining whether a number of parameters defined in the text representation is less than a threshold number. 3. The non-transitory computer-readable storage medium of claim 1 , wherein evaluating the text representation of the speech input against the set of rules further comprises determining whether the text representation includes one of a plurality of phrases corresponding to the user intent of obtaining personalized recommendations for media items. 4. The non-transitory computer-readable storage medium of claim 1 , wherein the instructions further cause the one or more processors to: in accordance with the determination that the text representation satisfies the set of rules: rank a plurality of candidate media items from the user-specific corpus of media items using a user-specific media ranking model, the user-specific media ranking model generated based on a plurality of previous media-related requests from the user, wherein obtaining the at least one media item from the user-specific corpus of media items includes selecting the at least one media item from the plurality of candidate media items based on the ranking. 5. The non-transitory computer-readable storage medium of claim 1 , wherein each media item in the user-specific corpus of media items includes metadata that indicates an activity associated with the media item, and wherein the activity is associated with the media item based on a music tempo of the media item. 6. The non-transitory computer-readable storage medium of claim 1 , wherein each media item in the user-specific corpus of media items includes metadata that indicates a mood associated with the media item, and wherein the mood is associated with the media item based on a music key of the media item. 7. The non-transitory computer-readable storage medium of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines an occasion associated with a time period; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an occasion associated with a time period, obtain the at least one media item from the user-specific corpus of media items based the occasion, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the occasion. 8. The non-transitory computer-readable storage medium of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines an editorial list associated with a media establishment; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an editorial list associated with a media establishment, obtain the at least one media item from the user-specific corpus of media items based on the editorial list associated with the media establishment, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the editorial list associated with the media establishment. 9. The non-transitory computer-readable storage medium of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines a mood; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines a mood, obtain the at least one media item from the user-specific corpus of media items based on the mood, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the mood. 10. The non-transitory computer-readable storage medium of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines an activity; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an activity, obtain the at least one media item from the user-specific corpus of media items based on the activity, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the activity. 11. The non-transitory computer-readable storage medium of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines a time period; in response to determining that the text representation defines a time period, determine whether the text representation defines a genre in association with the time period; and in response to determining that the text representation defines a genre in association with the time period, determine a subgenre based on the time period and the genre, wherein in accordance with the determination that the text representation satisfies the set of rules, the at least one media item is obtained from the user-specific corpus of media items based on the subgenre, and wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the subgenre. 12. The non-transitory computer-readable storage medium of claim 1 , wherein the text representation defines a category of media items, and wherein the instructions further cause the one or more processors to: in accordance with the determination that the text representation satisfies the set of rules, obtain the at least one media item from the user-specific corpus of media items includes obtaining a plurality of media items associated with the category of media items; and determine a degree of familiarity of the user with the category of media items, wherein an average popularity rating of the plurality of media items is based on the degree of familiarity of the user with the category of media items. 13. The non-transitory computer-readable storage medium of claim 1 , wherein the instructions further cause the one or more processors to: determine an identity of the user by performing speaker identificat

Assignees

Inventors

Classifications

  • G06F16/433Primary

    using audio data · CPC title

  • G06F16/00Primary

    Information retrieval; Database structures therefor; File system structures therefor · CPC title

  • Natural language query formulation or dialogue systems · CPC title

  • for discriminating voice from noise · CPC title

  • Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11069347B2 cover?
Systems and processes for operating an intelligent automated assistant to explore media items are provided. In one example process, a speech input representing a request for one or more media items is received from a user. The process determines whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items. In response to determining that the sp…
Who is the assignee on this patent?
Apple Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/433. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 20 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).