What technology area does this patent fall under?

Primary CPC classification G06F16/00. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Aug 14 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Intelligent automated assistant for media exploration

US10049663B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10049663-B2
Application number	US-201615266956-A
Country	US
Kind code	B2
Filing date	Sep 15, 2016
Priority date	Jun 8, 2016
Publication date	Aug 14, 2018
Grant date	Aug 14, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and processes for operating an intelligent automated assistant to explore media items are provided. In one example process, a speech input representing a request for one or more media items is received from a user. The process determines whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items. In response to determining that the speech input corresponds to a user intent of obtaining personalized recommendations for media items, at least one media item is obtained from a user-specific corpus of media items. The user-specific corpus of media items is generate based on data associated with the user. The at least one media item is provided.

First claim

Opening claim text (preview).

What is claimed is: 1. A system for operating a digital assistant to explore media items, the system comprising: one or more processors; and memory having instructions stored thereon, the instructions, when executed by the one or more processors, cause the one or more processors to: receive, from a user, speech input representing a request for one or more media items; evaluate a text representation of the speech input against a set of rules to determine whether or not the speech input corresponds to a user intent of obtaining personalized recommendations for media items, wherein the evaluating includes determining an actionable intent node by analyzing words in the text representation against words of a vocabulary index associated with a plurality of actionable intent nodes, and wherein the set of rules includes a first rule that the actionable intent node corresponds to an actionable intent of obtaining personalized recommendations for media items and a second rule that one or more words in the text representation refers to the user; in accordance with a determination that the text representation satisfies the set of rules: obtain at least one media item from a user-specific corpus of media items, the user-specific corpus of media items generated according to inferred media preferences of the user; and provide the at least one media item from the user-specific corpus of media items; and in accordance with a determination that the text representation does not satisfy the set of rules: obtain at least one media item from a general corpus of media items, the general corpus of media items generated according to inferred media preferences of a plurality of users; and provide the at least one media item from the general corpus of media items. 2. The system of claim 1 , wherein evaluating the text representation of the speech input against the set of rules further comprises determining whether a number of parameters defined in the text representation is less than a threshold number. 3. The system of claim 1 , wherein evaluating the text representation of the speech input against the set of rules further comprises determining whether the text representation includes one of a plurality of phrases corresponding to the user intent of obtaining personalized recommendations for media items. 4. The system of claim 1 , wherein the instructions further cause the one or more processors to: in accordance with the determination that the text representation satisfies the set of rules: rank a plurality of candidate media items from the user-specific corpus of media items using a user-specific media ranking model, the user-specific media ranking model generated based on a plurality of previous media-related requests from the user, wherein obtaining the at least one media item from the user-specific corpus of media items includes selecting the at least one media item from the plurality of candidate media items based on the ranking. 5. The system of claim 4 , wherein the instructions further cause the one or more processors to: in accordance with the determination that the text representation satisfies the set of rules: receive, from the user, a second speech input; determine whether the second speech input corresponds to a rejection of the at least one media item from the user-specific corpus of media items; and in response to determining that the second speech input corresponds to a rejection of the at least one media item from the user-specific corpus of media items: update the user-specific media ranking model in accordance with the rejection. 6. The system of claim 5 , wherein the instructions further cause the one or more processors to: re-rank the plurality of candidate media items from the user-specific corpus of media items based on the rejection of the at least one media item; and select at least one second media item from the plurality of candidate media items based on the re-ranking. 7. The system of claim 1 , wherein each media item in the user-specific corpus of media items includes metadata that indicates an activity associated with the media item, and wherein the activity is associated with the media item based on a music tempo of the media item. 8. The system of claim 1 , wherein each media item in the user-specific corpus of media items includes metadata that indicates a mood associated with the media item, and wherein the mood is associated with the media item based on a music key of the media item. 9. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines an occasion associated with a time period; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an occasion associated with a time period, obtain the at least one media item from the user-specific corpus of media items based the occasion, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the occasion. 10. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines an editorial list associated with a media establishment; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an editorial list associated with a media establishment, obtain the at least one media item from the user-specific corpus of media items based on the editorial list associated with the media establishment, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the editorial list associated with the media establishment. 11. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines a mood; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines a mood, obtain the at least one media item from the user-specific corpus of media items based on the mood, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the mood. 12. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines an activity; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an activity, obtain the at least one media item from the user-specific corpus of media items based on the activity, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the activity. 13. The system of claim 1 , wherein the instructions further cause the one or more processors to: determine whether the text representation defines a time period; in response to determining that the text representation defines a time period, determine whether the text representation defines a genre in association with the time period; and in response to determining that the text representation defines a genre in association with the time period, determine a subgenre based on the time period and the genre, wherein in accordance with the determination that the text representation satisfies the set of rules, the at least one media item is obtained from the user-specific corpus of media items based on the subgenre, and wherein the at least one media item from the user-specific corpus of media items in

Assignees

Inventors

Classifications

G06F16/00Primary
Information retrieval; Database structures therefor; File system structures therefor · CPC title
G10L15/22Primary
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
G06F3/167
Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title
G10L17/00
Speaker identification or verification techniques · CPC title
G06F16/90332
Natural language query formulation or dialogue systems · CPC title

Patent family

Related publications grouped by family.

View patent family 60573004

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10049663B2 cover?: Systems and processes for operating an intelligent automated assistant to explore media items are provided. In one example process, a speech input representing a request for one or more media items is received from a user. The process determines whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items. In response to determining that the sp…
Who is the assignee on this patent?: Apple Inc, Apple Inc
What technology area does this patent fall under?: Primary CPC classification G06F16/00. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Aug 14 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).