Method of processing data and electronic device thereof
US-2015067485-A1 · Mar 5, 2015 · US
US10049663B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10049663-B2 |
| Application number | US-201615266956-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 15, 2016 |
| Priority date | Jun 8, 2016 |
| Publication date | Aug 14, 2018 |
| Grant date | Aug 14, 2018 |
Official abstract text for this publication.
Systems and processes for operating an intelligent automated assistant to explore media items are provided. In one example process, a speech input representing a request for one or more media items is received from a user. The process determines whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items. In response to determining that the speech input corresponds to a user intent of obtaining personalized recommendations for media items, at least one media item is obtained from a user-specific corpus of media items. The user-specific corpus of media items is generated based on data associated with the user. The at least one media item is provided.
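The flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes the speech input has already been transcribed to text, and the phrase list, the function names (`wants_personalized`, `build_user_corpus`, `handle_request`), and the use of play history as the "data associated with the user" are all hypothetical choices made for illustration.

```python
# Hypothetical phrases that signal a request for personalized recommendations.
PERSONAL_PHRASES = ("recommend", "something i'd like", "for me")


def wants_personalized(text):
    """Return True if the transcribed request looks like a personalized one."""
    lowered = text.lower()
    return any(phrase in lowered for phrase in PERSONAL_PHRASES)


def build_user_corpus(play_history):
    """Generate a user-specific corpus from data associated with the user.

    Assumption: the data is a list of played track titles. The corpus is the
    set of distinct titles, most-played first (ties broken alphabetically).
    """
    counts = {}
    for title in play_history:
        counts[title] = counts.get(title, 0) + 1
    return sorted(counts, key=lambda title: (-counts[title], title))


def handle_request(text, play_history):
    """Provide at least one media item when personalized intent is detected."""
    if wants_personalized(text):
        corpus = build_user_corpus(play_history)
        return corpus[:1]
    # The abstract does not describe this branch, so the sketch returns nothing.
    return []
```

For example, `handle_request("Recommend some music for me", ["A", "B", "B"])` selects from the corpus built from the play history, while a request such as "Play Thriller" falls through to the empty branch.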
Opening claim text (preview).
What is claimed is:
1. A system for operating a digital assistant to explore media items, the system comprising: one or more processors; and memory having instructions stored thereon, the instructions, when executed by the one or more processors, cause the one or more processors to: receive, from a user, speech input representing a request for one or more media items; evaluate a text representation of the speech input against a set of rules to determine whether or not the speech input corresponds to a user intent of obtaining personalized recommendations for media items, wherein the evaluating includes determining an actionable intent node by analyzing words in the text representation against words of a vocabulary index associated with a plurality of actionable intent nodes, and wherein the set of rules includes a first rule that the actionable intent node corresponds to an actionable intent of obtaining personalized recommendations for media items and a second rule that one or more words in the text representation refers to the user; in accordance with a determination that the text representation satisfies the set of rules: obtain at least one media item from a user-specific corpus of media items, the user-specific corpus of media items generated according to inferred media preferences of the user; and provide the at least one media item from the user-specific corpus of media items; and in accordance with a determination that the text representation does not satisfy the set of rules: obtain at least one media item from a general corpus of media items, the general corpus of media items generated according to inferred media preferences of a plurality of users; and provide the at least one media item from the general corpus of media items.
2. The system of claim 1, wherein evaluating the text representation of the speech input against the set of rules further comprises determining whether a number of parameters defined in the text representation is less than a threshold number.
3. The system of claim 1, wherein evaluating the text representation of the speech input against the set of rules further comprises determining whether the text representation includes one of a plurality of phrases corresponding to the user intent of obtaining personalized recommendations for media items.
4. The system of claim 1, wherein the instructions further cause the one or more processors to: in accordance with the determination that the text representation satisfies the set of rules: rank a plurality of candidate media items from the user-specific corpus of media items using a user-specific media ranking model, the user-specific media ranking model generated based on a plurality of previous media-related requests from the user, wherein obtaining the at least one media item from the user-specific corpus of media items includes selecting the at least one media item from the plurality of candidate media items based on the ranking.
5. The system of claim 4, wherein the instructions further cause the one or more processors to: in accordance with the determination that the text representation satisfies the set of rules: receive, from the user, a second speech input; determine whether the second speech input corresponds to a rejection of the at least one media item from the user-specific corpus of media items; and in response to determining that the second speech input corresponds to a rejection of the at least one media item from the user-specific corpus of media items: update the user-specific media ranking model in accordance with the rejection.
6. The system of claim 5, wherein the instructions further cause the one or more processors to: re-rank the plurality of candidate media items from the user-specific corpus of media items based on the rejection of the at least one media item; and select at least one second media item from the plurality of candidate media items based on the re-ranking.
7. The system of claim 1, wherein each media item in the user-specific corpus of media items includes metadata that indicates an activity associated with the media item, and wherein the activity is associated with the media item based on a music tempo of the media item.
8. The system of claim 1, wherein each media item in the user-specific corpus of media items includes metadata that indicates a mood associated with the media item, and wherein the mood is associated with the media item based on a music key of the media item.
9. The system of claim 1, wherein the instructions further cause the one or more processors to: determine whether the text representation defines an occasion associated with a time period; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an occasion associated with a time period, obtain the at least one media item from the user-specific corpus of media items based the occasion, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the occasion.
10. The system of claim 1, wherein the instructions further cause the one or more processors to: determine whether the text representation defines an editorial list associated with a media establishment; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an editorial list associated with a media establishment, obtain the at least one media item from the user-specific corpus of media items based on the editorial list associated with the media establishment, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the editorial list associated with the media establishment.
11. The system of claim 1, wherein the instructions further cause the one or more processors to: determine whether the text representation defines a mood; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines a mood, obtain the at least one media item from the user-specific corpus of media items based on the mood, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the mood.
12. The system of claim 1, wherein the instructions further cause the one or more processors to: determine whether the text representation defines an activity; and in accordance with the determination that the text representation satisfies the set of rules and that the text representation defines an activity, obtain the at least one media item from the user-specific corpus of media items based on the activity, wherein the at least one media item from the user-specific corpus of media items includes metadata indicating the activity.
13. The system of claim 1, wherein the instructions further cause the one or more processors to: determine whether the text representation defines a time period; in response to determining that the text representation defines a time period, determine whether the text representation defines a genre in association with the time period; and in response to determining that the text representation defines a genre in association with the time period, determine a subgenre based on the time period and the genre, wherein in accordance with the determination that the text representation satisfies the set of rules, the at least one media item is obtained from the user-specific corpus of media items based on the subgenre, and wherein the at least one media item from the user-specific corpus of media items in
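As an informal illustration of claims 1 and 4 to 6, the Python sketch below evaluates a text representation against the two rules of claim 1, routes the request to a user-specific or general corpus, and re-ranks candidates after a rejection. Everything concrete here is an assumption for illustration only: the contents of `VOCABULARY_INDEX` and `USER_WORDS`, the word-vote method for picking an intent node, the count-based `UserRankingModel`, and the fixed rejection penalty are hypothetical and are not taken from the patent.

```python
from collections import Counter

# Hypothetical vocabulary index: word -> actionable intent nodes it maps to.
VOCABULARY_INDEX = {
    "play": ["play_media"],
    "recommend": ["personalized_recommendation"],
    "suggest": ["personalized_recommendation"],
    "music": ["play_media", "personalized_recommendation"],
}
# Hypothetical set of words that refer to the user (second rule of claim 1).
USER_WORDS = {"i", "me", "my"}
TARGET_INTENT = "personalized_recommendation"


def tokenize(text):
    """Lowercase the text and strip simple trailing punctuation from words."""
    return [word.strip(".,!?").lower() for word in text.split()]


def determine_intent_node(words):
    """Pick the intent node that the most words in the request vote for."""
    votes = Counter()
    for word in words:
        for node in VOCABULARY_INDEX.get(word, []):
            votes[node] += 1
    return votes.most_common(1)[0][0] if votes else None


def satisfies_rules(text):
    """Rule 1: intent node is the target intent. Rule 2: a word refers to the user."""
    words = tokenize(text)
    rule_1 = determine_intent_node(words) == TARGET_INTENT
    rule_2 = any(word in USER_WORDS for word in words)
    return rule_1 and rule_2


def select_corpus(text, user_corpus, general_corpus):
    """Route to the user-specific corpus only when both rules are satisfied."""
    return user_corpus if satisfies_rules(text) else general_corpus


class UserRankingModel:
    """Toy user-specific ranking model built from previous media requests."""

    def __init__(self, previous_requests):
        # Score each item by how often the user previously requested it.
        self.scores = Counter(previous_requests)

    def rank(self, candidates):
        # Highest score first; ties broken alphabetically for determinism.
        return sorted(candidates, key=lambda item: (-self.scores[item], item))

    def reject(self, item):
        # Update the model after a rejection (arbitrary illustrative penalty).
        self.scores[item] -= 10
```

In this sketch, "Recommend me some music" satisfies both rules and is served from the user-specific corpus, whereas "Recommend some music" fails the second rule and falls back to the general corpus. Calling `reject` on the top-ranked item and then `rank` again yields the re-ranked candidate list from which a second item can be chosen.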
Information retrieval; Database structures therefor; File system structures therefor · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title
Speaker identification or verification techniques · CPC title
Natural language query formulation or dialogue systems · CPC title