What technology area does this patent fall under?

Primary CPC classification G06F16/637. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue May 07 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Personalizing explainable recommendations with bandits

US11977577B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11977577-B2
Application number	US-202318331245-A
Country	US
Kind code	B2
Filing date	Jun 8, 2023
Priority date	Jul 6, 2018
Publication date	May 7, 2024
Grant date	May 7, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems and computer program products are provided personalizing recommendations of items with associated explanations. The example embodiments described herein use contextual bandits to personalize explainable recommendations (“recsplanations”) as treatments (“Bart”). Bart learns and predicts satisfaction (e.g., click-through rate, consumption probability) for any combination of item, explanation, and context and, through logging and contextual bandit retraining, can learn from its mistakes in an online setting.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for recommending items with associated explanations, the method comprising: receiving, from a user, a selection of an application; providing context data to a service associated with the application; causing the service to use the context data to select a media item and an explanation based on a policy defined by the service, the policy containing a distribution of media item and explanation pairs over a plurality of contexts, the distribution indicating a probability that each of the media item and explanation pairs is selected; receiving, from the service, the media item and the explanation; displaying the media item and the explanation; and receiving, from the user, a selection of the media item. 2. The method of claim 1 , wherein displaying the media item and the explanation comprises displaying a selectable visualization associated with the media item and the explanation; and wherein receiving, from the user, the selection of the media item comprises receiving a selection of the selectable visualization. 3. The method of claim 1 , further comprising receiving, from the service, a plurality of media items associated with the explanation; and wherein displaying the media item and the explanation comprises displaying a shelf layout with the explanation and the plurality of media items. 4. The method of claim 1 , wherein displaying the media item comprises displaying at least some media content of the media item. 5. The method of claim 1 , wherein the selection of the application is a selection of a home tab of a media content delivery service. 6. The method of claim 1 , wherein receiving, from the service, the media item and the explanation comprises receiving a first plurality of media items associated with a first explanation and a second plurality of media items associated with a second explanation; and wherein displaying the media item and the explanation comprises displaying, in a first region of a user interface, the first plurality of media items and the first explanation and displaying, in a second region of the user interface, the second plurality of media items and the second explanation. 7. The method of claim 1 , further comprising: causing the service to use the context data to select a plurality of media item and explanation pairs; receiving, from the service, the plurality of media item and explanation pairs; receiving, from the service, an ordering of the plurality of media item and explanation pairs; and displaying, based on the ordering of the plurality of media item and explanation pairs, a plurality of visualizations for the plurality of media item and explanation pairs. 8. The method of claim 1 , wherein selecting the media item and the explanation associated with the media item is personalized for the user based at least in part on the context data. 9. The method of claim 1 , wherein the explanation is based on one or more of a previous activity of the user or a social feature of the user. 10. The method of claim 1 , further comprising: in response to receiving the selection of the media item, providing feedback data to the service; and causing the service to, based on the feedback data, increase a selection probability for the media item and the explanation in the policy. 11. The method of claim 1 , wherein the context data includes one or more of a user attribute of the user, an item attribute, or an item identifier. 12. The method of claim 1 , wherein the policy is defined based at least in part on a reward model trained using a plurality of items, a plurality of explanations associated with the plurality of items, a plurality of context data, and user activity data. 13. The method of claim 12 , wherein the reward model includes a reward function that outputs a reward based on a selected item of the plurality of items, a selected explanation from the plurality of explanations, and a given context of the plurality of context data; and wherein training the reward model comprises optimizing a learning objective associated with the reward function. 14. The method of claim 13 , wherein the reward model includes one or more of a factorization machine or a neural network. 15. The method of claim 1 , wherein the policy is defined using an exploration-exploitation policy; and wherein, according to the exploration-exploitation policy, determining whether to explore or exploit media items is performed separately from determining whether to explore or exploit explanations. 16. The method of claim 1 , wherein the selection of the media item is a playback action or a save action; and wherein the media item is a playlist of media items. 17. A computer-readable medium having stored thereon sequences of instructions, the sequences of instructions including instructions which when executed by a computer system cause the computer system to: receive, from a user, a selection of an application; provide context data to a service associated with the application; cause the service to use the context data to select a media item and an explanation based on a policy defined by the service, the policy containing a distribution of media item and explanation pairs over a plurality of contexts, the distribution indicating a probability that each of the media item and explanation pairs is selected; receive, from the service, the media item and the explanation; display the media item and the explanation; and receive, from the user, a selection of the media item. 18. A system for personalizing recommendations of items, the system comprising: a server including a service; a device including a processor and memory storing instructions that, when executed by the processor, cause the device to: receive, from a user, a selection of an application associated with the service; provide context data to the service; cause the service to use the context data to select a media item and an explanation based on a policy defined by the service, the policy containing a distribution of media item and explanation pairs over a plurality of contexts, the distribution indicating a probability that each of the media item and explanation pairs is selected; receive, from the service, the media item and the explanation; display the media item and the explanation; and receive, from the user, a selection of the media item. 19. The system of claim 18 , wherein the instructions, when executed, further cause the device to play media content of the media item. 20. The system of claim 18 , further comprising a second device including a second processor and second memory storing second instructions that, when executed by the second processor, cause the second device to: provide second context data and feedback data to the service, the feedback data being associated with user activity for a second user; and cause the service to, based at least in part on the second context data and the feedback data, update the policy by retraining a reward model.

Assignees

Spotify Ab

Inventors

Classifications

G06N3/092
Reinforcement learning · CPC title
G06F16/637Primary
Administration of user profiles, e.g. generation, initialization, adaptation or distribution · CPC title
G06F16/639
using playlists · CPC title
G06F18/217
Validation; Performance evaluation; Active pattern learning techniques · CPC title
G06F16/9535
Search customisation based on user profiles and personalisation · CPC title

Patent family

Related publications grouped by family.

View patent family 69102593

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11977577B2 cover?: Methods, systems and computer program products are provided personalizing recommendations of items with associated explanations. The example embodiments described herein use contextual bandits to personalize explainable recommendations (“recsplanations”) as treatments (“Bart”). Bart learns and predicts satisfaction (e.g., click-through rate, consumption probability) for any combination of item,…
Who is the assignee on this patent?: Spotify Ab
What technology area does this patent fall under?: Primary CPC classification G06F16/637. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue May 07 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).