Systems, methods, and computer program products for providing contextually-aware video recommendation

US9451329B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9451329-B2
Application numberUS-201414509396-A
CountryUS
Kind codeB2
Filing dateOct 8, 2014
Priority dateOct 8, 2013
Publication dateSep 20, 2016
Grant dateSep 20, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems and computer program products are provided for providing content recommendation by obtaining metadata associated with a media object, extracting from the metadata a plurality of terms associated with the media object, and mapping at least a portion of the plurality of terms to buckets. A query vector having attributes corresponding to the buckets is used to perform a query on a database storing media object documents having attributes corresponding to the buckets.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of providing a database of content-level attributes associated with media objects, the method comprising: performing on at least one computer of a content recommendation system, the at least one computer having a query interface for receiving a query containing at least one content-level attribute, the steps of: obtaining, by crawling a network using a server adapted to gather text data, metadata associated with a media object from a plurality of data sources, wherein at least one of the plurality of data sources is an unstructured data source and the metadata includes extrinsic metadata corresponding to one or more content-level attributes of the media object; extracting from the metadata a plurality of terms associated with the media object by applying an entity extraction model to the metadata; mapping at least a portion of the plurality of terms to a plurality of buckets using an indexing engine in combination with a clustering framework configured to cluster the plurality of terms to categorization terms associated with each bucket; calculating, for each term of the plurality of terms, a probability that the term is associated with the media object; associating the probability to each term, correspondingly; generating a vector of content-level attributes corresponding to the media object based on the associating; storing the vector of content-level attributes in a database; receiving via the query interface a query vector containing at least one of the content-level attributes; searching the database for at least one content-level attribute contained in the query vector; and providing, in response to the searching, a query result containing the media object. 2. The method according to claim 1 , further comprising: generating the plurality of buckets by selecting categorization terms corresponding to a plurality of like-terms associated with a type of media content. 3. The method according to claim 1 , further comprising: generating the plurality of buckets by clustering the plurality of terms using a plurality of cluster terms and a plurality of definitions including a plurality of references to a plurality of other terms, wherein the cluster terms having the highest scores correspond to the plurality of buckets. 4. The method according to claim 1 , wherein the metadata is extracted from at least one structured data source. 5. The method of claim 1 , further comprising: for each bucket of the plurality of buckets: correlating the plurality of terms associated with the bucket; and weighting each bucket based on a correlation value obtained by the correlating. 6. The method according to claim 1 , further comprising: generating a document containing at least the vector; and storing the document in the database. 7. The method of claim 6 , further comprising: generating a query vector having the at least one content-level attribute corresponding to at least one of the plurality of buckets; and querying the database by using the query vector. 8. A system for providing a database of content-level attributes associated with media objects, the system comprising: at least one computer of a content recommendation system, the at least one computer having a query interface for receiving a query containing at least one content-level attribute; a server adapted to gather text data and operable to: obtain, by crawling a network, metadata associated with a media object from a plurality of data sources, wherein at least one of the plurality of data sources is an unstructured data source and the metadata includes extrinsic metadata corresponding to one or more content-level attributes of the media object and a database, wherein the at least one computer is operable to: extract from the metadata a plurality of terms associated with the media object by applying an entity extraction model to the metadata, map at least a portion of the plurality of terms to a plurality of buckets using an indexing engine in combination with a clustering framework configured to cluster the plurality of terms to categorization terms associated with each bucket, calculate, for each term of the plurality of terms, a probability that the term is associated with the media object; associate the probability to each term, correspondingly; generate vector of content-level attributes corresponding to the media object based on the associating; store the vector of content-level attributes in the database; receive via the query interface a query vector containing at least one of the content-level attributes; search the database for at least one content-level attribute contained in the query vector; and provide, in response to the searching, a query result containing the media object. 9. The system according to claim 8 , the at least one computer of the content recommendation system further operable to: generate the plurality of buckets by selecting categorization terms corresponding to a plurality of like-terms associated with a type of media content. 10. The system according to claim 8 , the at least one computer of the content recommendation system further operable to: generate the plurality of buckets by clustering the plurality of terms using a plurality of cluster terms and a plurality of definitions including a plurality of references to a plurality of other terms, wherein the cluster terms having the highest scores correspond to the plurality of buckets. 11. The system according to claim 8 , wherein the metadata is extracted from at least one structured data source. 12. The system according to claim 8 , the at least one computer of the content recommendation system being further operable to: for each bucket of the plurality of buckets: correlate the plurality of terms associated with the bucket; and weight each bucket based on a correlation value obtained by the correlating. 13. The system according to claim 8 , the at least one computer of the content recommendation system being further operable to: generate a document containing at least the vector; and store the document in the database. 14. The system according to claim 13 , wherein the at least one computer of the content recommendation system is further operable to: generate a query vector having the at least one content-level attribute corresponding to at least one of the plurality of buckets; and query the database by using the query vector. 15. A non-transitory computer-readable medium having stored thereon one or more sequences of instructions for causing one or more processors to perform on at least one computer of a content recommendation system, the at least one computer having a query interface for receiving a query containing at least one content-level attribute, the steps of: obtaining, by crawling a network using a server adapted to gather text data, metadata associated with a media object from a plurality of data sources, wherein at least one of the plurality of data sources is an unstructured data source and the metadata includes extrinsic metadata corresponding to one or more content-level attributes of the media object; extracting from the metadata a plurality of terms associated with the media object by applying an entity extraction model to the metadata; mapping at least a portion of the plurality of terms to a plurality of buckets using an indexing engine in combination with a clustering framework configured to cluster the plurality of terms to categorization terms associated with each bucket; calculating, for each term of the plurality of terms, a probability that the term is

Assignees

Inventors

Classifications

  • using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings · CPC title

  • Physics · mapped topic

  • Analytics of user selections, e.g. selection of programmes or purchase activity (monitoring of user selections in data processing systems G06F11/34; arrangements for monitoring the user's behaviour or opinions in broadcast systems H04H60/33) · CPC title

  • Content retrieval operation from a local storage medium, e.g. hard-disk {(details of retrieval of video data and associated meta data in video databases G06F16/739)} · CPC title

  • Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9451329B2 cover?
Methods, systems and computer program products are provided for providing content recommendation by obtaining metadata associated with a media object, extracting from the metadata a plurality of terms associated with the media object, and mapping at least a portion of the plurality of terms to buckets. A query vector having attributes corresponding to the buckets is used to perform a query on a…
Who is the assignee on this patent?
The Echo Nest Corp, Spotify Ab
What technology area does this patent fall under?
Primary CPC classification H04N21/4826. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Sep 20 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).