TV content segmentation, categorization and identification and time-aligned applications

US9510044B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9510044-B1
Application numberUS-201113327359-A
CountryUS
Kind codeB1
Filing dateDec 15, 2011
Priority dateJun 18, 2008
Publication dateNov 29, 2016
Grant dateNov 29, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Content segmentation, categorization and identification methods are described. Content tracking approaches are illustrated that are suitable for large scale deployment. Time-aligned applications such as multi-language selection, customized advertisements, second screen services and content monitoring applications can be economically deployed at large scales. A client performs fingerprinting, scene change detection, audio turn detection, and logo detection on incoming video and gathers database search results, logos and text to identify and segment video streams into content, promos, and commercials. A learning engine is configured to learn rules for optimal identification and segmentation at each client for each channel and program. Content sensed at the client site is tracked with reduced computation and applications are executed with timing precision. A user interface for time-aligned publishing of content and subsequent usage and interaction on one or more displays is also described.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for time aligned identification of video content playing on an electronic client device, the method comprising: receiving by the electronic client device, a video stream for presentation by the electronic client device, the video stream containing a first video content followed by a second video content; generating by the electronic client device, a first set of signatures for the first video content; identifying by the electronic client device, the first video content based on an analysis of the first set of signatures compared to reference signatures stored in a database; detecting by the electronic client device, a scale ratio mapping between video frames of the identified first video content playing on the electronic client device and reference frames stored in the database by calculating a scale ratio along an x axis (Sx) and a difference scale ratio along a y axis (Sy) for a query signature in the first set of signatures and a reliable matching reference signature in the database, wherein the calculated scale ratios Sx and Sy indicate that the video frames of the identified first video content and the reference frames are not in alignment; responsive to calculating the scale ratios, using the detected scale ratio mapping to generate subsequent signatures for subsequent incoming video frames; and monitoring the subsequent incoming video frames for the second video content by the electronic client device to determine that the video stream has made a scene change from the first video content to the second video content. 2. The method of claim 1 further comprising: replacing the second video content beginning at a time the scene change is verified with a third video that is different than the second video content. 3. The method of claim 2 , wherein the second video content is an expected advertisement within an expected content or program stream. 4. The method of claim 1 further comprising: displaying synchronous information on a selected device. 5. The method of claim 1 further comprising: providing a user selection of display devices for synchronous display of data in place of the identified second video content; providing a user choice of content for synchronous stream content in place of the identified second video content; and providing a user choice of scrolling in time to select content choices including synchronous content and the identified second video content on the client device. 6. The method of claim 1 , wherein identifying a time alignment comprises: sequence matching and correlation between reference and query fingerprints. 7. The method of claim 1 , wherein identifying a change in multimedia content comprises: detecting a scene change. 8. The method of claim 1 , wherein verifying a sample of multimedia content comprises: matching the sample of multimedia content with a first frame or an opening sequence of an expected multimedia content. 9. The method of claim 1 , wherein audio content is corrected for changes in pitch and changes in frequency response. 10. The method of claim 1 , wherein Sx is equal to (QA(x)−QB(x))/(RA(x)−RB(x)) and wherein QA(x)−QB(x) is a difference in query x coordinates of signature pairs A and B and RA(x)−RB(x) is a difference in reference x coordinates of the signature pairs A and B. 11. A method of providing time-aligned services on broadcast non-recorded content, the method comprising: analyzing by an electronic client device, incoming video of a broadcast live event based on comparisons with opening sequences of programs stored on the electronic client device to determine program details associated with the broadcast live event; generating by the electronic client device, fingerprints of selected frames of the incoming video; searching a reference database using the generated fingerprints to identify a likely match for content of the incoming video with content of an original video stored in the reference database; and correcting by the electronic client device, frame alignment in the incoming video based on a scale ratio mapping by utilizing a scale ratio for an x axis (Sx) and a different scale ratio for a y axis (Sy) between the selected frames of the incoming video and corresponding frames of a matched original video selected from the reference database and for generating subsequent fingerprints based on the scale ratio mapping to enable a time aligned service to overlay on aligned video frames for display on a display of the electronic client device information personalized for the electronic client device. 12. The method of claim 11 further comprises: searching on the electronic client device a subset reference database that is a subset of a remote database using the generated subsequent fingerprints in queries to minimize computational effort and improve accuracy in identification of the incoming video; and searching on the remote database in response to a match not being found in the subset reference database. 13. The method of claim 11 further comprises: using search reports and content analytics to determine a classification of content at time nodes and a similarity between the time nodes generated for the incoming video; generating edge weights between the time nodes, wherein the edge weights represent the similarity between two edge connected time nodes; and performing graph based segmentation to segment video based on the classification and similarity between the time nodes. 14. The method of claim 11 further comprises: determining a geometric correlation of video signatures and associated data comprising geometric coordinates and scale information between the incoming video and the matched original video to represent transformation of the matched original video that approximate the incoming video. 15. The method of claim 11 , wherein video segments of the incoming video represent chapters of a program, advertisements, and promotions. 16. The method of claim 11 further comprises: generating the fingerprints of incoming video including a determined region's location and scale information; determining a reliable match of content from incoming video with the reference database; and generating frame alignment information that includes pixel offsets along the x axis and along the y axis, the scale ratio for the x axis (Sx), and the scale ratio for the y axis (Sy) to represent transformation of the matched original video that approximate the incoming video. 17. The method of claim 16 , wherein generating frame alignment information comprises: generating pixel locations for each fingerprint generated for the incoming video; calculating the scale ratios Sx and Sy based on pixel differences between a query signature pair and a reference signature pair along each axis; summing up the scale ratios calculated on each axis and averaging while excluding outliers; and calculating a pixel offset on each axis for matching pairs of signatures based on the scale ratio on that axis and the determined region's location for the query signature pair and the reference signature pair. 18. A method to identify and track video content by an electronic user device, the method comprising: detecting by the electronic user device, a scale ratio mapping between incoming video frames received by the electronic user device and reference video frames stored in a reference database by calculating a scale ratio along an x axis (Sx) and a different scale ratio along a y axis (Sy) for a query signature in an incoming video frame and a reliable matching

Assignees

Inventors

Classifications

  • involving content or source identification data, e.g. Unique Material Identifier [UMID] · CPC title

  • involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream (arrangements characterised by components specially adapted for monitoring, identification or recognition of video in broadcast systems H04H60/59) · CPC title

  • for identifying segments of broadcast information, e.g. scenes or extracting programme ID · CPC title

  • of video {(recognising characters or patterns in general G06F18/00, G06V20/00)} · CPC title

  • Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9510044B1 cover?
Content segmentation, categorization and identification methods are described. Content tracking approaches are illustrated that are suitable for large scale deployment. Time-aligned applications such as multi-language selection, customized advertisements, second screen services and content monitoring applications can be economically deployed at large scales. A client performs fingerprinting, sc…
Who is the assignee on this patent?
Pereira Jose Pio, Kulkarni Sunil Suresh, Bolgarov Oleksiy, and 4 more
What technology area does this patent fall under?
Primary CPC classification H04N21/44008. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Nov 29 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).