System and method for video program recognition

US9432702B2 · US · B2

Patent metadata
Field               Value
Publication number  US-9432702-B2
Application number  US-201414325004-A
Country             US
Kind code           B2
Filing date         Jul 7, 2014
Priority date       Jul 7, 2014
Publication date    Aug 30, 2016
Grant date          Aug 30, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for video program recognition includes recording video sequences of each of a plurality of channels and obtaining a set of features of an image captured by a user device. The method also includes performing a camera registration operation on the video sequences to select video frames representing the recorded video sequences based on key video frames from registered cameras associated with the recorded video sequences. Further, the method includes extracting features of at least some of the selected plurality of video frames of each channel, storing the extracted features, and comparing the obtained set of features with the stored features of each channel to determine whether there is a match between the obtained set of features of the image and features of a video frame from a specific channel and, when there is a match, determining the specific channel as a recognized channel number.
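
The comparison step in the abstract can be illustrated with a minimal sketch. It assumes features are plain numeric vectors and uses a nearest-neighbour search with a distance cutoff as a stand-in for the matching step; the abstract does not specify the feature type or the matcher, and the names `recognize_channel`, `stored`, and `max_distance` are hypothetical.

```python
from math import sqrt


def distance(a, b):
    """Euclidean distance between two equally long feature vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def recognize_channel(query, stored, max_distance=0.5):
    """Return the channel whose stored features best match the query.

    `stored` maps a channel number to the feature vectors extracted from
    that channel's selected frames. `max_distance` is an assumed cutoff:
    if even the closest stored vector is farther away, no match is reported.
    """
    best_channel, best_dist = None, float("inf")
    for channel, vectors in stored.items():
        for vec in vectors:
            d = distance(query, vec)
            if d < best_dist:
                best_channel, best_dist = channel, d
    return best_channel if best_dist <= max_distance else None


# Hypothetical stored features for two channels.
stored = {
    5: [[0.1, 0.9, 0.3], [0.2, 0.8, 0.4]],
    7: [[0.9, 0.1, 0.5]],
}

near_match = recognize_channel([0.88, 0.12, 0.5], stored)
no_match = recognize_channel([5.0, 5.0, 5.0], stored)
```

In this toy data the first query lies close to the single vector stored for channel 7, while the second is far from every stored vector and yields no recognized channel.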

First claim

Opening claim text (preview).

What is claimed is:

1. A method for video program recognition on a server system, comprising:
    recording, by the server system, video sequences of each of a plurality of channels, wherein the video sequences of each of the plurality of channels are formed by interleaved segments of camera takes filmed with a plurality of original camera setups, and a camera take is a series of frames filmed with one camera setup;
    performing, by the server system, a camera registration operation on the video sequences of each of the plurality of channels to select a plurality of video frames representing the recorded video sequences of each of the plurality of channels based on key video frames from a plurality of registered cameras associated with the recorded video sequences;
    extracting features of at least some of the selected plurality of video frames of each of the plurality of channels;
    storing the extracted features of each of the plurality of channels;
    obtaining a set of features of an image captured by a first user device of a video frame from one of the plurality of channels and played on a second user device different from the first user device;
    comparing the obtained set of features of the image with the stored features of each of the plurality of channels using an image classifier to determine whether there is a match between the obtained set of features of the image and features of a video frame from a specific channel; and
    when there is a match between the obtained set of features of the image and features of a video frame from a specific channel, determining a channel number of the specific channel as a recognized channel number;
    wherein performing the camera registration operation further includes:
    detecting camera takes in the recorded video sequences of each of the plurality of channels such that the recorded video sequences are separated into a number of sequences with each corresponding to a registered camera that is aligned to one of the plurality of the original camera setups;
    forming a registered camera pool of camera takes for the detected camera takes for each of the plurality of channels;
    calculating frame difference between consecutive frames using one of a plurality of algorithms;
    detecting a new video shot when the frame difference is above a preset threshold value;
    selecting a key frame to represent visual content of each detected video shot;
    matching the key frame with a last shot of each registered camera based on a matching criterion, wherein when a registered camera is initially detected, a first shot is used as the last shot of the registered camera; and
    when the matching criterion is satisfied, adding the key frame to an end of the matching camera take.

2. The method according to claim 1, wherein: the first user device is a mobile device and the mobile device captures the image by a camera.

3. The method according to claim 2, wherein: the second user device is a TV; and the TV and the mobile device are associated with one or more multi-screen applications.

4. The method according to claim 2, wherein: the mobile device extracts the set of features of the image in same manner as the features of the selected plurality of video frames of each of the plurality of channels are extracted.

5. The method according to claim 1, further including: performing an adaptive flow control operation to adjust a throughput of selected plurality of frames from the camera registration operation to match a throughput of extracting features of each of the plurality of channels.

6. The method according to claim 5, wherein performing the adaptive flow control operation further includes: calculating a ranking score of a video frame of a channel outputted from the camera registration operation based on at least according to a priority of selection of the channel based on user inquiry and a similarity score between the video frame and a last selected frame of a corresponding camera take; and during the adaptive flow control operation, selecting a video frame with a highest ranking score for feature extraction.

7. The method according to claim 5, wherein: the extracted features of the plurality of video frames includes global features describing an entire frame image and local features using descriptors of local image neighborhoods computed at multiple interest points.

8. The method according to claim 7, wherein comparing the obtained set of features of the image with the stored features of each of the plurality of channels further includes: under a processing congestion condition, comparing the obtained set of features of the image with the stored features of each of the plurality of channels using the global features; and under a processing non-congestion condition, comparing the obtained set of features of the image with the stored features of each of the plurality of channels using the local features.

9. The method according to claim 1, wherein performing the camera registration operation further includes: when no matched image is added to a camera take for a preconfigured time period, removing the camera take from the registered camera pool.

10. The method according to claim 6, wherein: when a time period that the channel has no new frame selection is less than a preset threshold, the ranking score of the video frame is calculated according to number of inquiries of the channel and the similarity score between the video frame and the last selected frame of the corresponding camera take; when the last selected frame of the corresponding camera take does not exist, the similarity score is zero; and when a time period that the channel has no new frame selection is greater than a preset threshold, the ranking score of the video frame is a constant.

11. A video program recognition system, comprising:
    a recorder configured to record video sequences of each of a plurality of channels, wherein the video sequences of each of the plurality of channels are formed by interleaved segments of camera takes filmed with a plurality of original camera setups, and a camera take is a series of frames filmed with one camera setup;
    a server having one or more processors configured to:
    obtain a set of features of an image captured by a first user device of a video frame from one of the plurality of channels and played on a second user device different from the first user device, wherein the first user device includes a communication module configured to transmit the set of features of the image captured by the first user device to the server;
    perform a camera registration operation on the video sequences of each of the plurality of channels to select a plurality of video frames representing the recorded video sequences of each of the plurality of channels based on key video frames from a plurality of registered cameras associated with the recorded video sequences from the recorder; and
    extract features of at least some of the selected plurality of video frames of each of the plurality of channels; and
    a feature database configured to store the extracted features of each of the plurality of channels from the server, wherein the server is further configured to:
    compare the obtained set of features of the image with the stored features of each of the plurality of channels to determine whether there is a match between the obtained set of features of the image and features of a video frame from a specific channel; and
    when there is a match between the obtained set of features of the image and features of a video frame from a specific channel, determine a channel number of the specific channel as a recognized channel number; and
    wherein, to perform the camera registra
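
The camera registration steps recited in claim 1 (frame difference, shot detection, key-frame selection, matching against each registered camera's last shot) can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: frames are flat lists of grayscale pixel values, the frame difference is a mean absolute difference (the claim only says "one of a plurality of algorithms"), the key frame is simply the first frame of each shot, and the function names and both thresholds are hypothetical.

```python
def frame_difference(a, b):
    """Mean absolute pixel difference between two equally sized frames."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def detect_key_frames(frames, shot_threshold):
    """Return one key frame per detected shot.

    A new shot starts when the difference to the previous frame exceeds
    `shot_threshold`. Assumption: the first frame of a shot is its key frame.
    """
    key_frames = []
    previous = None
    for frame in frames:
        if previous is None or frame_difference(previous, frame) > shot_threshold:
            key_frames.append(frame)
        previous = frame
    return key_frames


def register_cameras(key_frames, match_threshold):
    """Group key frames into camera takes (the registered camera pool).

    Each key frame is compared with the last key frame of every registered
    camera. On a match it is appended to that take; otherwise a new camera
    is registered whose first shot also serves as its last shot.
    """
    pool = []
    for key in key_frames:
        for take in pool:
            if frame_difference(take[-1], key) <= match_threshold:
                take.append(key)
                break
        else:
            pool.append([key])
    return pool


# Toy sequence: dark camera A, bright camera B, then back to camera A.
frames = [
    [10, 10, 10, 10], [11, 10, 10, 10],
    [200, 200, 200, 200], [201, 200, 200, 200],
    [12, 10, 10, 10], [12, 11, 10, 10],
]
keys = detect_key_frames(frames, shot_threshold=50)
pool = register_cameras(keys, match_threshold=5)
```

With this toy input, three shots are detected and the third key frame rejoins the first camera take, so the pool holds two registered cameras. Removing stale takes after a preconfigured idle period (claim 9) is left out of the sketch.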

Assignees

Tcl Res America Inc

Inventors

Classifications

  • Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames · CPC title

  • Determining representative reference patterns, e.g. averaging or distorting patterns; Generating dictionaries · CPC title

  • Salient features, e.g. scale invariant feature transforms [SIFT] · CPC title

  • structured as a network, e.g. client-server architectures · CPC title

  • Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9432702B2 cover?
A method for video program recognition includes recording video sequences of each of a plurality of channels and obtaining a set of features of an image captured by a user device. The method also includes performing a camera registration operation on the video sequences to select video frames representing the recorded video sequences based on key video frames from registered cameras associate…
Who is the assignee on this patent?
Tcl Res America Inc
What technology area does this patent fall under?
Primary CPC classification H04N21/23418. Mapped technology areas include Electricity.
When was this patent published?
Publication date Aug 30, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).