Media channel identification with video multi-match detection and disambiguation based on audio fingerprint

US11089360B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11089360-B2
Application numberUS-202016819657-A
CountryUS
Kind codeB2
Filing dateMar 16, 2020
Priority dateFeb 29, 2016
Publication dateAug 10, 2021
Grant dateAug 10, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed are methods and systems to help disambiguate channel identification in a scenario where a video fingerprint of media content matches multiple reference video fingerprints corresponding respectively with multiple different channels. Given such a multi-match situation, an entity could disambiguate based on an audio component of the media content, such as by further determining that an audio fingerprint of the media content at issue matches an audio fingerprint of just one of the multiple channels, thereby establishing that that is the channel on which the media content being rendered by the media presentation device is arriving.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: a network communication interface; at least one processing unit; non-transitory data storage; and program instructions stored in the non-transitory data storage and executable by the at least one processing unit to carry out operations including: determining that digital video fingerprint data representing media content being rendered by a media presentation device matches reference video fingerprint data corresponding with multiple channels, responsive to at least the determining that the digital video fingerprint data matches the reference video fingerprint data corresponding with the multiple channels, performing disambiguation based at least in part on a determination that digital audio fingerprint data representing the media content being rendered by the media presentation device matches reference audio fingerprint data corresponding with just a single channel of the multiple channels, the disambiguation establishing that the media content being rendered by the media presentation device is media content of the single channel, and taking action based on the establishing that the media content being rendered by the media presentation device is media content of the single channel. 2. The system of claim 1 , wherein the media content has a video track and an audio track, and wherein the digital video fingerprint data is fingerprint data representing the video track and the digital audio fingerprint data is fingerprint data representing the audio track. 3. The system of claim 1 , wherein the digital audio fingerprint data represents at least a language track of the media content. 4. The system of claim 1 , wherein the digital audio fingerprint data represents at least one of background music or a sound effect. 5. The system of claim 1 , wherein taking action based on the establishing that the media content being rendered by the media presentation device is media content of the single channel comprises causing the media presentation device to present supplemental channel-specific content in conjunction with the media content being rendered by the media presentation device. 6. The system of claim 5 , wherein the supplemental channel-specific content includes at least one of a pop-up advertisement, a commercial break, or a channel-identification. 7. The system of claim 5 , wherein the supplemental channel-specific content comprises an advertisement, and wherein causing the media presentation device to present the supplemental channel-specific content in conjunction with the media content being rendered by the media presentation device comprises causing the media presentation to present the advertisement as a replacement for a portion of the media content. 8. The system of claim 1 , wherein taking action based on the establishing that the media content being rendered by the media presentation device is media content of the single channel comprises inserting an advertisement in place of a portion of the media content. 9. The system of claim 1 , wherein taking action based on the establishing that the media content being rendered by the media presentation device is media content of the single channel comprises recording presentation of the single channel for use in a channel ratings system. 10. The system of claim 1 , wherein the system comprises the media presentation device. 11. The system of claim 1 , wherein the system comprises an entity other than the media presentation device, wherein the digital video fingerprint data and the digital audio fingerprint data are generated by the media presentation device, and wherein the operations further include: receiving by the system from the media presentation device the digital video fingerprint data and the digital audio fingerprint data. 12. The system of claim 11 , wherein receiving by the system the digital audio fingerprint data occurs after the determining that the digital video fingerprint data matches reference video fingerprint data corresponding with the multiple channels. 13. A non-transitory computer readable medium storing instructions executable by at least one processing unit to carry out operations including: determining that digital video fingerprint data representing media content being rendered by a media presentation device matches reference video fingerprint data corresponding with multiple channels; responsive to at least the determining that the digital video fingerprint data matches the reference video fingerprint data corresponding with the multiple channels, performing disambiguation based at least in part on a determination that digital audio fingerprint data representing the media content being rendered by the media presentation device matches reference audio fingerprint data corresponding with just a single channel of the multiple channels, the disambiguation establishing that the media content being rendered by the media presentation device is media content of the single channel; and taking action based on the establishing that the media content being rendered by the media presentation device is media content of the single channel. 14. The non-transitory computer readable medium of claim 13 , wherein the media content has a video track and an audio track, and wherein the digital video fingerprint data is fingerprint data representing the video track and the digital audio fingerprint data is fingerprint data representing the audio track. 15. The non-transitory computer readable medium of claim 13 , wherein the digital audio fingerprint data represents at least a language track of the media content. 16. The non-transitory computer readable medium of claim 13 , wherein the digital audio fingerprint data represents at least one of background music or a sound effect. 17. The non-transitory computer readable medium of claim 13 , wherein taking action based on the establishing that the media content being rendered by the media presentation device is media content of the single channel comprises causing the media presentation device to present supplemental channel-specific content in conjunction with the media content being rendered by the media presentation device. 18. The non-transitory computer readable medium of claim 17 , wherein the supplemental channel-specific content includes at least one of a pop-up advertisement, a commercial break, or a channel-identification. 19. The non-transitory computer readable medium of claim 17 , wherein the supplemental channel-specific content comprises an advertisement, and wherein causing the media presentation device to present the supplemental channel-specific content in conjunction with the media content being rendered by the media presentation device comprises causing the media presentation to present the advertisement as a replacement for a portion of the media content. 20. The non-transitory computer readable medium of claim 13 , wherein taking action based on the establishing that the media content being rendered by the media presentation device is media content of the single channel comprises recording presentation of the single channel for use in a channel ratings non-transitory computer readable medium.

Assignees

Inventors

Classifications

  • Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames · CPC title

  • Matching video sequences · CPC title

  • Generation or processing of descriptive data, e.g. content descriptors {(systems specially adapted for using meta-information in broadcast systems H04H60/73)} · CPC title

  • Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet (web site content organization and management for information retrieval from the Internet G06F16/958; transmission by internet of broadcast information H04H60/82; stock exchange data over packet-switching network H04L12/1804; push services including data channel over packet-switching network H04L12/1859) · CPC title

  • involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams (arrangements characterised by components specially adapted for monitoring, identification or recognition of audio in broadcast systems H04H60/58) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11089360B2 cover?
Disclosed are methods and systems to help disambiguate channel identification in a scenario where a video fingerprint of media content matches multiple reference video fingerprints corresponding respectively with multiple different channels. Given such a multi-match situation, an entity could disambiguate based on an audio component of the media content, such as by further determining that an a…
Who is the assignee on this patent?
Roku Inc, Gracenote Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/683. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 10 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).