Media channel identification with video multi-match detection and disambiguation based on audio fingerprint

US11412296B2 · US · B2

Patent metadata
Publication number: US-11412296-B2
Application number: US-202117305116-A
Country: US
Kind code: B2
Filing date: Jun 30, 2021
Priority date: Feb 29, 2016
Publication date: Aug 9, 2022
Grant date: Aug 9, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed are methods and systems to help disambiguate channel identification in a scenario where a video fingerprint of media content matches multiple reference video fingerprints corresponding respectively with multiple different channels. Given such a multi-match situation, an entity could disambiguate based on an audio component of the media content, such as by further determining that an audio fingerprint of the media content at issue matches an audio fingerprint of just one of the multiple channels, thereby establishing that that is the channel on which the media content being rendered by the media presentation device is arriving.
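The flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: it assumes fingerprints are small integers compared by Hamming distance, and the function names, the `max_distance` threshold, and the channel names are all hypothetical.

```python
from typing import Dict, List, Optional


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two integer fingerprints (assumed format)."""
    return bin(a ^ b).count("1")


def matching_channels(query: int, references: Dict[str, int],
                      max_distance: int = 2) -> List[str]:
    """Channels whose reference fingerprint is within max_distance bits of the query."""
    return sorted(ch for ch, ref in references.items()
                  if hamming(query, ref) <= max_distance)


def identify_channel(video_fp: int, audio_fp: int,
                     video_refs: Dict[str, int],
                     audio_refs: Dict[str, int]) -> Optional[str]:
    """Return the single matching channel, or None if it cannot be established."""
    video_matches = matching_channels(video_fp, video_refs)
    if not video_matches:
        return None
    if len(video_matches) == 1:
        return video_matches[0]
    # Multi-match: compare audio only against the channels the video matched.
    candidate_audio = {ch: audio_refs[ch] for ch in video_matches if ch in audio_refs}
    audio_matches = matching_channels(audio_fp, candidate_audio)
    return audio_matches[0] if len(audio_matches) == 1 else None


# Hypothetical data: two channels carry identical video with different audio.
video_refs = {"news_en": 0b10110010, "news_es": 0b10110010, "sports": 0b01001101}
audio_refs = {"news_en": 0b11110000, "news_es": 0b00001111, "sports": 0b10101010}

result = identify_channel(0b10110011, 0b00001110, video_refs, audio_refs)
unresolved = identify_channel(0b10110011, 0b10101010, video_refs, audio_refs)
print(result, unresolved)
```

Here the video fingerprint alone matches both `news_en` and `news_es`; the audio fingerprint narrows the result to one of them. When the audio matches none of the candidates, the sketch returns `None` rather than guessing.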

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising: determining, by a computing system, that digital video fingerprint data representing media content being received by a media presentation device matches reference video fingerprint data corresponding with multiple channels; responsive to at least determining that the digital video fingerprint data matches the reference video fingerprint data corresponding with multiple channels, performing disambiguation by the computing system based at least in part on a determination that digital audio fingerprint data representing the media content being received by the media presentation device matches reference audio fingerprint data corresponding with just a single channel of the multiple channels, the disambiguation establishing that the media presentation device is receiving the media content on the single channel; and taking action based on the establishing that the media presentation device is receiving the media content on the single channel.

2. The method of claim 1, wherein the media content has a video track and an audio track, wherein the digital video fingerprint data is fingerprint data representing the video track and the digital audio fingerprint data is fingerprint data representing the audio track.

3. The method of claim 1, wherein the digital audio fingerprint data represents at least a language track of the media content.

4. The method of claim 1, wherein taking action based on the establishing that the media presentation device is receiving the media content on the single channel comprises causing presentation of supplemental channel-specific content in conjunction with the media content being received by the media presentation device.

5. The method of claim 4, wherein the supplemental channel-specific content includes at least one of a pop-up advertisement, a commercial break, or a channel-identification.

6. The method of claim 4, wherein the supplemental channel-specific content comprises an advertisement, and wherein causing presentation of the supplemental channel-specific content in conjunction with the media content being received by the media presentation device comprises causing presentation of the advertisement as a replacement for a portion of the media content.

7. The method of claim 1, wherein taking action based on the establishing that the media presentation device is receiving the media content on the single channel comprises inserting an advertisement in place of a portion of the media content.

8. A system comprising: a network communication interface; at least one processing unit; non-transitory data storage; and program instructions stored in the non-transitory data storage and executable by the at least one processing unit to carry out operations including: determining that digital video fingerprint data representing media content being received by a media presentation device matches reference video fingerprint data corresponding with multiple channels, responsive to at least the determining that the digital video fingerprint data matches the reference video fingerprint data corresponding with the multiple channels, performing disambiguation based at least in part on a determination that digital audio fingerprint data representing the media content being received by the media presentation device matches reference audio fingerprint data corresponding with just a single channel of the multiple channels, the disambiguation establishing that the media content being received by the media presentation device is media content of the single channel, and taking action based on the establishing that the media content being received by the media presentation device is media content of the single channel.

9. The system of claim 8, wherein the media content has a video track and an audio track, and wherein the digital video fingerprint data is fingerprint data representing the video track and the digital audio fingerprint data is fingerprint data representing the audio track.

10. The system of claim 8, wherein the digital audio fingerprint data represents at least a language track of the media content.

11. The system of claim 8, wherein the digital audio fingerprint data represents at least one of background music or a sound effect.

12. The system of claim 8, wherein taking action based on the establishing that the media content being received by the media presentation device is media content of the single channel comprises causing presentation of supplemental channel-specific content in conjunction with the media content being received by the media presentation device.

13. The system of claim 12, wherein the supplemental channel-specific content includes at least one of a pop-up advertisement, a commercial break, or a channel-identification.

14. The system of claim 12, wherein the supplemental channel-specific content comprises an advertisement, and wherein causing presentation of the supplemental channel-specific content in conjunction with the media content being received by the media presentation device comprises causing presentation of the advertisement as a replacement for a portion of the media content.

15. The system of claim 8, wherein taking action based on the establishing that the media content being received by the media presentation device is media content of the single channel comprises inserting an advertisement in place of a portion of the media content.

16. The system of claim 8, wherein taking action based on the establishing that the media content being received by the media presentation device is media content of the single channel comprises recording presentation of the single channel for use in a channel ratings system.

17. The system of claim 8, wherein the system comprises the media presentation device.

18. The system of claim 8, wherein the system comprises an entity other than the media presentation device, wherein the digital video fingerprint data and the digital audio fingerprint data are generated by the media presentation device, and wherein the operations further include: receiving by the system from the media presentation device the digital video fingerprint data and the digital audio fingerprint data.

19. The system of claim 18, wherein receiving by the system the digital audio fingerprint data occurs after the determining that the digital video fingerprint data matches reference video fingerprint data corresponding with the multiple channels.

20. A non-transitory computer readable medium storing instructions executable by at least one processing unit to carry out operations including: determining that digital video fingerprint data representing media content being received by a media presentation device matches reference video fingerprint data corresponding with multiple channels; responsive to at least the determining that the digital video fingerprint data matches the reference video fingerprint data corresponding with the multiple channels, performing disambiguation based at least in part on a determination that digital audio fingerprint data representing the media content being received by the media presentation device matches reference audio fingerprint data corresponding with just a single channel of the multiple channels, the disambiguation establishing that the media content being received by the media presentation device is media content of the single channel; and taking action based on the establishing that the media content being received by the media presentation device is media content
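The arrangement in claims 16, 18, and 19 (a separate system that receives fingerprints from the device, obtains the audio fingerprint only after a video multi-match, and records the result for ratings) might look something like the sketch below. Everything here is an assumption made for illustration: the `ChannelServer` class, the string fingerprints, the exact-equality matching, and the `request_audio_fp` callback standing in for a round trip to the media presentation device.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class ChannelServer:
    """Hypothetical server holding reference fingerprints keyed by channel."""
    video_refs: Dict[str, str]
    audio_refs: Dict[str, str]
    ratings_log: List[str] = field(default_factory=list)

    def handle_video_fingerprint(
        self, video_fp: str, request_audio_fp: Callable[[], str]
    ) -> Optional[str]:
        # Step 1: find every channel whose reference video fingerprint matches.
        candidates = [ch for ch, ref in self.video_refs.items() if ref == video_fp]
        # Step 2: only on a multi-match, ask the device for an audio fingerprint.
        if len(candidates) > 1:
            audio_fp = request_audio_fp()
            candidates = [ch for ch in candidates
                          if self.audio_refs.get(ch) == audio_fp]
        if len(candidates) != 1:
            return None  # channel could not be established
        # Step 3: take action; here, record the channel for a ratings system.
        channel = candidates[0]
        self.ratings_log.append(channel)
        return channel


audio_requests: List[str] = []


def device_audio_fp() -> str:
    """Stand-in for the device generating and sending its audio fingerprint."""
    audio_requests.append("requested")
    return "aud-B"


server = ChannelServer(
    video_refs={"A": "vid-1", "B": "vid-1", "C": "vid-2"},
    audio_refs={"A": "aud-A", "B": "aud-B", "C": "aud-C"},
)
unique = server.handle_video_fingerprint("vid-2", device_audio_fp)
requests_after_unique = len(audio_requests)
ambiguous = server.handle_video_fingerprint("vid-1", device_audio_fp)
print(unique, ambiguous, server.ratings_log)
```

Deferring the audio request keeps the device from generating and sending audio fingerprints when the video fingerprint already identifies one channel. A real system would use tolerant matching rather than string equality.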

Assignees

Roku Inc

Inventors

Classifications

  • involving content or source identification data, e.g. Unique Material Identifier [UMID]

  • Matching video sequences

  • involving advertisement data (advertising per se G06Q30/02)

  • of video {(recognising characters or patterns in general G06F18/00, G06V20/00)}

  • of audio {(determination or detection of speech characteristics in general G10L25/00; speech recognition in general G10L15/00)}

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11412296B2 cover?
Disclosed are methods and systems to help disambiguate channel identification in a scenario where a video fingerprint of media content matches multiple reference video fingerprints corresponding respectively with multiple different channels. Given such a multi-match situation, an entity could disambiguate based on an audio component of the media content, such as by further determining that an a…
Who is the assignee on this patent?
Roku Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/683. Mapped technology areas include Physics.
When was this patent published?
Publication date: Aug 9, 2022 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).