Commercial detection based on audio fingerprinting

US9258604B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9258604-B1
Application numberUS-201414552039-A
CountryUS
Kind codeB1
Filing dateNov 24, 2014
Priority dateNov 24, 2014
Publication dateFeb 9, 2016
Grant dateFeb 9, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A commercial detection system generates a probe audio fingerprint of an audio signal associated with a media stream on a broadcast channel and determines whether the media stream has commercial content based on analysis of the probe audio fingerprint. The commercial detection system determines whether a same match between the probe audio fingerprint and a reference audio fingerprint is observed across multiple broadcast channels. Responsive to the number of same matches exceeding a predetermined threshold, the commercial detection system determines that the media stream has commercial content. The commercial detection system may also apply a trained feature analysis model to extracted acoustic features of the audio signal. The commercial detection system determines whether the media stream has commercial content based on a confidence score assigned to the probe audio fingerprint. The commercial detection system reduces false positive detection using program guide information of the media stream.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: receiving an audio signal including a plurality of frames, each frame representing a portion of the audio signal, the audio signal associated with a media stream on a broadcast channel; generating a probe audio fingerprint based on one or more of the plurality of frames; comparing the probe audio fingerprint with one or more reference audio fingerprints from a plurality of reference audio fingerprints, a reference audio fingerprint of the plurality of reference audio fingerprints representing a set of acoustic features of an audio signal associated with commercial content; determining whether the probe audio fingerprint matches a reference audio fingerprint of the plurality of reference audio fingerprints based on comparison of the probe audio fingerprint and the reference audio fingerprint; and responsive to a match between the probe audio fingerprint and the reference audio fingerprint of the plurality of reference audio fingerprints: determining whether an audio fingerprint associated with a media stream broadcast over a channel of a plurality of other broadcast channels matches the reference audio fingerprint: and determining that the media stream associated with the probe audio fingerprint has commercial content in response to the number of matches between the audio fingerprints associated with the plurality of other broadcast channels and the reference audio fingerprint exceeding a predetermined threshold. 2. The computer-implemented method of claim 1 , further comprising: responsive to a match between the probe audio fingerprint and the reference audio fingerprint of the plurality of reference audio fingerprints: determining whether an audio fingerprint associated with a media stream of a plurality of other media streams matches the reference audio fingerprint; determining that the media stream associated with the probe audio fingerprint has commercial content in response to the number of matches between the audio fingerprints associated with the plurality of other media streams and the reference audio fingerprint exceeding a predetermined threshold. 3. The computer-implemented method of claim 1 , further comprising: receiving program guide information associated with the media stream on the broadcast channel, the program guide information describing scheduling information of known commercials broadcast along with the media stream on the broadcast channel; and verifying the commercial content determination based on the program guide information. 4. The computer-implemented method of claim 1 , further comprising: receiving program guide information associated with other media streams on the plurality of other broadcast channels, the program guide information describing scheduling information of known commercials broadcast along with the other media streams on the other broadcast channels. 5. The computer-implemented method of claim 1 , further comprising: extracting a plurality of acoustic features from the audio signal associated with the media stream; applying a trained analysis model to the extracted acoustic features of the audio signal; assigning a confidence score to the probe audio signal based on the analysis of the extracted acoustic features of the media stream; detecting commercial content in the media stream based on the confidence score assigned to the probe audio signal. 6. The computer-implemented method of claim 5 , wherein the plurality of acoustic features of the audio signal comprise a plurality of acoustic features of the audio signal in frequency domain comprising: magnitude spectrum of the audio signal; Mel-frequency cepstral coefficients of the audios signal; spectral bandwidth of the audio signal; spectral flatness measure of the audio signal; spectral fluctuation of the audio signal; number of extreme value frequencies; and number of silent frequencies. 7. The computer-implemented method of claim 5 , wherein the plurality of acoustic features of the audio signal comprise a plurality of acoustic features of the audio signal in time domain comprising mean, standard deviation and covariance matrix of features vectors of the audio signal, the feature vectors of the audio signal being observed over a predefined time window. 8. The computer-implemented method of claim 5 , wherein the confidence score assigned to the probe audio fingerprint indicates a likelihood that the media stream associated with the probe audio fingerprint has commercial content. 9. The computer-implemented method of claim 5 , wherein assigning a confidence score to the probe audio signal comprises: determining a degree of similarities between the acoustic features associated with the probe audio fingerprint and the acoustic features of reference audio fingerprints of known commercial content. 10. A non-transitory computer-readable storage medium storing computer program instructions, executed by a computer processor, for detecting commercial in a media stream, the computer program instructions comprising instructions for: receiving an audio signal including a plurality of frames, each frame representing a portion of the audio signal, the audio signal associated with a media stream on a broadcast channel; generating a probe audio fingerprint based on one or more of the plurality of frames; comparing the probe audio fingerprint with one or more reference audio fingerprints from a plurality of reference audio fingerprints, a reference audio fingerprint of the plurality of reference audio fingerprints representing a set of acoustic features of an audio signal associated with commercial content; determining whether the probe audio fingerprint matches a reference audio fingerprint of the plurality of reference audio fingerprints based on comparison of the probe audio fingerprint and the reference audio fingerprint; and responsive to a match between the probe audio fingerprint and the reference audio fingerprint of the plurality of reference audio fingerprints: determining whether an audio fingerprint associated with a media stream broadcast over a channel of a plurality of other broadcast channels matches the reference audio fingerprint; and determining that the media stream associated with the probe audio fingerprint has commercial content in response to the number of matches between the audio fingerprints associated with the plurality of other broadcast channels and the reference audio fingerprint exceeding a predetermined threshold. 11. The computer-readable storage medium of claim 10 , further comprising computer program instructions for: responsive to a match between the probe audio fingerprint and the reference audio fingerprint of the plurality of reference audio fingerprints: determining whether an audio fingerprint associated with a media stream of a plurality of other media streams matches the reference audio fingerprint; determining that the media stream associated with the probe audio fingerprint has commercial content in response to the number of matches between the audio fingerprints associated with the plurality of other media streams and the reference audio fingerprint exceeding a predetermined threshold. 12. The computer-readable storage medium of claim 10 , further comprising computer program instructions for: receiving program guide information associated with the media stream on the broadcast channel, the program guide information describing scheduling information of known commercials broadcast along with the media stream on the broadcast channel; and verifying the commercial content determination based on the program guide information.

Assignees

Inventors

Classifications

  • involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream (arrangements characterised by components specially adapted for monitoring, identification or recognition of video in broadcast systems H04H60/59) · CPC title

  • using a grid, e.g. sorted out by channel and broadcast time · CPC title

  • involving watermark {(protecting executable software by watermarking G06F21/16; image watermarking in general G06T1/0021; watermarks inserted in still images for transmission purposes H04N1/32144; inserting watermarks during video coding H04N19/467)} · CPC title

  • Learning process for intelligent management, e.g. learning user preferences for recommending movies {(services using the results of monitoring in broadcast systems H04H60/61)} · CPC title

  • End-user interface for programme selection · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9258604B1 cover?
A commercial detection system generates a probe audio fingerprint of an audio signal associated with a media stream on a broadcast channel and determines whether the media stream has commercial content based on analysis of the probe audio fingerprint. The commercial detection system determines whether a same match between the probe audio fingerprint and a reference audio fingerprint is observed…
Who is the assignee on this patent?
Facebook Inc
What technology area does this patent fall under?
Primary CPC classification H04N21/4394. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Feb 09 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).