Method and apparatus for extracting a fingerprint of video having a plurality of frames

US12322181B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12322181-B2
Application numberUS-202217820359-A
CountryUS
Kind codeB2
Filing dateAug 17, 2022
Priority dateAug 18, 2021
Publication dateJun 3, 2025
Grant dateJun 3, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for extracting a fingerprint of a video having a plurality of frames includes obtaining a plurality of pixel value matrices from each of the plurality of frames, calculating maximum values of average pixel values in each axis of the plurality of pixel value matrices for each of the plurality of frames, and calculating the fingerprint of the video based on a temporal correlation of the maximum values calculated for the plurality of frames.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for extracting a fingerprint of a video having a plurality of frames, the method being executed by one or more processors and comprising: extracting the plurality of frames from the video; obtaining a plurality of pixel value matrices from each of the plurality of frames; calculating maximum values of average pixel values in each axis of the plurality of pixel value matrices for each of the plurality of frames; calculating the fingerprint of the video based on a temporal correlation of the maximum values calculated for the plurality of frames; and comparing the fingerprint of the video with the fingerprint of an original video and determining if the fingerprint of the video is within an error range of at least a portion of the fingerprint of the original video. 2. The method according to claim 1 , wherein the calculating of the maximum values of the average pixel values includes: calculating maximum values of average pixel values in a horizontal axis of the plurality of pixel value matrices for each of the plurality of frames; and calculating maximum values of average pixel values in a vertical axis of the plurality of pixel value matrices for each of the plurality of frames. 3. The method according to claim 1 , wherein the calculating of the fingerprint of the video includes performing discrete cosine transform (DCT) on the maximum values calculated for the plurality of frames. 4. The method according to claim 3 , wherein the performing of the discrete cosine transform includes: calculating a first DCT coefficient by performing a discrete cosine transform on the maximum values of the average pixel values in the horizontal axis of the plurality of pixel value matrices in each of the plurality of frames; calculating a second DCT coefficient by performing a discrete cosine transform on the maximum values of the average pixel values in the vertical axis of the plurality of pixel value matrices in each of the plurality of frames; and generating the fingerprint of the video by combining the first DCT coefficient and the second DCT coefficient. 5. The method according to claim 3 , wherein the calculating of the fingerprint of the video further includes excluding a coefficient having a basis frequency of 0 from the DCT coefficients calculated by the discrete cosine transform. 6. The method according to claim 1 , wherein the calculating of the maximum value of the average pixel values includes: calculating the maximum values of the average pixel values in the horizontal axis of the plurality of pixel value matrices, and the maximum values of the average pixel values in the vertical axis of the plurality of pixel value matrices for each of the plurality of frames, and the calculating of the fingerprint of the video includes: calculating a first DCT coefficient by performing a discrete cosine transform on the maximum values of the average pixel values in the horizontal axis of the plurality of pixel value matrices in each of the plurality of frames; calculating a second DCT coefficient by performing a discrete cosine transform on the maximum values of the average pixel values in the vertical axis of the plurality of pixel value matrices in each of the plurality of frames; and generating the fingerprint of the video by combining the first DCT coefficient and the second DCT coefficient. 7. The method according to claim 6 , wherein the calculating of the fingerprint of the video further includes excluding a coefficient having a basis frequency of 0 from the first DCT coefficient and the second DCT coefficient. 8. The method according to claim 1 , further comprising: obtaining a fingerprint of an original video; and determining whether the video is a tampered video by comparing the obtained fingerprint of the original video with the extracted fingerprint of the video. 9. A non-transitory computer-readable recording medium storing instructions that, when executed by one or more processors, cause performance of the method according to claim 1 . 10. A computing device, comprising: a memory; and one or more processors connected to the memory and configured to execute one or more computer-readable programs stored in the memory for extracting a fingerprint of a video, wherein the one or more programs include instructions for: extracting the plurality of frames from the video; obtaining a plurality of pixel value matrices from each of a plurality of frames included in a video; calculating maximum values of average pixel values in each axis of the plurality of pixel value matrices for each of the plurality of frames; calculating a fingerprint of the video based on a temporal correlation of the maximum values calculated for the plurality of frames; and comparing the fingerprint of the video with the fingerprint of an original video and determining if the fingerprint of the video is within an error range of at least a portion of the fingerprint of the original video. 11. The computing device according to claim 10 , wherein the calculating of the maximum values of the average pixel values includes: calculating maximum values of average pixel values in a horizontal axis of the plurality of pixel value matrices for each of the plurality of frames; and calculating maximum values of average pixel values in a vertical axis of the plurality of pixel value matrices for each of the plurality of frames. 12. The computing device according to claim 10 , wherein the calculating of the fingerprint of the video includes performing discrete cosine transform on the maximum values calculated for each of the plurality of frames. 13. The computing device according to claim 12 , wherein the performing of the discrete cosine transform includes: calculating a first DCT coefficient by performing a discrete cosine transform on the maximum values of the average pixel values in the horizontal axis of the plurality of pixel value matrices in each of the plurality of frames; calculating a second DCT coefficient by performing a discrete cosine transform on the maximum values of the average pixel values in the vertical axis of the plurality of pixel value matrices in each of the plurality of frames; and generating the fingerprint of the video by combining the first DCT coefficient and the second DCT coefficient. 14. The computing device according to claim 12 , wherein the calculating of the fingerprint of the video includes excluding a coefficient having a basis frequency of 0 from the DCT coefficients calculated by the discrete cosine transform. 15. The computing device according to claim 10 , wherein the calculating of the maximum value of the average pixel values includes: calculating the maximum values of the average pixel values in the horizontal axis of the plurality of pixel value matrices, and the maximum values of the average pixel values in the vertical axis of the plurality of pixel value matrices for each of the plurality of frames, and the calculating of the fingerprint of the video includes: calculating a first DCT coefficient by performing a discrete cosine transform on the maximum values of the average pixel values in the horizontal axis of the plurality of pixel value matrices in each of the plurality of frames; calculating a second DCT coefficient by performing a discrete cosine transform on the maximum values of the average pixel values in the vertical axis of the plurality of pixel value matrices in each of the plurality of frames; and generating the fingerprint of the video by combining the first DCT coefficient and the second DCT coefficient. 16

Assignees

Inventors

Classifications

  • G06V20/46Primary

    Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames · CPC title

  • Transform-based matching, e.g. Hough transform · CPC title

  • Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items (segmenting video sequences G06V20/49) · CPC title

  • Discrete cosine transform [DCT] · CPC title

  • involving embedding information at multiplex stream level, e.g. embedding a watermark at packet level · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12322181B2 cover?
A method for extracting a fingerprint of a video having a plurality of frames includes obtaining a plurality of pixel value matrices from each of the plurality of frames, calculating maximum values of average pixel values in each axis of the plurality of pixel value matrices for each of the plurality of frames, and calculating the fingerprint of the video based on a temporal correlation of the …
Who is the assignee on this patent?
Naver Corp
What technology area does this patent fall under?
Primary CPC classification G06V20/46. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 03 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).