Method of authenticating a printed document

US9349237B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9349237-B2
Application numberUS-201213730743-A
CountryUS
Kind codeB2
Filing dateDec 28, 2012
Priority dateDec 28, 2012
Publication dateMay 24, 2016
Grant dateMay 24, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for authenticating a printed document which carries barcode that encode authentication data, including word bounding boxes for each word in the original document image and data for reconstructing the original image. The printed document is scanned to generate a target document image, which is then segmented into text words. The word bounding boxes of the original and target document images are used to align the target document image. Then, each word in the original document image is compared to corresponding words in the target document image using word difference map and Hausdorff distance between them. Symbols of the original document image are further compared to corresponding symbols in the target document image using feature comparison, symbol difference map and Hausdorff distance comparison, and point matching. These various comparison results can identify alterations in the target document with respect to the original document, which can be visualized.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for authenticating a printed document, the printed document carrying barcode which encodes compressed image data representing a binary original document image, the method comprising: (a) obtaining an image representing the printed document; (b) separating the image into a target document image and the barcode; (c) decoding the barcode and decompressing the compressed image data therein to obtain the original document image; (d) binarizing the target document image; (e) aligning the target document image with respect to the original document image; (f) comparing each word in the original document image with a corresponding word in the target document image to detect any differences, comprising: (f1) for each word of the original document image obtained in step (c), finding the corresponding word of the target document image; (f2) generating a difference map and calculating a Hausdorff distance between each word of the original and the corresponding word of the target document image, and comparing the difference map and the Hausdorff distance to determine whether the corresponding words of the original and target document images are different; (f3) if the words of the original and target document images are not determined to be different in step (f2), identifying one or more candidate symbols in the word of the original document image and corresponding candidate symbols in the target document image; (f4) comparing image features of each candidate symbol of the original document image identified in step (f3) with image features of the corresponding candidate symbol of the target document image to determine whether any of the corresponding candidate symbols of the original and the target document images are different; (f5) if the corresponding symbols of the original and target document images are not determined to be different in step (f4), generating a difference map and calculating a Hausdorff distance between each candidate symbol of the original document image and the corresponding candidate symbol of the target document image, and comparing the difference map and the Hausdorff distance to determine whether any of the corresponding candidate symbols of the original and target document images are different; and (f6) if the corresponding symbols of the original and target document images are not determined to be different in step (f5), comparing shapes of each candidate symbol of the original document image and the corresponding candidate symbol of the target document image using a point matching method to determine whether any of the corresponding candidate symbols of the original and target document images are different; and (g) visualizing the differences detected in step (f). 2. The method of claim 1 , wherein the barcode further encodes a plurality of original word bounding boxes each corresponding to a word in the original document, wherein step (c) further includes obtaining the plurality of original word bounding boxes from the barcode, and wherein step (e) comprises: (e1) segmenting the target document image into words to obtain target word bounding boxes corresponding to words in the target document image; (e2) matching at least some of the plurality of original word bounding boxes obtained in step (c) and at least some of the target word bounding boxes obtained in step (e1) to align the target document mage; (e3) based on the alignment obtained in step (e2), further aligning the target document image using the target document image and the original document image. 3. The method of claim 2 , wherein the barcode further encodes a plurality of original text line bounding boxes each corresponding to a line of text in the original document image, wherein step (c) further includes obtaining the plurality of original text line bounding boxes from the barcode, and wherein step (e1) further includes segmenting the target document image into lines of text to obtain target text line bounding boxes corresponding to lines of text in the target document image, and wherein step (e2) further includes matching at least some of the plurality of original text line bounding boxes obtained in step (c) and at least some of the target text line bounding boxes obtained in step (e1) to align the target document image. 4. The method of claim 2 , wherein the step (f2) uses a RANSAC (RANdom SAmple Consensus) method. 5. The method of claim 1 , wherein step (a) comprises scanning the printed document to generate a scanned image and pre-processing the scanned image including de-noising, de-skewing, and/or correction of perspective distortions. 6. The method of claim 1 , wherein in step (f4), the image features include zoning profiles, side profiles, topology statistics, and low-order image moments. 7. The method of claim 1 , wherein step (g) includes displaying or printing the original or target document image with indications that indicate any words of the original document image or the corresponding words of the target document image that are determined to be different in step (f2) and any candidate symbols of the original document image or the corresponding candidate symbols of the target document image that are determined to be different in steps (f4), (f5) and (f6). 8. A computer program product comprising a computer usable non-transitory medium having a computer readable program code embedded therein for controlling a data processing apparatus, the computer readable program code configured to cause the data processing apparatus to execute a process for authenticating a printed document, the printed document carrying barcode which encodes compressed image data representing a binary original document image, the process comprising: (a) obtaining an image representing the printed document; (b) separating the image into a target document image and the barcode; (c) decoding the barcode and decompressing the compressed image data therein to obtain the original document image; (d) binarizing the target document image; (e) aligning the target document image with respect to the original document image; (f) comparing each word in the original document image with a corresponding word in the target document image to detect any differences, comprising: (f1) for each word of the original document image obtained in step (c), finding the corresponding word of the target document image; (f2) generating a difference map and calculating a Hausdorff distance between each word of the original and the corresponding word of the target document image, and comparing the difference map and the Hausdorff distance to determine whether the corresponding words of the original and target document images are different; (f3) if the words of the original and target document images are not determined to be different in step (f2), identifying one or more candidate symbols in the word of the original document image and corresponding candidate symbols in the target document image; (f4) comparing image features of each candidate symbol of the original document image identified in step (f3) with image features of the corresponding candidate symbol of the target document image to determine whether any of the corresponding candidate symbols of the original and the target document images are different; (f5) if the corresponding symbols of the original and target document images are not determined to be different in step (f4), generating a difference map and calculating a Hausdorff distance between each candidate symbol of the original document image and the corresponding candidate symbol of the target document image, and comparing the difference map and the Hausdorff distance to determine whether any of the corresponding candidate symbols of the original

Assignees

Inventors

Classifications

  • Testing specially adapted to determine the identity or genuineness of valuable papers or for segregating those which are unacceptable, e.g. banknotes that are alien to a currency · CPC title

  • G07D7/2058Primary

    Physics · mapped topic

  • G07D7/206Primary

    Matching template patterns · CPC title

  • B42D15/00Primary

    Printed matter of special format or style not otherwise provided for · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9349237B2 cover?
A method for authenticating a printed document which carries barcode that encode authentication data, including word bounding boxes for each word in the original document image and data for reconstructing the original image. The printed document is scanned to generate a target document image, which is then segmented into text words. The word bounding boxes of the original and target document im…
Who is the assignee on this patent?
Tian Yibin, Ming Wei, Konica Minolta Lab Usa Inc
What technology area does this patent fall under?
Primary CPC classification G07D7/2058. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 24 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).