Systems and methods for robust industrial optical character recognition

US10810465B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10810465-B2
Application number  US-201816024910-A
Country             US
Kind code           B2
Filing date         Jul 1, 2018
Priority date       Jun 30, 2017
Publication date    Oct 20, 2020
Grant date          Oct 20, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An auto-encoder is configured to verify character detection and/or classification results generated by an automated optical character recognition system. The auto-encoder may be trained to reconstruct visual representations of the detected character, and a determination of whether the character detection result comprises a true positive or false positive may be based on a reconstruction error between the image data in which the character was detected and a reconstructed image generated by the auto-encoder.
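
The verification idea in the abstract can be illustrated with a short sketch. This is not the patented implementation: the trained auto-encoder is replaced by a hypothetical stand-in (`toy_reconstruct`) that always returns a fixed template, and the names `rmse`, `is_true_positive`, and the threshold value 0.2 are assumptions chosen for the example. It only shows how a reconstruction error could separate a plausible detection from an implausible one.

```python
import math
from typing import Callable, List

Image = List[List[float]]  # grayscale patch, row-major, values in [0, 1]


def rmse(a: Image, b: Image) -> float:
    """Root-mean-square error between two equally sized patches."""
    squared = [(x - y) ** 2 for row_a, row_b in zip(a, b)
               for x, y in zip(row_a, row_b)]
    return math.sqrt(sum(squared) / len(squared))


def is_true_positive(patch: Image,
                     reconstruct: Callable[[Image], Image],
                     threshold: float) -> bool:
    """Accept the detection when the reconstruction error stays at or
    below the threshold; reject it as a false positive otherwise."""
    return rmse(patch, reconstruct(patch)) <= threshold


# ASSUMPTION: stand-in for an auto-encoder trained on one character.
# A real network would encode and decode the patch; this toy version
# ignores its input and returns the 3x3 "vertical stroke" it has learned.
TEMPLATE: Image = [[0.0, 1.0, 0.0],
                   [0.0, 1.0, 0.0],
                   [0.0, 1.0, 0.0]]


def toy_reconstruct(patch: Image) -> Image:
    return [row[:] for row in TEMPLATE]


# A slightly noisy stroke (should pass) and an inverted patch (should fail).
good: Image = [[0.1, 0.9, 0.0], [0.0, 1.0, 0.1], [0.0, 0.9, 0.0]]
bad: Image = [[1.0, 0.0, 1.0], [1.0, 0.0, 1.0], [1.0, 0.0, 1.0]]

if __name__ == "__main__":
    for name, patch in (("good", good), ("bad", bad)):
        verdict = is_true_positive(patch, toy_reconstruct, threshold=0.2)
        print(name, round(rmse(patch, toy_reconstruct(patch)), 3), verdict)
```

The point of the design is that an auto-encoder trained on one character reconstructs that character well and everything else poorly, so a large error suggests the classifier fired on something that is not the character.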

First claim

Opening claim text (preview).

We claim:

1. A method of operation in an image processing system for recognizing one or more characters in an image, the method comprising: receiving a classifier result indicating detection of a character at specified coordinates within the image; retrieving input image data corresponding to the specified coordinates within the image; generating a reconstruction of the input image data by use of an auto-encoder trained to replicate the detected character; and determining whether the classifier result is a true positive based on a reconstruction error between the input image data and the reconstruction of the input image data, wherein generating the reconstruction of the input image data comprises: processing the input image data through a plurality of encoder layers of the auto-encoder to produce an inner layer representation of the input image data; and decoding the inner layer representation by use of a plurality of decoder layers of the auto-encoder to generate the reconstruction of the input image data from the inner layer representation.

2. The method of claim 1, further comprising calculating the reconstruction error between the input image data and reconstructed image data generated by the auto-encoder.

3. The method of claim 2, wherein calculating the reconstruction error comprises computing a root-mean-square error between the input image data and the reconstructed image data.

4. The method of claim 1, further comprising invalidating the classifier result in response to the reconstruction error exceeding a determined verification threshold.

5. A method of operation in an image processing system for recognizing one or more characters in an image, the method comprising: receiving a classifier result indicating detection of a character at specified coordinates within the image; retrieving input image data corresponding to the specified coordinates within the image; generating a reconstruction of the input image data by use of an auto-encoder trained to replicate the detected character; determining whether the classifier result is a true positive based on a reconstruction error between the input image data and the reconstruction of the input image; acquiring a plurality of different sets of input image data corresponding to the classifier result; generating a plurality of reconstruction images by use of the auto-encoder, each reconstruction image corresponding to a respective one of the different sets of input image data; calculating a plurality of reconstruction errors, each reconstruction error between a respective one of the different sets of the input image data and corresponding reconstruction image; and determining whether the classification result comprises a true positive based on a minimum reconstruction error of the plurality of reconstruction errors.

6. The method of claim 5, wherein generating the plurality of different sets of input image data comprises retrieving respective sets of input image data from the image within an enlarged area around the specified coordinates, the enlarged area corresponding to a size of one or more ground-truth images used to train the auto-encoder.

7. A method of operation in an image processing system for recognizing one or more characters in an image, the method comprising: receiving a classifier result indicating detection of a character at specified coordinates within the image; retrieving input image data corresponding to the specified coordinates within the image; generating a reconstruction of the input image data by use of an auto-encoder trained to replicate the detected character; determining whether the classifier result is a true positive based on a reconstruction error between the input image data and the reconstruction of the input image; invalidating the classifier result in response to the reconstruction error exceeding a determined verification threshold; and determining the reconstruction error based on reconstruction errors of one or more ground-truth images used to train the auto-encoder.

8. An image processing system, comprising: an auto-encoder processor, comprising: a plurality of encode layers configured to generate a compressed representation of input image data corresponding to a character classification result, a plurality of decode layers configured to generate a reconstructed image from the compressed representation of the input image data, and a difference calculator configured to determine a reconstruction error between the input image data and the reconstructed image, wherein the auto-encoder processor is configured to determine whether the character classification result is one of a true positive and a false positive based on the reconstruction error.

9. The system of claim 8, wherein: each encode layer of the plurality of encode layers comprises a respective neural network comprising a plurality of neurons; and each decode layer of the plurality of decode layers comprises a respective neural network comprising a plurality of neurons.

10. The system of claim 9, wherein: each neuron of a first encode layer of the plurality of encode layers comprises a two-dimensional convolutional filter, and each neuron of the encode layers other than the first encode layer comprises a one-dimensional convolutional filter.

11. The system of claim 9, wherein: the plurality of encode layers comprises a first encode layer comprising a first number of neurons, and second encode layer comprising a second number of neurons, and a third encode layer comprising a third number of neurons; the first number is larger than the second number; and the second number is larger than the third number.

12. The system of claim 9, wherein: the plurality of decode layers comprises a first decode layer comprising a first number of neurons, and second decode layer comprising a second number of neurons, and a third decode layer comprising a third number of neurons; the third number is larger than the second number; and the second number is larger than the first number.

13. The system of claim 8, wherein: the auto-encoder processor is configured to invalidate the character classification result in response to determining that the character classification result is a false positive; invalidating the character classification result comprises one or more of: removing the character classification result from result data, marking the character classification result as a false positive, and recording that the character classification result is a false positive on a non-transitory storage medium.

14. The system of claim 8, further comprising an auto-encoder network configuration engine adapted to train the encode layers and the decode layers to replicate a target character, the target character corresponding to a character of the character classification result.

15. The image processing system of claim 8, further comprising an image capture device operably coupled with the auto-encoder processor and configured to capture the input image data.

16. A non-transitory computer-readable storage medium comprising instructions configured to cause a computing device to perform operations for improving accuracy of a character recognition system, the operations comprising: receiving a detection result indicating detection of a character at a designated location within a captured image; acquiring input image data corresponding to the designated location within the captured image; configuring an auto-encoder network to replicate a target character corresponding to the character of the detection result; reconstr
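
Claims 5 to 7 describe checking several crops around the reported coordinates and keeping the smallest reconstruction error, with a verification threshold tied to ground-truth images. The sketch below is one possible reading, not the patent's implementation. The auto-encoder is again a hypothetical stand-in (`toy_reconstruct`), the search window is simplified to a fixed pixel `margin`, and `threshold_from_ground_truth` with its `slack` factor is an assumed calibration rule that the claims do not spell out.

```python
import math
from typing import Callable, List

Image = List[List[float]]  # grayscale image, row-major, values in [0, 1]


def rmse(a: Image, b: Image) -> float:
    """Root-mean-square error between two equally sized patches."""
    squared = [(x - y) ** 2 for row_a, row_b in zip(a, b)
               for x, y in zip(row_a, row_b)]
    return math.sqrt(sum(squared) / len(squared))


def crop(image: Image, top: int, left: int, height: int, width: int) -> Image:
    return [row[left:left + width] for row in image[top:top + height]]


def min_reconstruction_error(image: Image, top: int, left: int,
                             height: int, width: int,
                             reconstruct: Callable[[Image], Image],
                             margin: int = 1) -> float:
    """Smallest RMSE over all crops shifted by up to `margin` pixels
    around (top, left). Crops falling outside the image are skipped;
    the reported position itself is assumed to lie inside the image."""
    errors = []
    for dy in range(-margin, margin + 1):
        for dx in range(-margin, margin + 1):
            t, l = top + dy, left + dx
            if t < 0 or l < 0:
                continue
            if t + height > len(image) or l + width > len(image[0]):
                continue
            patch = crop(image, t, l, height, width)
            errors.append(rmse(patch, reconstruct(patch)))
    return min(errors)


def threshold_from_ground_truth(ground_truth: List[Image],
                                reconstruct: Callable[[Image], Image],
                                slack: float = 1.5) -> float:
    """ASSUMPTION: threshold = slack * worst ground-truth error."""
    return slack * max(rmse(g, reconstruct(g)) for g in ground_truth)


# ASSUMPTION: stand-in for a trained auto-encoder (fixed 3x3 stroke).
TEMPLATE: Image = [[0.0, 1.0, 0.0], [0.0, 1.0, 0.0], [0.0, 1.0, 0.0]]


def toy_reconstruct(patch: Image) -> Image:
    return [row[:] for row in TEMPLATE]


# The stroke actually sits at (top=1, left=1); the classifier reported (0, 0).
IMAGE: Image = [[0.0, 0.0, 0.0, 0.0, 0.0],
                [0.0, 0.0, 1.0, 0.0, 0.0],
                [0.0, 0.0, 1.0, 0.0, 0.0],
                [0.0, 0.0, 1.0, 0.0, 0.0],
                [0.0, 0.0, 0.0, 0.0, 0.0]]
GROUND_TRUTH: List[Image] = [[[0.1, 0.9, 0.0], [0.0, 1.0, 0.1], [0.0, 0.9, 0.0]]]

if __name__ == "__main__":
    limit = threshold_from_ground_truth(GROUND_TRUTH, toy_reconstruct)
    for margin in (0, 1):
        err = min_reconstruction_error(IMAGE, 0, 0, 3, 3, toy_reconstruct, margin)
        print("margin", margin, "accepted:", err <= limit)
```

Taking the minimum over shifted crops makes the check tolerant of small localization offsets: a correct detection reported one pixel off would otherwise produce a large error and be rejected.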

Assignees

Datalogic Usa Inc

Inventors

Classifications

  • Validation; Performance evaluation

  • Detection or correction of errors, e.g. by rescanning the pattern

  • Classification techniques

  • using neural networks

  • Validation; Performance evaluation; Active pattern learning techniques

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10810465B2 cover?
An auto-encoder is configured to verify character detection and/or classification results generated by an automated optical character recognition system. The auto-encoder may be trained to reconstruct visual representations of the detected character, and a determination of whether the character detection result comprises a true positive or false positive may be based on a reconstruction error b…
Who is the assignee on this patent?
Datalogic Usa Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Oct 20, 2020 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).