Ocr of text overlapping scenes through text graph structuring
US-2024265719-A1 · Aug 8, 2024 · US
US12205390B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12205390-B2 |
| Application number | US-202217661654-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 2, 2022 |
| Priority date | May 2, 2022 |
| Publication date | Jan 21, 2025 |
| Grant date | Jan 21, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A system for identifying handwritten characters on an image using a classification model that employs a neural network. The system includes a computer having a processor and a memory device that stores data and executable code that, when executed, causes the processor to read and convert typed text on the image to machine encoded text to identify locations of the typed text on the image; identify a location on the image that includes handwritten text based on the location of predetermined typed text on the image; identify clusters of non-white pixels in the image at the location having the handwritten text, where constraints are employed to refine and limit the clusters; generate an individual and separate cluster image for each identified cluster; and classify each cluster image using machine learning and at least one neural network to determine the likelihood that the cluster is a certain character.
Opening claim text (preview).
What is claimed is: 1. A system for identifying handwritten characters on an image, said system comprising: a computer including at least one processor and a memory device storing data and executable code that, when executed, causes the at least one processor to: read and convert typed text on the image to machine encoded text to identify locations of the typed text on the image; identify a location on the image that includes handwritten text based on the location of predetermined typed text on the image; identify clusters of non-white pixels in the image at the location having the handwritten text, where each cluster is presumed to be a handwritten character, wherein the at least one processor employs a constraint so that separate clusters are not identified in a stacked up/down direction relative to a writing direction of the handwritten text; generate an individual and separate cluster image for each identified cluster; classify each cluster image using machine learning and at least one neural network to determine the likelihood that the cluster is a certain character; and determine what character each cluster image is based on the classification. 2. The system according to claim 1 wherein the at least one processor employs a constraint that knits a size of each cluster to be less than a predetermined size. 3. The system according to claim 1 wherein the at least one processor employs a constraint that requires all of the clusters to be within a certain percentage size of each other. 4. The system according to claim 1 wherein the at least one processor reads and converts the typed text using an optical character recognition algorithm. 5. The system according to claim 1 wherein the at least one processor identifies clusters of non-white pixels using a density-based clustering algorithm. 6. The system according to claim 1 wherein the at least one processor rescales the cluster when it generates an individual and separate cluster image for each identified cluster. 7. The system according to claim 6 wherein the at least one processor rescales the cluster to a 28×28 pixel cluster image. 8. The system according to claim 1 wherein the at least one processor centers the cluster in the cluster image when it generates an individual and separate cluster image for each identified cluster. 9. The system according to claim 1 wherein the at least one processor classifies each cluster image by determining the likelihood that the cluster image is one of sixty-two character classes, namely, upper case letters A-Z, lower case letters a-z and numbers 0-9. 10. The system according to claim 1 wherein the at least one neural network is a convolutional neural network (CNN). 11. A method for identifying handwritten characters on an image, said method comprising: reading and converting typed text on the image to machine encoded text to identify locations of the typed text on the image; identifying a location on the image that includes handwritten text based on the location of predetermined typed text on the image; identifying clusters of non-white pixels in the image at the location having the handwritten text, where each cluster is presumed to be a handwritten character, wherein identifying clusters of non-white pixels includes employing a constraint so that separate clusters are not identified in a stacked up/down direction relative to a writing direction of the handwritten text; generating an individual and separate cluster image for each identified cluster; classifying each cluster image using machine learning and at least one neural network to determine the likelihood that the cluster is a certain character; and determining what character each cluster image is based on the classification. 12. The method according to claim 11 wherein identifying clusters of non-white pixels in the image includes employing a constraint that limits a size of each cluster to be less than a predetermined size. 13. The method according to claim 11 wherein identifying clusters of non-white pixels in the image includes employing a constraint that requires all of the clusters to be within a certain percentage size of each other. 14. The method according to claim 11 wherein reading and converting the typed text includes using an optical character recognition algorithm and identifying clusters of non-white pixels includes using a density-based clustering algorithm. 15. The method according to claim 11 wherein generating an individual and separate cluster image for each identified cluster includes rescaling each cluster. 16. The method according to claim 15 wherein each cluster is rescaled to a 28×28 pixel cluster image. 17. The method according to claim 11 wherein generating an individual and separate cluster image for each identified cluster includes centering the cluster in the image. 18. The method according to claim 11 wherein classifying each cluster image includes determining the likelihood that the cluster image is one of sixty-two character classes, namely, upper case letters A-Z, lower case letters a-z and numbers 0-9. 19. The method according to claim 11 wherein the at least one neural network is a convolutional neural network (CNN). 20. The method according to claim 11 wherein the image is an image of a check, the predetermined typed text is “pay to the order of” and the handwritten text is a payee on the check.
Non-supervised learning, e.g. competitive learning · CPC title
Cutting or merging image elements, e.g. region growing, watershed or clustering-based techniques · CPC title
of cursive writing · CPC title
Banking, e.g. interest calculation or account maintenance (credit or loans G06Q40/03) · CPC title
Machine learning · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.