Image reading apparatus that aligns directions of document images, image reading method, image forming apparatus, and recording medium
US-10482338-B2 · Nov 19, 2019 · US
US9183452B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9183452-B2 |
| Application number | US-201414269777-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 5, 2014 |
| Priority date | Oct 29, 2009 |
| Publication date | Nov 10, 2015 |
| Grant date | Nov 10, 2015 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A text recognition server is configured to recognize text in a sparse text image. Specifically, given an image, the server specifies a plurality of “patches” (blocks of pixels within the image). The system applies a text detection algorithm to the patches to determine a number of the patches that contain text. This application of the text detection algorithm is used both to estimate the orientation of the image and to determine whether the image is textually sparse or textually dense. If the image is determined to be textually sparse, textual patches are identified and grouped into text regions, each of which is then separately processed by an OCR algorithm, and the recognized text for each region is combined into a result for the image as a whole.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method of recognizing text in an image, comprising: identifying a first plurality of patches within the image; applying, to each patch of the first plurality of patches, a first text detection algorithm that indicates whether a patch contains text, thereby identifying a first set of patches that contain text; determining whether the image represents sparse text or dense text based at least in part on the identified first set of patches; responsive to the image representing sparse text: identifying within the image, using a second text detection algorithm, a second set of patches that contain text, and performing optical character recognition on the second set of patches that contain text to obtain a textual result; responsive to the image representing dense text: performing optical character recognition on the image as a whole to produce the textual result; and storing the textual result. 2. The computer-implemented method of claim 1 , wherein determining that the image represents sparse text comprises: determining a number of patches within the identified first set of patches that contain text; and comparing the number of patches that contain text to a threshold, wherein the image is deemed to represent sparse text when the number of patches is below the threshold. 3. The computer-implemented method of claim 1 , wherein determining that the image represents sparse text comprises: determining an approximate number of characters in the image; comparing the approximate number of characters to a threshold, wherein the properly-oriented image is deemed to represent sparse text when the number of characters is below the threshold. 4. The computer-implemented method of claim 1 , further comprising: grouping patches of the second set of patches that contain text and are proximate to one another into a text region, wherein performing optical character recognition on the second set of patches that contain text to obtain the textual result comprises performing optical character recognition on the text region. 5. The computer-implemented method of claim 4 , wherein grouping patches of the second set of patches that are proximate to one another into the text region comprises combining overlapping patches and forming the text region from a union of areas of the combined patches. 6. The computer-implemented method of claim 1 , wherein the second set of patches comprises patches that overlap with each other or patches of different sizes. 7. A computer system for recognizing text in an image, comprising: at least one computer processor; and a computer program executable by the computer processor and performing actions comprising: identifying a first plurality of patches within the image, applying, to each patch of the first plurality of patches, a first text detection algorithm that indicates whether a patch contains text, thereby identifying a first set of patches that contain text, determining whether the image represents sparse text or dense text based at least in part on the identified first set of patches, responsive to the image representing sparse text: identifying within the image, using a second text detection algorithm, a second set of patches that contain text, performing optical character recognition on the second set of patches that contain text to obtain a textual result, responsive to the image representing dense text: performing optical character recognition on the image as a whole to produce the textual result, and storing the textual result. 8. The computer system of claim 7 , wherein determining that the image represents sparse text comprises: determining a number of patches within the identified first set of patches that contain text; and comparing the number of patches that contain text to a threshold, wherein the image is deemed to represent sparse text when the number of patches is below the threshold. 9. The computer system of claim 7 , wherein determining that the image represents sparse text comprises: determining an approximate number of characters in the image; comparing the approximate number of characters to a threshold, wherein the properly-oriented image is deemed to represent sparse text when the number of characters is below the threshold. 10. The computer system of claim 7 , wherein the actions further comprise: grouping patches of the second set of patches that contain text and are proximate to one another into a text region, wherein performing optical character recognition on the second set of patches that contain text to obtain the textual result comprises performing optical character recognition on the text region. 11. The computer system of claim 10 , wherein grouping patches of the second set of patches that are proximate to one another into the text region comprises combining overlapping patches and forming the text region from a union of areas of the combined patches. 12. The computer system of claim 7 , wherein the second set of patches comprises patches that overlap with each other or patches of different sizes. 13. A computer-implemented method of recognizing text in an image, comprising: identifying a first plurality of patches within the image; applying, to each patch of the first plurality of patches, a first text detection algorithm that indicates whether a patch contains text, thereby identifying a first set of patches that contain text; determining whether the image represents sparse text or dense text based at least in part on a number of patches within the identified first set of patches; when the image represents sparse text: identifying within the image, using a second text detection algorithm, a second set of patches that contain text, and performing optical character recognition on the second set of patches that contain text to obtain a textual result, and when the image represents dense text: performing optical character recognition on the image as a whole to obtain the textual result; and storing the textual result. 14. The computer-implemented method of claim 13 , wherein determining whether the image represents sparse text or dense text comprises: determining a number of patches within the identified first set of patches that contain text; and comparing the number of patches that contain text to a threshold, wherein the image is deemed to represent sparse text when the number of patches is below the threshold and is deemed to represent dense text when the number of patches is above the threshold. 15. The computer-implemented method of claim 13 , wherein determining whether the image represents sparse text or dense text comprises: determining an approximate number of characters in the image; and comparing the approximate number of characters to a threshold, wherein the image is deemed to represent sparse text when the approximate number of characters is below the threshold and is deemed to represent dense text when the approximate number of characters is above the threshold. 16. The computer-implemented method of claim 13 , wherein performing optical character recognition on the second set of patches that contain text to obtain a textual result comprises: grouping patches of the second set of patches that contain text and are proximate to one another into a text region; and performing optical character recognition on the text region. 17. The computer-implemented method of claim 16 , wherein grouping patches of the second set of patches that are proximate to one another into the text region comprises combini
Orientation detection or correction, e.g. rotation of multiples of 90 degrees · CPC title
Cutting or merging image elements, e.g. region growing, watershed or clustering-based techniques · CPC title
Determination of region of interest · CPC title
Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion · CPC title
Coarse or fine approaches, e.g. resolution of ambiguities or multiscale approaches · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.