Text recognition for textually sparse images

US9183452B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9183452-B2
Application numberUS-201414269777-A
CountryUS
Kind codeB2
Filing dateMay 5, 2014
Priority dateOct 29, 2009
Publication dateNov 10, 2015
Grant dateNov 10, 2015

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A text recognition server is configured to recognize text in a sparse text image. Specifically, given an image, the server specifies a plurality of “patches” (blocks of pixels within the image). The system applies a text detection algorithm to the patches to determine a number of the patches that contain text. This application of the text detection algorithm is used both to estimate the orientation of the image and to determine whether the image is textually sparse or textually dense. If the image is determined to be textually sparse, textual patches are identified and grouped into text regions, each of which is then separately processed by an OCR algorithm, and the recognized text for each region is combined into a result for the image as a whole.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method of recognizing text in an image, comprising: identifying a first plurality of patches within the image; applying, to each patch of the first plurality of patches, a first text detection algorithm that indicates whether a patch contains text, thereby identifying a first set of patches that contain text; determining whether the image represents sparse text or dense text based at least in part on the identified first set of patches; responsive to the image representing sparse text: identifying within the image, using a second text detection algorithm, a second set of patches that contain text, and performing optical character recognition on the second set of patches that contain text to obtain a textual result; responsive to the image representing dense text: performing optical character recognition on the image as a whole to produce the textual result; and storing the textual result. 2. The computer-implemented method of claim 1 , wherein determining that the image represents sparse text comprises: determining a number of patches within the identified first set of patches that contain text; and comparing the number of patches that contain text to a threshold, wherein the image is deemed to represent sparse text when the number of patches is below the threshold. 3. The computer-implemented method of claim 1 , wherein determining that the image represents sparse text comprises: determining an approximate number of characters in the image; comparing the approximate number of characters to a threshold, wherein the properly-oriented image is deemed to represent sparse text when the number of characters is below the threshold. 4. The computer-implemented method of claim 1 , further comprising: grouping patches of the second set of patches that contain text and are proximate to one another into a text region, wherein performing optical character recognition on the second set of patches that contain text to obtain the textual result comprises performing optical character recognition on the text region. 5. The computer-implemented method of claim 4 , wherein grouping patches of the second set of patches that are proximate to one another into the text region comprises combining overlapping patches and forming the text region from a union of areas of the combined patches. 6. The computer-implemented method of claim 1 , wherein the second set of patches comprises patches that overlap with each other or patches of different sizes. 7. A computer system for recognizing text in an image, comprising: at least one computer processor; and a computer program executable by the computer processor and performing actions comprising: identifying a first plurality of patches within the image, applying, to each patch of the first plurality of patches, a first text detection algorithm that indicates whether a patch contains text, thereby identifying a first set of patches that contain text, determining whether the image represents sparse text or dense text based at least in part on the identified first set of patches, responsive to the image representing sparse text: identifying within the image, using a second text detection algorithm, a second set of patches that contain text, performing optical character recognition on the second set of patches that contain text to obtain a textual result, responsive to the image representing dense text: performing optical character recognition on the image as a whole to produce the textual result, and storing the textual result. 8. The computer system of claim 7 , wherein determining that the image represents sparse text comprises: determining a number of patches within the identified first set of patches that contain text; and comparing the number of patches that contain text to a threshold, wherein the image is deemed to represent sparse text when the number of patches is below the threshold. 9. The computer system of claim 7 , wherein determining that the image represents sparse text comprises: determining an approximate number of characters in the image; comparing the approximate number of characters to a threshold, wherein the properly-oriented image is deemed to represent sparse text when the number of characters is below the threshold. 10. The computer system of claim 7 , wherein the actions further comprise: grouping patches of the second set of patches that contain text and are proximate to one another into a text region, wherein performing optical character recognition on the second set of patches that contain text to obtain the textual result comprises performing optical character recognition on the text region. 11. The computer system of claim 10 , wherein grouping patches of the second set of patches that are proximate to one another into the text region comprises combining overlapping patches and forming the text region from a union of areas of the combined patches. 12. The computer system of claim 7 , wherein the second set of patches comprises patches that overlap with each other or patches of different sizes. 13. A computer-implemented method of recognizing text in an image, comprising: identifying a first plurality of patches within the image; applying, to each patch of the first plurality of patches, a first text detection algorithm that indicates whether a patch contains text, thereby identifying a first set of patches that contain text; determining whether the image represents sparse text or dense text based at least in part on a number of patches within the identified first set of patches; when the image represents sparse text: identifying within the image, using a second text detection algorithm, a second set of patches that contain text, and performing optical character recognition on the second set of patches that contain text to obtain a textual result, and when the image represents dense text: performing optical character recognition on the image as a whole to obtain the textual result; and storing the textual result. 14. The computer-implemented method of claim 13 , wherein determining whether the image represents sparse text or dense text comprises: determining a number of patches within the identified first set of patches that contain text; and comparing the number of patches that contain text to a threshold, wherein the image is deemed to represent sparse text when the number of patches is below the threshold and is deemed to represent dense text when the number of patches is above the threshold. 15. The computer-implemented method of claim 13 , wherein determining whether the image represents sparse text or dense text comprises: determining an approximate number of characters in the image; and comparing the approximate number of characters to a threshold, wherein the image is deemed to represent sparse text when the approximate number of characters is below the threshold and is deemed to represent dense text when the approximate number of characters is above the threshold. 16. The computer-implemented method of claim 13 , wherein performing optical character recognition on the second set of patches that contain text to obtain a textual result comprises: grouping patches of the second set of patches that contain text and are proximate to one another into a text region; and performing optical character recognition on the text region. 17. The computer-implemented method of claim 16 , wherein grouping patches of the second set of patches that are proximate to one another into the text region comprises combini

Assignees

Inventors

Classifications

  • Orientation detection or correction, e.g. rotation of multiples of 90 degrees · CPC title

  • Cutting or merging image elements, e.g. region growing, watershed or clustering-based techniques · CPC title

  • Determination of region of interest · CPC title

  • Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion · CPC title

  • Coarse or fine approaches, e.g. resolution of ambiguities or multiscale approaches · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9183452B2 cover?
A text recognition server is configured to recognize text in a sparse text image. Specifically, given an image, the server specifies a plurality of “patches” (blocks of pixels within the image). The system applies a text detection algorithm to the patches to determine a number of the patches that contain text. This application of the text detection algorithm is used both to estimate the orienta…
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
Primary CPC classification G06V30/1463. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 10 2015 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).