Method, device, and computer-readable storage medium for area extraction

US10127471B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10127471-B2
Application numberUS-201615298352-A
CountryUS
Kind codeB2
Filing dateOct 20, 2016
Priority dateOct 30, 2015
Publication dateNov 13, 2018
Grant dateNov 13, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for area extraction includes obtaining an area position of a first information area in a document image, determining a second information area according to the area position of the first information area, and segmenting the second information area to obtain at least one character zone.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for area extraction, comprising: obtaining an area position of a first information area in a document image; determining a second information area according to the area position of the first information area; and segmenting the second information area to obtain at least one character zone, wherein segmenting the second information area comprises: performing a binarization processing on the second information area to obtain a binarized second information area; calculating a horizontal histogram of the binarized second information area in a horizontal direction, a vertical direction in the horizontal histogram representing vertical coordinates of pixels in the binarized second information area, and a horizontal direction in the horizontal histogram representing a number of foreground color pixels in each row of pixels; recognizing one or more text rows based on consecutive row sets in the horizontal histogram, each of the consecutive row sets including consecutive rows each having the number of foreground color pixels larger than a first threshold; calculating a vertical histogram for a target text row in the one or more text rows in a vertical direction, a horizontal direction in the vertical histogram representing horizontal coordinates of the pixels in the target text row, and a vertical direction in the vertical histogram representing the number of foreground color pixels in each column of pixels; and recognizing the at least one character zone based on consecutive column sets in the vertical histogram, each of the consecutive column sets including consecutive columns each having the number of foreground color pixels larger than a second threshold, and wherein the method further comprises: recognizing a row spacing between two adjacent text rows based on the consecutive row sets in the horizontal histogram; and discarding, if the row spacing is larger than a third threshold, one of the two adjacent text rows that is closer to an upper edge or a lower edge of the second information area. 2. The method of claim 1 , wherein: obtaining the area position includes: obtaining two vertex coordinate pairs representing the area position, and determining the second information area includes: determining the second information area according to the two vertex coordinate pairs and a predetermined relative position relationship between the two vertex coordinate pairs and the second information area. 3. The method of claim 2 , wherein determining the second information area according to the two vertex coordinate pairs and the predetermined relative position relationship includes: determining a lower edge of the second information area according to a vertical coordinate in a closer vertex coordinate pair of the two vertex coordinate pairs that is closer to the second information area; determining an upper edge of the second information area according to the vertical coordinate in the closer vertex coordinate pair and a predetermined height; determining a left edge of the second information area according to a horizontal coordinate of any one of the two vertex coordinate pairs and a first predetermined width; determining a right edge of the second information area according to a horizontal coordinate of any one of the two vertex coordinate pairs and a second predetermined width; and determining the second information area according to the lower edge, the upper edge, the left edge, and the right edge. 4. The method of claim 1 , further comprising: recognizing a character spacing between two adjacent character zones based on the consecutive column sets in the vertical histogram; recognizing, if the two adjacent character zones are on the left of the second information area and the character spacing is larger than a fourth threshold, a right one of the two adjacent character zones as a first character zone in the target text row; and recognizing, if the two adjacent character zones are on the right of the second information area and the character spacing is larger than a fifth threshold, a left one of the two adjacent character zones as a last character zone in the target text row. 5. A device for area extraction, comprising: a processor; and a memory storing instructions that, when executed by the processor, cause the processor to: obtain an area position of a first information area in a document image; determine a second information area according to the area position of the first information area; and segment the second information area to obtain at least one character zone, wherein in segmenting the second information area to obtain at least one character zone, the instructions further cause the processor to: perform a binarization processing on the second information area to obtain a binarized second information area; calculate a horizontal histogram of the binarized second information area in a horizontal direction, a vertical direction in the horizontal histogram representing vertical coordinates of pixels in the binarized second information area, and a horizontal direction in the horizontal histogram representing a number of foreground color pixels in each row of pixels; recognize one or more text rows based on consecutive row sets in the horizontal histogram, each of the consecutive row sets including consecutive rows each having the number of foreground color pixels larger than a first threshold; calculate a vertical histogram for a target text row in the one or more text rows in a vertical direction, a horizontal direction in the vertical histogram representing horizontal coordinates of the pixels in the target text row, and a vertical direction in the vertical histogram representing the number of foreground color pixels in each column of pixels; and recognize the at least one character zone based on consecutive column sets in the vertical histogram, each of the consecutive column sets including consecutive columns each having the number of foreground color pixels larger than a second threshold, and wherein the instructions further cause the processor to: recognize a row spacing between two adjacent text rows based on the consecutive row sets in the horizontal histogram; and discard, if the row spacing is larger than a third threshold, one of the two adjacent text rows that is closer to an upper edge or a lower edge of the second information area. 6. The device of claim 5 , wherein the instructions further cause the processor to: obtain two vertex coordinate pairs representing the area position; and determine the second information area according to the two vertex coordinate pairs and a predetermined relative position relationship between the two vertex coordinate pairs and the second information area. 7. The device of claim 6 , wherein the instructions further cause the processor to: determine a lower edge of the second information area according to a vertical coordinate in a closer vertex coordinate pair of the two vertex coordinate pairs that is closer to the second information area; determine an upper edge of the second information area according to the vertical coordinate in the closer vertex coordinate pair and a predetermined height; determine a left edge of the second information area according to a horizontal coordinate of any one of the two vertex coordinate pairs and a first predetermined width; determine a right edge of the second information area according to a horizontal coordinate of any one of the two vertex coordinate pairs and a second predetermined width; and determine the second information area according to the lower edge, the upper edge, the left edge, and the right edge. 8. The device of claim 5 , wherein the instructions further cause the proce

Assignees

Inventors

Classifications

  • G06V30/412Primary

    Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables · CPC title

  • Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion · CPC title

  • by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition · CPC title

  • Summing image-intensity values; Histogram projection analysis · CPC title

  • Region-based segmentation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10127471B2 cover?
A method for area extraction includes obtaining an area position of a first information area in a document image, determining a second information area according to the area position of the first information area, and segmenting the second information area to obtain at least one character zone.
Who is the assignee on this patent?
Xiaomi Inc
What technology area does this patent fall under?
Primary CPC classification G06V30/412. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 13 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).