Mobile document detection and orientation based on reference object characteristics
US-9760788-B2 · Sep 12, 2017 · US
US10423851B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10423851-B2 |
| Application number | US-201815908308-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 28, 2018 |
| Priority date | Feb 28, 2018 |
| Publication date | Sep 24, 2019 |
| Grant date | Sep 24, 2019 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Speed and accuracy of character recognition can be improved by isolating text orientation during an early stage of processing an image containing a mixture of horizontal and vertical text. Vertical and horizontal line bounding boxes are defined from characters in the image. In a section of the image containing horizontal text, vertical line bounding boxes may tend to be larger and/or spaced close together due to misalignment of characters. For the same reason, horizontal line bounding boxes may tend to be larger and/or spaced closed together in a section of the image containing vertical text. Such variations in size and/or spacing may be used to identify a division between the horizontal and vertical text. A subsequent character recognition process may take advantage of a known division to conserve computing resources.
Opening claim text (preview).
What is claimed is: 1. A method for processing an image containing horizontal text arranged in a horizontal direction and vertical text arranged in a vertical direction, the method comprising: determining a horizontal score for a plurality of horizontal line bounding boxes, the horizontal score determined from one or both of size and spacing of the horizontal line bounding boxes, the plurality of horizontal line bounding boxes defined from character connected components in the horizontal text and the vertical text, each of the horizontal line bounding boxes occupying a position interval in the vertical direction; determining a vertical score for a plurality of vertical line bounding boxes, the vertical score determined from one or both of size and spacing of the vertical line bounding boxes, the plurality of vertical line bounding boxes defined from the character connected components in the horizontal text and the vertical text, each of the vertical line bounding boxes occupying a position interval in the horizontal direction; comparing the horizontal score and the vertical score; and determining from the comparison of the horizontal and vertical scores, either a position of a horizontal division between the horizontal and vertical texts or a position of a vertical division between the horizontal and vertical texts. 2. The method of claim 1 , wherein the determining of a horizontal score comprises for each of the position intervals of the horizontal line bounding boxes, determining whether the position interval is a positive position interval or a negative position interval, the position interval of the horizontal line bounding box determined to be a positive position interval on condition that the position interval satisfies one or both of a horizontal evaluating size range and a horizontal evaluating spacing minimum, determining an overall proportion of positive position intervals among the horizontal line bounding boxes, and deriving the horizontal score from the overall proportion of positive position intervals among the horizontal line bounding boxes; and the determining of a vertical score comprises for each of the position intervals of the vertical line bounding boxes, determining whether the position interval is a positive position interval or a negative position interval, the position interval of the vertical line bounding box determined to be a positive position interval on condition that the position interval satisfies one or both of a vertical evaluating size range and a vertical evaluating spacing minimum, determining an overall proportion of positive position intervals among the vertical line bounding boxes, and deriving the vertical score from the overall proportion of positive position intervals among the vertical line bounding boxes. 3. The method of claim 2 , further comprising: using a median size of the position intervals of the horizontal line bounding boxes to determine the horizontal evaluating size range; and using a median size of the position intervals of the vertical line bounding boxes to determine the vertical evaluating size range. 4. The method of claim 2 , wherein the determining of a position of a horizontal division between the horizontal and vertical texts comprises: identifying a discordant pair of the position intervals among the horizontal line bounding boxes, the discordant pair having a positive position interval and a negative position interval adjacent to the positive position interval, there being a gap interval between the positive position interval and the negative position interval, there being two interval regions each located on opposite sides of the gap interval; identifying a target interval region from among the two interval regions located on opposite sides of the interval gap, the target interval region having a lower number of positive position intervals than the interval region on the opposite side, determining a reference score being a proportion of positive position intervals among the horizontal line bounding boxes in the target interval region, determining a comparative score being a proportion of positive position intervals among the vertical line bounding boxes in the target interval region, and determining the position of the horizontal division as being within the gap interval when the comparative score is greater than the reference score; and the determining of a position of a vertical division between the horizontal and vertical texts comprises: identifying a discordant pair of the position intervals among the vertical line bounding boxes, each discordant pair having a positive position interval and a negative position interval adjacent to the positive position interval, there being a gap interval between the positive position interval and the negative position interval, there being two interval regions each located on opposite sides of the gap interval, identifying a target interval region from among the two interval regions located on opposite sides of the interval gap, the target interval region having a lower number of positive position intervals than the interval region on the opposite side, determining a reference score being a proportion of positive position intervals among the vertical line bounding boxes in the target interval region, determining a comparative score being a proportion of positive position intervals among the horizontal line bounding boxes in the target interval region, and determining the position of the vertical division as being within the gap interval when the comparative score is greater than the reference score. 5. The method of claim 1 , further comprising: before the determining of the horizontal score, performing a horizontal line finding process to define the plurality of horizontal line bounding boxes, the horizontal line finding process comprising, for each of the character connected components: determining whether a horizontal gap between the character connected component and a target horizontal line bounding box satisfies a horizontal gap maximum, the target horizontal line bounding box being one among the plurality of horizontal line bounding boxes, determining whether a projected vertical overlap of the character connected component and the target horizontal line bounding box satisfies a projected vertical overlap minimum, and redefining the target horizontal line bounding box to include the character connected component on condition that both the horizontal gap maximum and the projected vertical overlap minimum are satisfied; and before the determining of the vertical score, performing a vertical line finding process to define the plurality of vertical line bounding boxes, the vertical line finding process comprising, for each of the character connected components: determining whether a vertical gap between the character connected component and a target vertical line bounding box satisfies a vertical gap maximum, the target vertical line bounding box being one among the plurality of vertical line bounding boxes, determining whether a projected horizontal overlap of the character connected component and the target vertical line bounding box satisfies a projected horizontal overlap minimum, and redefining the target vertical line bounding box to include the character connected component on condition that both the vertical gap maximum and the projected horizontal overlap minimum are satisfied. 6. An apparatus for processing an image with horizontal text arranged in a horizontal direction and vertical text arranged in a vertical direction, the apparatus comprising: a processor; and a memory in communication with the processor, the memory storing instructions, wherein the processor is configured to perform a process on the image according to the stor
Orientation detection or correction, e.g. rotation of multiples of 90 degrees · CPC title
Character recognition · CPC title
using recognition of characters or words · CPC title
Bounding box · CPC title
Drawing from basic elements · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.