Character segmentation and recognition method

US10430681B2 · US · B2

Patent metadata
Publication number: US-10430681-B2
Application number: US-201515549126-A
Country: US
Kind code: B2
Filing date: Jun 26, 2015
Priority date: Feb 13, 2015
Publication date: Oct 1, 2019
Grant date: Oct 1, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided is a character segmentation and recognition method. The method includes: collecting image data to obtain a to-be-recognized image; positioning a character line candidate region on the to-be-recognized image; obtaining pre-set character line prior information, where the character line prior information includes the number of characters, character spacing and a character size; obtaining a corresponding segmentation point template based on the character line prior information; obtaining credible degrees of different positions on the character line candidate region traversed by the segmentation point template; determining a position with the highest credible degree as an optimal segmentation position; segmenting the character line candidate region based on the segmentation point template and the optimal segmentation position to obtain multiple single character regions; and performing character recognition on each of the single character regions to obtain a corresponding recognition result.
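
The traversal described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: it assumes the character line has already been reduced to a per-column count of dark pixels (`column_ink`), that all characters share one width and one spacing, and it substitutes a toy score (`ink_inside`) for the credible degree. All function and variable names are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int]  # (left, right) columns of one character frame, right exclusive


def build_template(num_chars: int, char_width: int, spacing: int) -> List[Box]:
    """Build character frames from prior information (count, size, spacing)."""
    pitch = char_width + spacing
    return [(i * pitch, i * pitch + char_width) for i in range(num_chars)]


def ink_inside(column_ink: Sequence[int], template: List[Box], offset: int) -> float:
    """Toy stand-in for the credible degree: total ink covered by the frames."""
    return float(sum(sum(column_ink[offset + l: offset + r]) for l, r in template))


def best_offset(
    column_ink: Sequence[int],
    template: List[Box],
    score: Callable[[Sequence[int], List[Box], int], float],
) -> int:
    """Slide the template across the line; return the offset with the highest score."""
    last_offset = len(column_ink) - template[-1][1]
    if last_offset < 0:
        raise ValueError("template is wider than the candidate region")
    return max(range(last_offset + 1), key=lambda off: score(column_ink, template, off))


def segment(template: List[Box], offset: int) -> List[Box]:
    """Shift the template frames to the chosen offset to get single-character regions."""
    return [(offset + l, offset + r) for l, r in template]


# Synthetic line: three 3-column characters starting at column 2, one blank column apart.
column_ink = [0, 0, 5, 5, 5, 0, 5, 5, 5, 0, 5, 5, 5, 0, 0]
template = build_template(num_chars=3, char_width=3, spacing=1)
offset = best_offset(column_ink, template, ink_inside)
regions = segment(template, offset)
```

On this synthetic line the highest score occurs where all three frames sit exactly over the inked columns, and `regions` holds the column range of each single character, ready to be passed to a recognizer.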

First claim

Opening claim text (preview).

The invention claimed is:

1. A character segmentation and recognition method, comprising: collecting image data to obtain a to-be-recognized image; positioning a character line candidate region on the to-be-recognized image, wherein the character line candidate region comprises a plurality of characters which do not overlap with each other; obtaining pre-set character line prior information, wherein the character line prior information comprises a quantity of the plurality of characters, a spacing among the plurality of characters, and a size of the plurality of characters; obtaining a segmentation point template based on the character line prior information, wherein the segmentation point template comprises boundary frames of each of the plurality of characters; traversing the segmentation point template within the character line candidate region, to obtain credible degrees of different positions in the character line candidate region; determining an optimal segmentation position from the different positions, wherein the credible degree of the optimal segmentation position is largest among the credible degrees of the different positions; segmenting the character line candidate region based on the segmentation point template and the optimal segmentation position to obtain a plurality of single-character regions which do not overlap with each other, wherein each of the plurality of single-character regions comprises one of the plurality of characters; and performing character recognition on each of the plurality of single-character regions to obtain a corresponding recognition result; wherein traversing the segmentation point template within the character line candidate region, to obtain credible degrees of different positions in the character line candidate region comprises: recording position information of the segmentation point template, in response to the segmentation point being traversed to each of the different positions; and obtaining the credible degrees corresponding to the position information; and wherein the obtaining the credible degrees corresponding to the position information comprises: obtaining the first number of first effective pixel points of the character line candidate region on a left boundary of a character segmentation point, wherein the segmentation point template comprises a plurality of character segmentation points, each of the character segmentation points corresponds to one character on a character line, each of the character segmentation points comprises the left boundary and a right boundary, and the first effective pixel points are pixel points with gray values in a pre-set first threshold range; obtaining a first evaluation value based on the first number and a pre-set first weight; obtaining the second number of second effective pixel points of the character line candidate region on the right boundary of the character segmentation point, wherein the second effective pixel points are pixel points with gray values in a pre-set second threshold range; obtaining a second evaluation value based on the second number and a pre-set second weight; obtaining the third number of third effective pixel points of the character line candidate region within the segmentation point template, wherein the third effective pixel points are pixel points with gray values in a pre-set third threshold range; obtaining a third evaluation value based on the third number and a pre-set third weight; and obtaining the credible degrees corresponding to the position information based on the first evaluation value, the second evaluation value and the third evaluation value.

2. The method according to claim 1, wherein the obtaining the pre-set character line prior information comprises: obtaining the pre-set character line prior information based on a type of the to-be-recognized image; or obtaining the pre-set character line prior information based on an orientation feature of the character line candidate region.

3. The method according to claim 2, wherein after obtaining the corresponding segmentation point template based on the character line prior information, the method further comprises: obtaining a binary image of the to-be-recognized image; obtaining a projection waveform of the binary image in a horizontal direction; and updating an upper boundary and a lower boundary of the segmentation point template based on the projection waveform.

4. The method according to claim 2, wherein the positioning the character line candidate region on the to-be-recognized image comprises: positioning a region of interest where a character line locates on the to-be-recognized image; performing affine transformation and bilinear interpolation on the region of interest to obtain a correction image; and positioning the character line candidate region on the correction image.

5. The method according to claim 2, wherein the performing character recognition on each of the single character regions to obtain the corresponding recognition result comprises: obtaining character feature data in the single character region; and performing character recognition based on the character feature data to obtain the recognition result.

6. The method according to claim 1, wherein the obtaining the corresponding segmentation point template based on the character line prior information comprises: creating and initializing the segmentation point template based on the character line prior information; or obtaining the corresponding segmentation point template based on the character line prior information and a pre-set segmentation index table, wherein the segmentation index table comprises a correspondence between the character line prior information and the segmentation point template.

7. The method according to claim 6, wherein after obtaining the corresponding segmentation point template based on the character line prior information, the method further comprises: obtaining a binary image of the to-be-recognized image; obtaining a projection waveform of the binary image in a horizontal direction; and updating an upper boundary and a lower boundary of the segmentation point template based on the projection waveform.

8. The method according to claim 6, wherein the positioning the character line candidate region on the to-be-recognized image comprises: positioning a region of interest where a character line locates on the to-be-recognized image; performing affine transformation and bilinear interpolation on the region of interest to obtain a correction image; and positioning the character line candidate region on the correction image.

9. The method according to claim 6, wherein the performing character recognition on each of the single character regions to obtain the corresponding recognition result comprises: obtaining character feature data in the single character region; and performing character recognition based on the character feature data to obtain the recognition result.

10. The method according to claim 1, wherein after obtaining the corresponding segmentation point template based on the character line prior information, the method further comprises: obtaining a binary image of the to-be-recognized image; obtaining a projection waveform of the binary image in a horizontal direction; and updating an upper boundary and a lower boundary of the segmentation point template based on the projection waveform.

11. The method according to claim 10, wherein the updating the upper boundary and the lower boundary of the segmentation point template based on the projection waveform comprises: determining and updating the uppermost position at which a grey value i
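
The three-part credible degree recited in claim 1 can be illustrated with a small sketch. The claim only says each evaluation value is obtained "based on" a pixel count and a pre-set weight, and that the credible degree is "based on" the three evaluation values; the product and the sum used here are assumptions, as are the threshold ranges, the weights, the reading of "within the segmentation point template" as the columns strictly between the two boundaries, and every function name.

```python
from typing import List, Sequence, Tuple

Range = Tuple[int, int]  # inclusive gray-value range
Frame = Tuple[int, int]  # (left boundary column, right boundary column)
Image = Sequence[Sequence[int]]  # rows of gray values, 0 = black, 255 = white


def count_in_range(pixels: Sequence[int], rng: Range) -> int:
    """Count pixels whose gray value lies in the inclusive range."""
    lo, hi = rng
    return sum(1 for p in pixels if lo <= p <= hi)


def column(image: Image, x: int) -> List[int]:
    """Return column x of the image as a list of gray values."""
    return [row[x] for row in image]


def credibility(
    image: Image,
    template: List[Frame],
    offset: int,
    ranges: Tuple[Range, Range, Range],
    weights: Tuple[float, float, float],
) -> float:
    """Weighted count of effective pixels on left boundaries, right boundaries, interiors."""
    r1, r2, r3 = ranges
    w1, w2, w3 = weights
    n1 = n2 = n3 = 0
    for left, right in template:
        l, r = offset + left, offset + right
        n1 += count_in_range(column(image, l), r1)  # first number: left boundary
        n2 += count_in_range(column(image, r), r2)  # second number: right boundary
        for x in range(l + 1, r):  # third number: interior columns (assumed meaning)
            n3 += count_in_range(column(image, x), r3)
    # Assumed combination: evaluation value = weight * count, credible degree = sum.
    return w1 * n1 + w2 * n2 + w3 * n3


# Two-row synthetic line: ink (0) at columns 2-3 and 6-7, background (255) elsewhere.
row = [255, 255, 0, 0, 255, 255, 0, 0, 255]
image = [row, row]
template = [(0, 3), (4, 7)]  # two frames, each with two interior columns
ranges = ((200, 255), (200, 255), (0, 80))  # boundaries expect background, interior ink
weights = (1.0, 1.0, 1.0)
scores = [credibility(image, template, off, ranges, weights) for off in (0, 1)]
best = max((0, 1), key=lambda off: scores[off])
```

With these illustrative settings a position scores well when both boundaries of every frame fall on background and the interior falls on ink, so the offset that frames the characters cleanly receives the larger value.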

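Claims 3, 7 and 10 update the template's upper and lower boundaries from a horizontal projection of a binary image. The sketch below shows one plausible reading, assuming a fixed binarization threshold and taking the first and last rows that contain at least `min_count` foreground pixels as the new boundaries. The threshold, `min_count`, and the function names are hypothetical, and the exact rule in the truncated claim 11 is not reproduced here.

```python
from typing import List, Optional, Sequence, Tuple


def binarize(image: Sequence[Sequence[int]], threshold: int = 128) -> List[List[int]]:
    """Mark pixels darker than the threshold as foreground (1), others as background (0)."""
    return [[1 if p < threshold else 0 for p in row] for row in image]


def horizontal_projection(binary: Sequence[Sequence[int]]) -> List[int]:
    """Sum foreground pixels along each row, giving one value per image row."""
    return [sum(row) for row in binary]


def vertical_bounds(projection: Sequence[int], min_count: int = 1) -> Optional[Tuple[int, int]]:
    """Return (top, bottom) rows whose projection reaches min_count, or None if none do."""
    rows = [y for y, count in enumerate(projection) if count >= min_count]
    if not rows:
        return None
    return rows[0], rows[-1]


# Synthetic 5x4 image: blank first and last rows, ink in the three middle rows.
image = [
    [255, 255, 255, 255],
    [0, 255, 255, 0],
    [255, 0, 255, 255],
    [0, 0, 255, 255],
    [255, 255, 255, 255],
]
projection = horizontal_projection(binarize(image))
bounds = vertical_bounds(projection)
```

The returned pair can replace the template's upper and lower boundaries so that each single-character region excludes blank rows above and below the text.
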
Assignees

Grg Banking Equipment Co Ltd

Inventors

Classifications

  • G06V30/158 · Primary

    using character size, text spacings or pitch estimation · CPC title

  • using recognition of characters or words · CPC title

  • Physics · mapped topic

  • G06K9/344 · Primary

    Physics · mapped topic

  • Physics · mapped topic

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10430681B2 cover?
Provided is a character segmentation and recognition method. The method includes: collecting image data to obtain a to-be-recognized image; positioning a character line candidate region on the to-be-recognized image; obtaining pre-set character line prior information, where the character line prior information includes the number of characters, character spacing and a character size; obtaining …
Who is the assignee on this patent?
Grg Banking Equipment Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06V30/158. Mapped technology areas include Physics.
When was this patent published?
Publication date Oct 1, 2019 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).