Method of identifying pattern training need during verification of recognized text

US9613299B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9613299-B2
Application numberUS-201414567548-A
CountryUS
Kind codeB2
Filing dateDec 11, 2014
Priority dateJan 21, 2014
Publication dateApr 4, 2017
Grant dateApr 4, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and systems for performing character recognition of a document image include analyzing verification performed by a user on a recognized text obtained by character recognition of a document image, identifying analogous changes of a first incorrect character for a first correct character, and prompting the user to initiate a training of a recognition pattern based on the identified analogous changes.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computer-implemented method for text analysis, comprising: analyzing verification performed by a user on a recognized text obtained by character recognition of a document image, wherein the verification comprises changing an incorrect character identified by the user for a correct character identified by the user; identifying analogous changes of a first incorrect character for a first correct character; and initiating a training of a recognition pattern based on the identified analogous changes, wherein the recognition pattern is a pattern used in character recognition of the document image to generate the recognized text. 2. The computer-implemented method of claim 1 , wherein: the identifying analogous changes comprises tracking a number of analogous changes of the first incorrect character for the first correct character during the verification, and determining that the number of analogous changes of the first incorrect character for the first correct character has reached a predetermined threshold, and the initiating of the training of the recognition pattern is based on the determining that the number of analogous changes has reached the predetermined threshold. 3. The computer-implemented method of claim 1 , wherein the initiating the training of the recognition pattern comprises presenting the user with an option to initiate the training. 4. The computer-implemented method of claim 1 , wherein the initiating the training of the recognition pattern is automatic. 5. The computer-implemented method of claim 1 , further comprising repeating character recognition of the document image based on the trained recognition pattern. 6. The computer-implemented method of claim 1 , further comprising repeating character recognition of an unverified portion of the document image based on the trained recognition pattern. 7. The method of claim 1 , wherein identifying analogous changes of a first incorrect character for a first correct character comprises automatically tracking analogous changes during verification of recognition results. 8. Computer storage media encoded with one or more computer programs, the one or more computer programs comprising instructions that when executed by a data processing apparatus cause the data processing apparatus to perform operations comprising: analyzing verification performed by a user on a recognized text obtained by character recognition of a document image, wherein the verification comprises changing an incorrect character identified by the user for a correct character identified by the user; identifying analogous changes of a first incorrect character for a first correct character; and initiating a training of a recognition pattern based on the identified analogous changes, wherein the recognition pattern is a pattern used in character recognition of the document image to generate the recognized text. 9. The computer storage media of claim 8 , wherein: the identifying analogous changes comprises tracking a number of analogous changes of the first incorrect character for the first correct character during the verification, and determining that the number of analogous changes of the first incorrect character for the first correct character has reached a predetermined threshold, and the initiating of the training of the recognition pattern is based the determining that the number of analogous changes has reached the predetermined threshold. 10. The computer storage media of claim 8 , wherein the initiating the training of the recognition pattern comprises presenting the user with an option to initiate the training. 11. The computer storage media of claim 8 , wherein the initiating the training of the recognition pattern is automatic. 12. The computer storage media of claim 8 , further comprising repeating character recognition of the document image based on the trained recognition pattern. 13. The computer storage media of claim 8 , further comprising repeating character recognition of an unverified portion of the document image based on the trained recognition pattern. 14. A system, comprising: a computing device; and a computer-readable medium coupled to the computing device and having instructions stored thereon which, when executed by the computing device, cause the computing device to perform operations comprising: analyzing verification performed by a user on a recognized text obtained by character recognition of a document image, wherein the verification comprises changing an incorrect character identified by the user for a correct character identified by the user; identifying analogous changes of a first incorrect character for a first correct character; and initiating a training of a recognition pattern based on the identified analogous changes, wherein the recognition pattern is a pattern used in character recognition of the document image to generate the recognized text. 15. The system of claim 14 wherein: the identifying analogous changes comprises tracking a number of analogous changes of the first incorrect character for the first correct character during the verification, and determining that the number of analogous changes of the first incorrect character for the first correct character has reached a predetermined threshold, and the initiating of the training of the recognition pattern is based the determining that the number of analogous changes has reached the predetermined threshold. 16. The system of claim 14 , wherein the initiating the training of the recognition pattern comprises presenting the user with an option to initiate the training. 17. The system of claim 14 , wherein the initiating the training of the recognition pattern is automatic. 18. The system of claim 14 , further comprising repeating character recognition of the document image based on the trained recognition pattern. 19. The system of claim 14 , further comprising repeating character recognition of an unverified portion of the document image based on the trained recognition pattern.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9613299B2 cover?
Methods and systems for performing character recognition of a document image include analyzing verification performed by a user on a recognized text obtained by character recognition of a document image, identifying analogous changes of a first incorrect character for a first correct character, and prompting the user to initiate a training of a recognition pattern based on the identified analog…
Who is the assignee on this patent?
Abbyy Dev Llc
What technology area does this patent fall under?
Primary CPC classification G06V30/19167. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 04 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).