Identifying desynchronization between representations of data obtained from an ordered plurality of documents

US9959693B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9959693-B2
Application numberUS-201514814655-A
CountryUS
Kind codeB2
Filing dateJul 31, 2015
Priority dateJul 31, 2015
Publication dateMay 1, 2018
Grant dateMay 1, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In a method for identifying desynchronization between representations of data obtained from an ordered plurality of documents, a processor can receive ordered first and second pluralities of data strings obtained from the respective plurality of documents; compare each data string in the first plurality to the corresponding data string in the second plurality and to each data string sequentially before or sequentially after the corresponding data string in the second plurality; based on the comparison, designate each data string in the first plurality as being one of synchronized, leading, or trailing; identify a continuous sequence of N data strings in the first plurality that all have a designation of leading or all have a designation of trailing, where N equals or exceeds a specified sequence threshold; and generate a single error signal that identifies all N of the data strings in the continuous sequence as being desynchronized.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving ordered first and second pluralities of data strings obtained from an ordered plurality of documents, the ordered first plurality of data strings obtained from an optical scanner, the ordered second plurality of data strings obtained from a magnetic scanner; comparing each data string in the first plurality to a corresponding data string in the second plurality and to each data string sequentially before or sequentially after the corresponding data string in the second plurality; based on the comparing, designating each data string in the first plurality as being one of unknown, synchronized, leading, or trailing; when a continuous sequence of N data strings in the first plurality have a same designation as leading or trailing, generating an error signal associated with the first and second pluralities identifying all N of the data strings in the continuous sequence as being desynchronized; and making only documents associated with data strings in the first plurality that are synchronized available for viewing. 2. The method of claim 1 , further comprising: receiving an ordered plurality of images of the respective ordered plurality of documents; and performing optical character recognition on the ordered plurality of images to generate the ordered first plurality of data strings. 3. The method of claim 2 , further comprising: magnetically reading a magnetic ink recognition line from each of the plurality of documents to generate the second plurality of data strings; and imaging at least the magnetic ink recognition line from each of the plurality of documents to form the plurality of images. 4. The method of claim 1 , wherein designating each data string in the first plurality as being one of synchronized, leading, or trailing comprises: designating each data string in the first plurality as being one of: synchronized, when the data string in the first plurality matches the corresponding data string in the second plurality; leading, when the data string in the first plurality matches a data string sequentially before the corresponding data string in the second plurality; or trailing, when the data string in the first plurality matches a data string sequentially after the corresponding data string in the second plurality. 5. The method of claim 1 , wherein comparing each data string in the first plurality to the corresponding data string in the second plurality and to each data string sequentially before or sequentially after the corresponding data string in the second plurality comprises: for each data string in the first plurality: calculating respective Levenshtein distances between said data string in the first plurality and the corresponding data string in the second plurality, and between said first data string in the first plurality and each data string sequentially before or sequentially after the corresponding data string in the second plurality; selecting the lowest of the calculated Levenshtein distances; selecting the data string of the second plurality corresponding to the selected Levenshtein distance; and designating the selected data string of the second plurality as matching said data string of the first plurality. 6. The method of claim 5 , further comprising: designating the selected data string of the second plurality as being unknown if the lowest of the calculated Levenshtein distances equals or exceeds a specified Levenshtein distance threshold. 7. The method of claim 6 , further comprising: identifying a continuous sequence of N data strings in the first plurality that all have a designation of leading or unknown, or all have a designation of trailing or unknown, where N equals or exceeds the specified sequence threshold. 8. The method of claim 1 , wherein generating an error signal associated with the first and second pluralities identifying all N of the data strings in the continuous sequence as being desynchronized comprises: generating a single error signal associated with the first and second pluralities identifying all N of the data strings in the continuous sequence as being desynchronized. 9. The method of claim 1 , wherein N equals or exceeds a specified sequence threshold. 10. A method, comprising: imaging each of an ordered plurality of documents to form a respective ordered plurality of images; performing optical character recognition on the ordered plurality of images to generate an ordered first plurality of data strings; magnetically reading a magnetic ink recognition line from each of the plurality of documents to generate an ordered second plurality of data strings; comparing each data string in the first plurality to a corresponding data string in the second plurality and to each data string sequentially before or sequentially after the corresponding data string in the second plurality; based on the comparing, designating each data string in the first plurality as being one of unknown, synchronized, leading, or trailing; when a continuous sequence of N data strings in the first plurality have a same designation as leading or trailing, generating an error signal associated with the first and second pluralities identifying all N of the data strings in the continuous sequence as being desynchronized, where N equals or exceeds a specified sequence threshold; and making only images associated with data strings in the first plurality that synchronized available for viewing. 11. The method of claim 10 , wherein designating each data string in the first plurality as being one of synchronized, leading, or trailing comprises: designating each data string in the first plurality as being one of: synchronized, if the data string in the first plurality matches the corresponding data string in the second plurality; leading, if the data string in the first plurality matches a data string sequentially before the corresponding data string in the second plurality; or trailing, if the data string in the first plurality matches a data string sequentially after the corresponding data string in the second plurality. 12. The method of claim 10 , wherein comparing each data string in the first plurality to the corresponding data string in the second plurality and to each data string sequentially before or sequentially after the corresponding data string in the second plurality comprises: for each data string in the first plurality: calculating respective Levenshtein distances between said data string in the first plurality and the corresponding data string in the second plurality, and between said first data string in the first plurality and each data string sequentially before or sequentially after the corresponding data string in the second plurality; selecting the lowest of the calculated Levenshtein distances; selecting the data string of the second plurality corresponding to the selected Levenshtein distance; and designating the selected data string of the second plurality as matching said data string of the first plurality. 13. The method of claim 12 , further comprising: designating the selected data string of the second plurality as being unknown if the lowest of the calculated Levenshtein distances equals or exceeds a specified Levenshtein distance threshold. 14. The method of claim 13 , further comprising: identifying a continuous sequence of N data strings in the first plurality that all have a designation of leading or unknown, or all have a designation of trailing or unknown, where N equals or exceeds the specified sequence threshold. 15. The method of claim 10 , wherein gen

Assignees

Inventors

Classifications

  • G06V30/12Primary

    Detection or correction of errors, e.g. by rescanning the pattern · CPC title

  • using checkcodes, e.g. coded numbers derived from serial number and denomination · CPC title

  • Matching criteria, e.g. proximity measures · CPC title

  • Recognition of characters printed with magnetic ink (G06V30/2247 takes precedence) · CPC title

  • Character recognition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9959693B2 cover?
In a method for identifying desynchronization between representations of data obtained from an ordered plurality of documents, a processor can receive ordered first and second pluralities of data strings obtained from the respective plurality of documents; compare each data string in the first plurality to the corresponding data string in the second plurality and to each data string sequentiall…
Who is the assignee on this patent?
Ncr Corp
What technology area does this patent fall under?
Primary CPC classification G06V30/12. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 01 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).