Text entity recognition

US9256795B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9256795-B1
Application numberUS-201313842433-A
CountryUS
Kind codeB1
Filing dateMar 15, 2013
Priority dateMar 15, 2013
Publication dateFeb 9, 2016
Grant dateFeb 9, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Various embodiments enable the identification of semi-structured text entities in an imager. The identification of the text entities is a relatively simple problem when the text is stored in a computer and free of errors, but much more challenging if the source is the output of an optical character recognition (OCR) engine from a natural scene image. Accordingly, output from an OCR engine is analyzed to isolate a character string indicative of a text entity. Each character of the string is then assigned to a character class to produce a character class string and the text entity of the string is identified based in part on a pattern of the character class string.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: displaying an image, captured by a camera of a computing device, on a display element of the computing device; analyzing the image to locate a region of text in the image; recognizing text within the region with an optical character recognition (OCR) engine, the OCR engine providing an output of recognized text including characters grouped by one or more text lines; analyzing the one or more text lines to isolate a character string indicative of at least one of a phone number, an email address, or a uniform resource locator (URL), each character of the isolated character string being assigned to a character class to produce a character class string; based at least in part on a pattern of the character class string, determining a matching score for the isolated character string with respect to at least one of a phone number, an email address, or a URL, the isolated character string being identified as the at least one of a phone number, an email address, or a URL if the matching score is greater than a threshold score, wherein determining the matching store comprises assigning costs to edits made to the character class string, wherein a cost associated with mistaking characters that are similar in appearance is small and the cost associated with mistaking characters that are relatively different in appearance is greater than a threshold value; determining an overlay template and respective functionality for the at least one of a phone number, an email address, or a URL; and displaying the isolated character string on the display element using the overlay template as an overlay element in a live field of view being captured by the camera, the overlay element including at least one user-selectable element enabling the respective functionality associated with the at least one of a phone number, an email address, or a URL to be performed. 2. The computer-implemented method of claim 1 , wherein various costs to various edits are provided in an N by M matrix where N represents a number possible ASCII characters and M represents a number of possible character classes. 3. The computer-implemented method of claim 1 , wherein analyzing the one or more text lines to isolate groups of characters includes omitting groups of characters that do not fit a pattern indicative of at least one of the phone number, the email address, or the URL. 4. The computer-implemented method of claim 1 , further comprising: validating the isolated character string by comparing the isolated character string to a determined number of most popular URLs in response to the pattern of the character string being indicative of a URL. 5. A computer-implemented method, comprising: receiving an output from an optical character recognition (OCR) engine; analyzing the output to isolate a character string indicative of a text entity; assigning each character of the isolated character string to a character class to produce a character class string; and based at least in part on a pattern identified for the character class string, identifying the isolated character string as being the text entity, wherein the isolated character string is identified as the text entity in response to determining a matching score above a threshold for the isolated character string, the matching score being based at least in part on a number of edits made to the character class string, and wherein determining the matching store comprises assigning costs to edits made to the character class string, wherein a cost associated with mistaking characters that are similar in appearance is small and the cost associated with mistaking characters that are relatively different in appearance is greater than a threshold value. 6. The computer-implemented method of claim 5 , wherein analyzing the output to isolate the character string indicative of the text entity includes performing one or more heuristic tests. 7. The computer-implemented method of claim 6 , further comprising: converting the output into text lines; and omitting characters that do not fit a pattern indicative of the text entity. 8. The computer-implemented method of claim 5 , wherein various costs to various edits are provided in an N by M matrix where N represents a number possible ASCII characters and M represents a number of possible character classes. 9. The computer-implemented method of claim 5 , further comprising: autocorrecting a character in the character string to a character of a character class associated with the text entity in response to identifying the character belonging to a character class not associated with the text entity type. 10. The computer-implemented method of claim 5 , further comprising: validating the isolated character string by comparing the isolated character string to a determined number of most popular URLs in response to the pattern of the character string being indicative of a URL. 11. The computer-implemented method of claim 5 , further comprising: validating the isolated character string by comparing the isolated character string to valid area codes in response to the pattern of the character string being indicative of a phone number. 12. A computing device, comprising: a processor; a display screen; and memory including instructions that, when executed by the processor, cause the computing device to: receive an output from an optical character recognition (OCR) engine; analyze the output to isolate a character string indicative of a text entity; assign each character of the isolated character string to a character class to produce a character class string; and based at least in part on a pattern of the character class string, identify the isolated character string as being the text entity, wherein the isolated character string is identified as the text entity in response to determining a matching score above a threshold for the isolated character string, the matching score being based at least in part on a number of edits made to the character class string, and wherein determining the matching store comprises assigning costs to edits made to the character class string, wherein a cost associated with mistaking characters that are similar in appearance is small and the cost associated with mistaking characters that are relatively different in appearance is greater than a threshold value. 13. The computing device of claim 12 , wherein analyzing the output to isolate the character string indicative of the text entity includes performing one or more heuristic tests. 14. The computing device of claim 13 , wherein the instructions, when executed by the processor, further enable the computing device to: convert the output into text lines; and omit characters that do not fit a pattern indicative of the text entity. 15. A non-transitory computer-readable storage medium storing instructions that, when executed by at least one processor, cause a computing device to: receive an output from an optical character recognition (OCR) engine; analyze the output to isolate a character string indicative of a text entity; assign each character of the isolated character string to a character class to produce a character class string; and based at least in part on a pattern of the character class string, identify the isolated character string as being the text entity, wherein the isolated character string is identified as the text entity in response to determining a matching score above a threshold for the isolated character string, the matching score being based at least in part on a number of edits made to the cha

Assignees

Inventors

Classifications

  • based on positionally close symbols, e.g. amount sign or URL-specific characters · CPC title

  • Text, e.g. of license plates, overlay texts or captions on TV images · CPC title

  • Matching criteria, e.g. proximity measures · CPC title

  • Digital output to display device {; Cooperation and interconnection of the display device with other functional units} · CPC title

  • Editing, e.g. inserting or deleting · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9256795B1 cover?
Various embodiments enable the identification of semi-structured text entities in an imager. The identification of the text entities is a relatively simple problem when the text is stored in a computer and free of errors, but much more challenging if the source is the output of an optical character recognition (OCR) engine from a natural scene image. Accordingly, output from an OCR engine is an…
Who is the assignee on this patent?
A9 Com Inc
What technology area does this patent fall under?
Primary CPC classification G06V30/1452. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 09 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).