Text extraction from graphical user interface content

US9520102B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9520102-B2
Application numberUS-201313872172-A
CountryUS
Kind codeB2
Filing dateApr 29, 2013
Priority dateApr 29, 2013
Publication dateDec 13, 2016
Grant dateDec 13, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for extracting text from images rendered on a display screen, the method comprising capturing a color image rendered on a display screen; and transforming the color image to binary color image, preserving text-like graphic components and filtering out non-text-like graphical components. The transforming comprises scanning one or more areas of the color image; and detecting continuous bi-tonal regions in the scanned one or more areas, wherein the continuous bi-tonal regions have large variances.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for extracting text from images rendered on a display screen, the method comprising: capturing a color image rendered on a display screen; and transforming the color image to binary color image, preserving textual graphic components and filtering out non-textual graphical components, said transforming comprises: scanning one or more areas of the color image; detecting continuous regions in the scanned one or more areas, the continuous regions having two distinguishable shades, wherein a first shade represents the foreground and a second shade represents the background, and wherein the continuous regions comprise multiple components; representing a set having a component from the continuous regions; and extracting the component as text component, wherein the component is recognized based on switches between runs where a single color part of the run comprises a lower number of pixels than all other single color parts of the run. 2. The method of claim 1 , wherein the binary image comprises black and white colors. 3. The method of claim 1 , wherein the binary image is a gray scaled image. 4. The method of claim 1 , wherein the detecting comprises collecting horizontal sets of pixels in the continuous regions that contain two shades. 5. The method of claim 4 , wherein the collected horizontal sets of pixels are combined into bi-level regions. 6. The method of claim 5 , wherein pixels associated with a foreground color are distinguished from pixels from a background color, in response to determining that a bi-level region contains text. 7. A system for extracting text from images rendered on a display screen, the system comprising: a logic unit for capturing a color image rendered on a display screen; and a logic unit for transforming the color image to binary color image, preserving textual graphic components and filtering out non-textual graphical components, wherein the transforming comprises: scanning one or more areas of the color image; detecting continuous regions in the scanned one or more areas, the continuous regions having two distinguishable shades, wherein a first shade represents the foreground and a second shade represents the background, and wherein the continuous regions comprise multiple components; representing a set having a component from the one or more continuous regions; and extracting the component as text component, wherein the component is recognized based on switches between runs where a single color part of the run comprises a lower number of pixels than all other single color parts of the run. 8. A computer program product comprising a non-transitory computer readable storage medium having a computer readable program, wherein the computer readable program when executed on a computer causes the computer to: capture a color image rendered on a display screen; and transform the color image to binary color image, preserving textual graphic components and filtering out non-textual graphical components, wherein the transforming comprises: scanning one or more areas of the color image; detecting continuous regions in the scanned one or more areas, the continuous regions having two distinguishable shades, wherein a first shade represents the foreground and a second shade represents the background wherein the continuous regions comprise multiple components; representing a set having a component from the one or more continuous regions; and extracting the component as text component, wherein the component is recognized based on switches between runs where a single color part of the run comprises a lower number of pixels than all other single color parts of the run.

Assignees

Inventors

Classifications

  • G06V30/413Primary

    Classification of content, e.g. text, photographs or tables · CPC title

  • G09G5/02Primary

    characterised by the way in which colour is displayed {(details of colour display specific for CRTs G09G1/28; specific for flat matrix panels other than liquid crystal displays G09G3/2003; specific for liquid crystal displays G09G3/3607)} · CPC title

  • by performing operations on regions, e.g. growing, shrinking or watersheds · CPC title

  • Discrimination between different image types, e.g. two-tone, continuous tone · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9520102B2 cover?
Systems and methods for extracting text from images rendered on a display screen, the method comprising capturing a color image rendered on a display screen; and transforming the color image to binary color image, preserving text-like graphic components and filtering out non-text-like graphical components. The transforming comprises scanning one or more areas of the color image; and detecting c…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06V30/413. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 13 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).