Information processing apparatus, information processing method, and storage medium

US9898845B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9898845-B2
Application numberUS-201514791070-A
CountryUS
Kind codeB2
Filing dateJul 2, 2015
Priority dateJul 7, 2014
Publication dateFeb 20, 2018
Grant dateFeb 20, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In a case where a position of a recognized cell is shifted from a position of a ruled line of an actual cell, if the recognized cell is deleted, a part of the ruled line of the actual cell is deleted. According to an aspect of the present invention, straight lines are detected from regions around four sides constituting the recognized cell, and an inside of a region surrounded by the detected straight lines is deleted.

First claim

Opening claim text (preview).

What is claimed is: 1. An information processing apparatus comprising: a processor; and a memory for storing a program, wherein the processor executes the program to perform: obtaining a document image; setting a recognized cell in a table region included in the document image; detecting straight lines from regions around four sides that constitute the recognized cell; and deleting, in the document image, color information of an inside of a region surrounded by the detected four straight lines. 2. The information processing apparatus according to claim 1 , wherein the regions around the four sides constituting the recognized cell are regions enlarged in orthogonal directions to respective sides while the respective sides are set as references. 3. The information processing apparatus according to claim 1 , wherein the straight line is an edge of a ruled line of an original cell corresponding to the recognized cell. 4. The information processing apparatus according to claim 3 , wherein the detecting includes an edge detection to detect edge pixels from the regions around the four sides, and a ruled line detection to detect the straight lines based on a number of duplications of lines passing through the detected respective edge pixels. 5. The information processing apparatus according to claim 4 , wherein, in a case where a plurality of lines are detected from the region around one side, an innermost line is detected as the straight line among the plurality of lines while a center position of the recognized cell is set as a reference. 6. The information processing apparatus according to claim 1 , wherein the recognized cell is set based on a circumscribed rectangle of a white pixel block detected from a table region included in the document image. 7. The information processing apparatus according to claim 1 , wherein the recognized cell is set at a position operated by a user in the table region included in the document image. 8. The information processing apparatus according to claim 1 , wherein the deleting substitutes the color information of the inside of the region surrounded by the detected four straight lines with default color, wherein the default color is white or a background color of the region. 9. The information processing apparatus according to claim 1 , wherein the obtained document image is a scanned image generated by scanning a document. 10. An information processing method executed by an information processing apparatus, wherein the information processing method comprising: obtaining a document image; setting a recognized cell inside a table region included in the document image; detecting straight lines from regions around four sides that constitute the recognized cell; and deleting color information of an inside of a region surrounded by the detected four straight lines. 11. The information processing method according to claim 10 , wherein the regions around the four sides constituting the recognized cell are regions enlarged in orthogonal directions to the respective sides while the respective sides are set as references. 12. The information processing method according to claim 10 , the straight line is an edge of a ruled line of an original cell corresponding to the recognized cell. 13. The information processing method according to claim 10 , wherein the detecting includes an edge detection to detect edge pixels from the regions around the four sides, and a ruled line detection to detect the straight lines on the basis of a number of duplications of straight lines passing through the detected respective pixels. 14. The information processing method according to claim 13 , wherein, in a case where a plurality of lines are detected from the region around one side, an innermost line is detected as the straight line among the plurality of lines while a center position of the recognized cell is set as a reference in the ruled line detection. 15. The information processing method according to claim 10 , wherein the recognized cell is set based on a circumscribed rectangle of a white pixel block detected from the table region included in the document image. 16. The information processing method according to claim 10 , wherein the recognized cell is set at a position operated by a user in the table region included in the document image. 17. A non-transitory computer-readable medium storing instructions that, when executed by a processor, cause the processor to perform operations comprising: obtaining a document image; setting a recognized cell inside a table region included in the document image; detecting straight lines from regions around four sides that constitute the recognized cell; and deleting color information of an inside of a region surrounded by the detected four straight lines. 18. The non-transitory computer-readable medium according to claim 17 , wherein the regions around the four sides constituting the recognized cell are regions enlarged in orthogonal directions to respective sides while the respective sides are set as references. 19. The non-transitory computer-readable medium according to claim 17 , wherein the straight line is an edge of a ruled line of an original cell corresponding to the recognized cell. 20. The non-transitory computer-readable medium according to claim 19 , wherein the detecting includes an edge detection to detect edge pixels from the regions around the four sides, and a ruled line detection unit to detect the straight lines based on a number of duplications of lines passing through the detected respective edge pixels. 21. The non-transitory computer-readable medium according to claim 17 , wherein the recognized cell is set based on a circumscribed rectangle of a white pixel block detected from the table region included in the document image. 22. The non-transitory computer-readable medium according to claim 17 , wherein the recognized cell is set at a position operated by a user in the table region included in the document image.

Assignees

Inventors

Classifications

  • Inclination or skew detection or correction of characters or of image to be recognised · CPC title

  • Drawing of charts or graphs · CPC title

  • using straight lines or curves · CPC title

  • Character recognition · CPC title

  • Cropping · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9898845B2 cover?
In a case where a position of a recognized cell is shifted from a position of a ruled line of an actual cell, if the recognized cell is deleted, a part of the ruled line of the actual cell is deleted. According to an aspect of the present invention, straight lines are detected from regions around four sides constituting the recognized cell, and an inside of a region surrounded by the detected s…
Who is the assignee on this patent?
Canon Kk
What technology area does this patent fall under?
Primary CPC classification G06T11/60. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 20 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).