Image processing apparatus, control method, and non-transitory storage medium that obtain text data for an image

US10984233B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10984233-B2
Application numberUS-201916282200-A
CountryUS
Kind codeB2
Filing dateFeb 21, 2019
Priority dateFeb 28, 2018
Publication dateApr 20, 2021
Grant dateApr 20, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus acquires an image by reading a document, extracts a plurality of regions having a predetermined attribute from the acquired image, determines information about a registered document most similar to the acquired image from among information about a plurality of registered documents stored in a storage unit with use of positional information about the extracted plurality of regions, selects a processing target region in the acquired image based on a position of a processing target region previously specified with respect to the determined information about the most similar registered document, performs character recognition processing on the selected processing target region, and displays text data obtained by the character recognition processing.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus comprising: a memory that stores a program; and a processor that executes the program to: extract a plurality of regions having a predetermined attribute from an acquired image; determine, based on positional information about the extracted plurality of regions, information about a registered document most similar to the acquired image from among information about a plurality of registered documents stored in a storage unit; select a processing target region in the acquired image based on a position of a processing target region previously specified with respect to the determined information about the most similar registered document; perform first character recognition processing on the selected processing target region; display first text data obtained by the first character recognition processing; perform second character recognition processing on a region specified by a user; and join second text data obtained by the second character recognition processing performed on the specified region to the first text data obtained by the first character recognition processing performed on the selected processing target region. 2. The apparatus according to claim 1 , wherein the text data which is displayed is recommended as a file name of the acquired image. 3. The apparatus according to claim 1 , wherein, in a case where positions of a plurality of processing target regions are previously specified with respect to the determined information about the most similar registered document, a plurality of processing target regions in the acquired image is selected, and wherein the first character recognition processing is performed on the selected plurality of processing target regions. 4. The apparatus according to claim 3 , wherein a plurality of pieces of text data obtained by the first character recognition processing performed on the selected plurality of processing target regions is displayed while being joined with a predetermined delimiter character. 5. The apparatus according to claim 4 , wherein positions of a plurality of processing target regions are previously specified with respect to the determined information about the most similar registered document, and a sequential order is previously set with respect to the plurality of processing target regions, and wherein a plurality of pieces of text data obtained by the first character recognition processing performed on the selected plurality of processing target regions is arranged according to the previously set sequential order and is displayed while being joined with the predetermined delimiter character. 6. The apparatus according to claim 1 , wherein the specified region is specified by the user from among the extracted plurality of regions, and wherein the second character recognition processing is performed on the specified region if character recognition processing has not been performed on the specified region yet. 7. The apparatus according to claim 1 , wherein the second text data obtained by the second character recognition processing performed on the specified region is joined and displayed behind the first text data obtained by the first character recognition processing performed on the selected processing target region. 8. The apparatus according to claim 1 , wherein a predetermined delimiter character and the second text data obtained by the second character recognition processing performed on the specified region are joined and displayed behind the first text data obtained by the first character recognition processing performed on the selected processing target region. 9. The apparatus according to claim 1 , wherein the processor further executes the program to: edit the displayed first and/or second text data based on an instruction from the user. 10. The apparatus according to claim 1 , wherein the extracting of the plurality of regions having the predetermined attribute from the acquired image is performed with respect to a previously determined partial image in the acquired image. 11. The apparatus according to claim 10 , wherein the partial image is determined based on a region on which the first and/or second character recognition processing were performed in a previously processed acquired image. 12. The apparatus according to claim 10 , wherein, if information about a registered document similar to the acquired image from among the information about the plurality of registered documents stored in the storage unit is not determined based on the positional information about the plurality of regions extracted from the partial image, the processor extracts a plurality of regions having the predetermined attribute from the entire acquired image and determines information about a registered document most similar to the acquired image from among the information about the plurality of registered documents stored in the storage unit based on positional information about the plurality of regions extracted from the entire acquired image. 13. A method for an apparatus, the method comprising: acquiring an image by reading a document; extracting a plurality of regions having a predetermined attribute from the acquired image; determining, based on positional information about the extracted plurality of regions, information about a registered document most similar to the acquired image from among information about a plurality of registered documents stored in a storage unit; selecting a processing target region in the acquired image based on a position of a processing target region previously specified with respect to the determined information about the most similar registered document; performing the character recognition processing on the selected processing target region; displaying first text data obtained by the first character recognition processing; performing second character recognition processing on a region specified by a user; and adding second text data obtained by the second character recognition processing performed on the specified region to the first text data obtained by the first character recognition processing performed on the selected processing target region. 14. A non-transitory computer-readable storage medium storing a program that, when executed by a computer, causes the computer to: extract a plurality of regions having a predetermined attribute from an acquired image; determine, based on positional information about the extracted plurality of regions, information about a registered document most similar to the acquired image from among information about a plurality of registered documents stored in a storage unit; select a processing target region in the acquired image based on a position of a processing target region previously specified with respect to the determined information about the most similar registered document; perform first character recognition processing on the selected processing target region; display first text data obtained by the first character recognition processing; perform second character recognition processing on a region specified by a user; and add second text data obtained by the second character recognition processing performed on the specified region to the first text data obtained by the first character recognition processing performed on the selected processing target region. 15. The non-transitory computer-readable storage medium according to claim 14 , wherein the text data which is displayed is recommended as a file name of the acquired image. 16. The non-transitory computer-readable storage medium acc

Assignees

Inventors

Classifications

  • with a server, e.g. an internet server (fax-servers or the like for store and forward H04N1/324) · CPC title

  • Classification of content, e.g. text, photographs or tables · CPC title

  • Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables · CPC title

  • Determination of region of interest · CPC title

  • Character recognition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10984233B2 cover?
An apparatus acquires an image by reading a document, extracts a plurality of regions having a predetermined attribute from the acquired image, determines information about a registered document most similar to the acquired image from among information about a plurality of registered documents stored in a storage unit with use of positional information about the extracted plurality of regions, …
Who is the assignee on this patent?
Canon Kk
What technology area does this patent fall under?
Primary CPC classification H04N1/00244. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Apr 20 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).