Image processing apparatus and image processing program
US-2019197303-A1 · Jun 27, 2019 · US
US10984233B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10984233-B2 |
| Application number | US-201916282200-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 21, 2019 |
| Priority date | Feb 28, 2018 |
| Publication date | Apr 20, 2021 |
| Grant date | Apr 20, 2021 |
Official abstract text for this publication.
An apparatus acquires an image by reading a document, extracts a plurality of regions having a predetermined attribute from the acquired image, determines information about a registered document most similar to the acquired image from among information about a plurality of registered documents stored in a storage unit with use of positional information about the extracted plurality of regions, selects a processing target region in the acquired image based on a position of a processing target region previously specified with respect to the determined information about the most similar registered document, performs character recognition processing on the selected processing target region, and displays text data obtained by the character recognition processing.
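The matching step in the abstract can be illustrated with a minimal sketch. The abstract says only that positional information about the extracted regions is used to find the most similar registered document; it does not state a similarity measure. The intersection-over-union score, the 0.5 threshold, and every name below (`Box`, `RegisteredDocument`, `layout_similarity`, `find_most_similar`, `select_target_regions`) are assumptions made for illustration, not the patented method.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned region: top-left corner plus width and height."""
    x: float
    y: float
    w: float
    h: float


@dataclass
class RegisteredDocument:
    """Hypothetical record for one registered document layout."""
    name: str
    regions: list          # list of Box: text regions found at registration
    target_regions: list   # list of Box: regions previously chosen for OCR


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two boxes (assumed similarity measure)."""
    ix = max(0.0, min(a.x + a.w, b.x + b.w) - max(a.x, b.x))
    iy = max(0.0, min(a.y + a.h, b.y + b.h) - max(a.y, b.y))
    inter = ix * iy
    union = a.w * a.h + b.w * b.h - inter
    return inter / union if union > 0 else 0.0


def layout_similarity(scanned: list, registered: list) -> float:
    """Mean best-overlap of each registered region against the scanned ones."""
    if not scanned or not registered:
        return 0.0
    total = sum(max(iou(r, s) for s in scanned) for r in registered)
    return total / len(registered)


def find_most_similar(scanned: list, documents: list, threshold: float = 0.5):
    """Return the best-scoring registered document, or None below threshold."""
    best, best_score = None, threshold
    for doc in documents:
        score = layout_similarity(scanned, doc.regions)
        if score >= best_score:
            best, best_score = doc, score
    return best


def select_target_regions(scanned: list, doc: RegisteredDocument) -> list:
    """For each registered target, pick the scanned region overlapping it most."""
    selected = []
    for target in doc.target_regions:
        candidate = max(scanned, key=lambda s: iou(target, s))
        if iou(target, candidate) > 0:
            selected.append(candidate)
    return selected
```

In this sketch the selected regions would then be passed to whatever character recognition engine is in use, and the recognized text shown to the user. Returning `None` when no layout clears the threshold leaves room for a fallback such as asking the user to specify a region manually.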
Opening claim text (preview).
What is claimed is:
1. An apparatus comprising: a memory that stores a program; and a processor that executes the program to: extract a plurality of regions having a predetermined attribute from an acquired image; determine, based on positional information about the extracted plurality of regions, information about a registered document most similar to the acquired image from among information about a plurality of registered documents stored in a storage unit; select a processing target region in the acquired image based on a position of a processing target region previously specified with respect to the determined information about the most similar registered document; perform first character recognition processing on the selected processing target region; display first text data obtained by the first character recognition processing; perform second character recognition processing on a region specified by a user; and join second text data obtained by the second character recognition processing performed on the specified region to the first text data obtained by the first character recognition processing performed on the selected processing target region.
2. The apparatus according to claim 1, wherein the text data which is displayed is recommended as a file name of the acquired image.
3. The apparatus according to claim 1, wherein, in a case where positions of a plurality of processing target regions are previously specified with respect to the determined information about the most similar registered document, a plurality of processing target regions in the acquired image is selected, and wherein the first character recognition processing is performed on the selected plurality of processing target regions.
4. The apparatus according to claim 3, wherein a plurality of pieces of text data obtained by the first character recognition processing performed on the selected plurality of processing target regions is displayed while being joined with a predetermined delimiter character.
5. The apparatus according to claim 4, wherein positions of a plurality of processing target regions are previously specified with respect to the determined information about the most similar registered document, and a sequential order is previously set with respect to the plurality of processing target regions, and wherein a plurality of pieces of text data obtained by the first character recognition processing performed on the selected plurality of processing target regions is arranged according to the previously set sequential order and is displayed while being joined with the predetermined delimiter character.
6. The apparatus according to claim 1, wherein the specified region is specified by the user from among the extracted plurality of regions, and wherein the second character recognition processing is performed on the specified region if character recognition processing has not been performed on the specified region yet.
7. The apparatus according to claim 1, wherein the second text data obtained by the second character recognition processing performed on the specified region is joined and displayed behind the first text data obtained by the first character recognition processing performed on the selected processing target region.
8. The apparatus according to claim 1, wherein a predetermined delimiter character and the second text data obtained by the second character recognition processing performed on the specified region are joined and displayed behind the first text data obtained by the first character recognition processing performed on the selected processing target region.
9. The apparatus according to claim 1, wherein the processor further executes the program to: edit the displayed first and/or second text data based on an instruction from the user.
10. The apparatus according to claim 1, wherein the extracting of the plurality of regions having the predetermined attribute from the acquired image is performed with respect to a previously determined partial image in the acquired image.
11. The apparatus according to claim 10, wherein the partial image is determined based on a region on which the first and/or second character recognition processing were performed in a previously processed acquired image.
12. The apparatus according to claim 10, wherein, if information about a registered document similar to the acquired image from among the information about the plurality of registered documents stored in the storage unit is not determined based on the positional information about the plurality of regions extracted from the partial image, the processor extracts a plurality of regions having the predetermined attribute from the entire acquired image and determines information about a registered document most similar to the acquired image from among the information about the plurality of registered documents stored in the storage unit based on positional information about the plurality of regions extracted from the entire acquired image.
13. A method for an apparatus, the method comprising: acquiring an image by reading a document; extracting a plurality of regions having a predetermined attribute from the acquired image; determining, based on positional information about the extracted plurality of regions, information about a registered document most similar to the acquired image from among information about a plurality of registered documents stored in a storage unit; selecting a processing target region in the acquired image based on a position of a processing target region previously specified with respect to the determined information about the most similar registered document; performing the character recognition processing on the selected processing target region; displaying first text data obtained by the first character recognition processing; performing second character recognition processing on a region specified by a user; and adding second text data obtained by the second character recognition processing performed on the specified region to the first text data obtained by the first character recognition processing performed on the selected processing target region.
14. A non-transitory computer-readable storage medium storing a program that, when executed by a computer, causes the computer to: extract a plurality of regions having a predetermined attribute from an acquired image; determine, based on positional information about the extracted plurality of regions, information about a registered document most similar to the acquired image from among information about a plurality of registered documents stored in a storage unit; select a processing target region in the acquired image based on a position of a processing target region previously specified with respect to the determined information about the most similar registered document; perform first character recognition processing on the selected processing target region; display first text data obtained by the first character recognition processing; perform second character recognition processing on a region specified by a user; and add second text data obtained by the second character recognition processing performed on the specified region to the first text data obtained by the first character recognition processing performed on the selected processing target region.
15. The non-transitory computer-readable storage medium according to claim 14, wherein the text data which is displayed is recommended as a file name of the acquired image.
16. The non-transitory computer-readable storage medium acc
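The text-joining behavior recited in claims 4, 5, 7 and 8 can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `join_recognized_text`, the `(order, text)` pair representation, and the underscore default delimiter are all assumptions, since the claims refer only to "a predetermined delimiter character" and "a previously set sequential order".

```python
def join_recognized_text(ordered_first, user_texts=(), delimiter="_"):
    """Join OCR results into one string, e.g. a suggested file name.

    ordered_first: iterable of (order, text) pairs for the automatically
        selected target regions; lower order values come first (claim 5).
    user_texts: texts from regions the user specified afterwards; these are
        appended behind the first text data (claims 7 and 8).
    delimiter: assumed stand-in for the "predetermined delimiter character".
    """
    first = [text for _, text in sorted(ordered_first, key=lambda pair: pair[0])]
    parts = [t.strip() for t in first + list(user_texts)]
    # Skip empty results so a failed recognition does not leave a doubled delimiter.
    return delimiter.join(p for p in parts if p)


suggested = join_recognized_text(
    [(2, "2019-02-21"), (1, "Invoice")],
    user_texts=["ACME Corp"],
)
print(suggested)  # Invoice_2019-02-21_ACME Corp
```

Per claim 2, a string built this way would be offered as a recommended file name for the scanned image, and per claim 9 the user could still edit it before it is applied.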
with a server, e.g. an internet server (fax-servers or the like for store and forward H04N1/324) · CPC title
Classification of content, e.g. text, photographs or tables · CPC title
Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables · CPC title
Determination of region of interest · CPC title
Character recognition · CPC title