Enhancing images in text documents

US12079912B2 · US · B2

Patent metadata
Field               Value
Publication number  US-12079912-B2
Application number  US-202218054162-A
Country             US
Kind code           B2
Filing date         Nov 10, 2022
Priority date       Nov 10, 2022
Publication date    Sep 3, 2024
Grant date          Sep 3, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Images placed in documents are enhanced based on the context in which the image is used. Context is determined according to document-specific indicators such as nearby text, headings, titles, and tables of content. A generative adversarial network (GAN) modifies the image according to the context to selectively emphasize relevant components of the image, which may include erasing or deleting irrelevant components. Relevant general-purpose images may be retrieved for use in the document and may be selectively enhanced according to usage of the general-purpose image in a given document.
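The context-determination step described in the abstract can be illustrated with a minimal sketch. It assumes image features arrive as plain text labels (for example from an object detector) and uses simple token overlap as a stand-in for the natural language processing the patent refers to. The names `ImageFeature`, `usage_context`, `select_relevant`, and the stop-word list are hypothetical and do not come from the patent.

```python
import re
from dataclasses import dataclass

# Hypothetical stop-word list; a real system would rely on a proper NLP model.
STOP_WORDS = {"the", "a", "an", "of", "in", "and", "to", "is", "for", "on", "with"}


@dataclass
class ImageFeature:
    label: str  # assumed to be a text label, e.g. produced by an object detector


def tokenize(text: str) -> set[str]:
    """Lowercase the text and keep alphabetic tokens that are not stop words."""
    return {t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS}


def usage_context(nearby_text: str, heading: str = "", title: str = "") -> set[str]:
    """Pool tokens from the document-specific indicators named in the abstract."""
    return tokenize(nearby_text) | tokenize(heading) | tokenize(title)


def select_relevant(features: list[ImageFeature], context: set[str]) -> list[ImageFeature]:
    """Keep features whose label shares at least one token with the context."""
    return [f for f in features if tokenize(f.label) & context]


features = [ImageFeature("engine"), ImageFeature("tree"), ImageFeature("front wheel")]
context = usage_context(
    nearby_text="The engine is mounted above the front axle.",
    heading="Engine layout",
)
selected = select_relevant(features, context)
print([f.label for f in selected])  # ['engine', 'front wheel']
```

Token overlap is chosen here only because it is easy to follow; the patent itself does not specify this matching rule.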

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method comprising: identifying, in a document, an image placed at a location within the document and a set of text proximate the location of the image; determining, by natural language processing, a meaning of the set of text; determining a usage context for the image based on the meaning of the set of text, the usage context being a purpose for placing the image at the location within the document; identifying a set of features in the image; selecting, from the set of features, a sub-set of features as being related to the usage context; and determining an enhancement action that emphasizes the selected sub-set of features over remaining features in the set of features.

2. The computer-implemented method of claim 1, further comprising: monitoring the document during an editing process; and detecting placement of the image in the document, wherein: the determining the usage context is performed responsive to the detecting placement.

3. The computer-implemented method of claim 1, further comprising: analyzing the set of features with respect to the meaning of the set of text to determine a relevance level of each feature to the meaning of the set of text; and ranking a first feature as being more relevant than a second feature, the first and second features identified within the image; wherein: selecting the sub-set of features includes selecting the first feature, the first feature being related to the usage context by being more relevant than the second feature.

4. The computer-implemented method of claim 3, further comprising: responsive to the first feature being of high relevance, modifying the image to emphasize the first feature, the modifying including resizing the first feature in the document and repositioning associated text in the image.

5. The computer-implemented method of claim 4, wherein the modifying further includes increasing resolution of the associated text in the image.

6. The computer-implemented method of claim 1, further comprising: modifying the sub-set of features according to the enhancement action by: inputting the image and the set of features into a generative adversarial network (GAN) to take the enhancement action.

7. The computer-implemented method of claim 1, further comprising: identifying the image for placement into the document based on assigned topic from an image repository of general-purpose images; and presenting the identified image to a user for use in the document.

8. A computer program product comprising a computer-readable storage medium having a set of instructions stored therein which, when executed by a processor, causes the processor to perform a method comprising: identifying, in a document, an image embedded at a location within the document and a set of text proximate the location of the image; determining, by natural language processing, a meaning of the set of text; determining a usage context for the image based on the meaning of the set of text, the usage context being a purpose for placing the image at the location within the document; identifying a set of features in the image; selecting, from the set of features, a sub-set of features as being related to the usage context; and determining an enhancement action that emphasizes the selected sub-set of features over remaining features in the set of features.

9. The computer program product of claim 8, further comprising: monitoring the document during an editing process; and detecting placement of the image in the document, wherein: the determining the usage context is performed responsive to the detecting placement.

10. The computer program product of claim 8, further comprising: analyzing the set of features with respect to the meaning of the set of text to determine a relevance level of each feature to the meaning of the set of text; and ranking a first feature as being more relevant than a second feature, the first and second features identified within the image; wherein: selecting the sub-set of features includes selecting the first feature, the first feature being related to the usage context by being more relevant than the second feature.

11. The computer program product of claim 10, further comprising: responsive to the first feature being of high relevance, modifying the image to emphasize the first feature, the modifying including resizing the first feature in the document and repositioning associated text in the image.

12. The computer program product of claim 11, wherein the modifying further includes increasing resolution of the associated text in the image.

13. The computer program product of claim 8, further comprising: modifying the sub-set of features according to the enhancement action by: inputting the image and the set of features into a generative adversarial network (GAN) to take the enhancement action.

14. The computer program product of claim 8, further comprising: identifying the image for placement into the document based on assigned topic from an image repository of general-purpose images; and presenting the identified image to a user for use in the document.

15. A computer system comprising: a processor set; and a computer readable storage medium; wherein: the processor set is structured, located, connected, and/or programmed to run program instructions stored on the computer readable storage medium; and the program instructions which, when executed by the processor set, cause the processor set to perform a method comprising: identifying, in a document, an image embedded at a location within the document and a set of text proximate the location of the image; determining, by natural language processing, a meaning of the set of text; determining a usage context for the image based on the meaning of the set of text, the usage context being a purpose for placing the image at the location within the document; identifying a set of features in the image; selecting, from the set of features, a sub-set of features as being related to the usage context; and determining an enhancement action that emphasizes the selected sub-set of features over remaining features in the set of features.

16. The computer system of claim 15, further comprising: monitoring the document during an editing process; and detecting placement of the image in the document, wherein: the determining the usage context is performed responsive to the detecting placement.

17. The computer system of claim 15, further comprising: analyzing the set of features with respect to the meaning of the set of text to determine a relevance level of each feature to the meaning of the set of text; and ranking a first feature as being more relevant than a second feature, the first and second features identified within the image; wherein: selecting the sub-set of features includes selecting the first feature, the first feature being related to the usage context by being more relevant than the second feature.

18. The computer system of claim 17, further comprising: responsive to the first feature being of high relevance, modifying the image to emphasize the first feature, the modifying including resizing the first feature in the document, repositioning associated text in the image, and increasing resolution of the associated text in the image.

19. The computer system of claim 15, further comprising: modifying the sub-set of features according to the enhancement action by: inputting the image and t
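The ranking and enhancement-action steps recited in the claims can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: relevance levels are taken as given numbers in [0, 1], the `high` threshold of 0.7 is arbitrary, and the names `ScoredFeature`, `plan_enhancement`, and the action strings are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ScoredFeature:
    label: str
    relevance: float  # assumed to lie in [0, 1]; how it is computed is out of scope


def plan_enhancement(features: list[ScoredFeature], high: float = 0.7) -> dict[str, str]:
    """Rank features by relevance and assign an illustrative action to each."""
    ranked = sorted(features, key=lambda f: f.relevance, reverse=True)
    actions: dict[str, str] = {}
    for f in ranked:
        # "emphasize" / "de-emphasize" are placeholder action names only.
        actions[f.label] = "emphasize" if f.relevance >= high else "de-emphasize"
    return actions


plan = plan_enhancement([
    ScoredFeature("tree", 0.1),
    ScoredFeature("engine", 0.9),
    ScoredFeature("front wheel", 0.4),
])
print(plan)
# {'engine': 'emphasize', 'front wheel': 'de-emphasize', 'tree': 'de-emphasize'}
```

In the claims, the resulting action would then drive the actual image modification (for example resizing a feature, or passing the image and features to a GAN); that step is deliberately left out of this sketch.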

Assignees

IBM

Inventors

Classifications

  • Image enhancement or restoration · CPC title

  • Selection of the most significant subset of features · CPC title

  • Extraction of features or characteristics of the image · CPC title

  • Classification of content, e.g. text, photographs or tables · CPC title

  • G06F40/20 · Primary

    Natural language analysis (semantic analysis of natural language G06F40/30) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12079912B2 cover?
Images placed in documents are enhanced based on the context in which the image is used. Context is determined according to document-specific indicators such as nearby text, headings, titles, and tables of content. A generative adversarial network (GAN) modifies the image according to the context to selectively emphasize relevant components of the image, which may include erasing or deleting ir…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F40/20. Mapped technology areas include Physics.
When was this patent published?
Publication date: Sep 3, 2024 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).