Methods and apparatuses of contrastive learning for color constancy

US12079306B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12079306-B2
Application numberUS-202117529539-A
CountryUS
Kind codeB2
Filing dateNov 18, 2021
Priority dateNov 20, 2020
Publication dateSep 3, 2024
Grant dateSep 3, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A contrastive learning method for color constancy employs a fully-supervised construction of contrastive pairs, driven by a novel data augmentation. The contrastive learning method includes receiving two training images, constructing positive and negative contrastive pairs by the novel data augmentation, extracting representations by a feature extraction function, and training a color constancy model by contrastive learning representations in the positive contrastive pair are closer than representations in the negative contrastive pair. The positive contrastive pair contains images having an identical illuminant while negative contrastive pair contains images having different illuminants. The contrastive learning method improves the performance without additional computational costs. The desired contrastive pairs allow the color constancy model to learn better illuminant feature that are particular robust to worse-cases in data sparse regions.

First claim

Opening claim text (preview).

The invention claimed is: 1. A contrastive learning method for color constancy in an image or video processing system, comprising: receiving input data associated with a first training image captured in a first scene under a first illuminant, and a second training image captured in a second scene under a second illuminant; constructing at least a positive contrastive pair and at least a negative contrastive pair by applying a data augmentation to the first and second training images, wherein each positive contrastive pair contains two images having an identical illuminant and each negative contrastive pair contains two images having different illuminants; extracting representations of the images in the positive and negative contrastive pairs by a feature extraction function; and training a color constancy model by contrastive learning, wherein the color constancy model is trained by learning representations in each positive contrastive pair are closer than the representations in each negative contrastive pair. 2. The method of claim 1 , wherein the step of training a color constancy model by contrastive learning further comprises: mapping each representation to a projection in a latent projection space by a feature projection function; measuring a similarity between projections of the positive contrastive pair and a similarity between projections of the negative contrastive pair; and maximizing the similarity between the projections of the positive pair and minimizing the similarity between the projections of the negative pair by a contrastive loss function. 3. The method of claim 1 , wherein the data augmentation augments the first training image to a different view to derive a first augmented image, wherein the first augmented image is label-preserving as the first training image and the first augmented image share a same ground-truth illuminant. 4. The method of claim 1 , wherein the step of constructing positive and negative contrastive pairs further comprises: deriving a novel illuminant by interpolation or extrapolation between the first illuminant and the second illuminant; synthesizing a first augmented image having the first scene and the first illuminant, a second augmented image having the second scene and the first illuminant, a third augmented image having the first scene and the novel illuminant, and a fourth augmented image having the second scene and the novel illuminant by the data augmentation; and constructing an easy positive contrastive pair by including the first training image and the first augmented image, constructing an easy negative contrastive pair by including the first training image and the fourth augmented image, constructing a hard positive contrastive pair by including the first training image and the second augmented image, and constructing a hard negative contrastive pair by including the first training image and the third augmented image. 5. The method of claim 4 , wherein the data augmentation extracts canonical colors from the first and second training images to form color checkers, fits a color mapping matrix and an inverse color mapping matrix to map between the two color checkers, derives two additional color mapping matrices from the color mapping matrix and inverse color mapping matrix for the novel illuminant, applies the color mapping matrix to the second training image to synthesize the second augmented image, and applies the two additional color mapping matrices to the first and second training images to synthesize the third and fourth augmented images respectively. 6. The method of claim 5 , wherein the color mapping matrix and inverse color mapping matrix are full color transformation matrices and the two additional color mapping matrices are full color transformation matrices. 7. The method of claim 5 , wherein the color mapping matrix and inverse color mapping matrix are reduced from full color transformation matrices to diagonal matrices, and the two additional color mapping matrices are derived from an identity matrix, the color mapping matrix, and inverse color mapping matrix, wherein the third and fourth augmented images are synthesized by simplified neutral color mapping using the two additional color mapping matrices. 8. The method of claim 4 , further comprising: mapping each representation to a projection in a latent projection space by a feature projection function; computing a first loss for the representations of the easy positive contrastive pair and easy negative contrastive pair, a second loss for the representations of the easy positive contrastive pair and hard negative contrastive pair, a third loss for the representations of the hard positive contrastive pair and easy negative contrastive pair, and a fourth loss for the representations of the hard positive contrastive pair and hard negative contrastive pair; and computing a contrastive loss by a sum of the first, second, third and fourth losses. 9. The method of claim 1 , wherein the step of constructing positive and negative contrastive pairs further comprises: synthesizing a first augmented image having the second scene and the first illuminant and a second augmented image having the first scene and the second illuminant by the data augmentation; constructing the positive contrastive pair by including the first training image and the first augmented image and constructing the negative contrastive pair by including the first training image and the second augmented image. 10. The method of claim 9 , wherein the data augmentation extracts canonical colors from the first and second training images to form color checkers, fits a color mapping matrix and an inverse color mapping matrix to map between the two color checkers, and applies the color mapping matrix and the inverse color mapping matrix to the first and second training images to synthesize the first and second augmented images. 11. The method of claim 1 , wherein the color constancy model is trained by scene-invariant and illuminant-dependent representations, so that representations of a same scene under different illuminants are far from each other and representations of different scenes under a same illuminant are close to each other. 12. An apparatus conducting contrastive learning for color constancy in an image or video processing system, the apparatus comprising one or more electronic circuits configured for: receiving input data associated with a first training image captured in a first scene under a first illuminant, and a second training image captured in a second scene under a second illuminant; constructing at least a positive contrastive pair and at least a negative contrastive pair by applying a data augmentation to the first and second training images, wherein each positive contrastive pair contains two images having an identical illuminant and each negative contrastive pair contains two images having different illuminants; extracting representations of the images in the positive and negative contrastive pairs by a feature extraction function; and training a color constancy model by contrastive learning, wherein the color constancy model is trained by learning representations in each positive contrastive pair are closer than the representations in each negative contrastive pair.

Assignees

Inventors

Classifications

  • G06V10/56Primary

    relating to colour · CPC title

  • Colour balance circuits, e.g. white balance circuits or colour temperature control (camera processing pipelines for colour balance H04N23/88) · CPC title

  • Matching criteria, e.g. proximity measures · CPC title

  • Extraction of image or video features · CPC title

  • Determination of colour characteristics · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12079306B2 cover?
A contrastive learning method for color constancy employs a fully-supervised construction of contrastive pairs, driven by a novel data augmentation. The contrastive learning method includes receiving two training images, constructing positive and negative contrastive pairs by the novel data augmentation, extracting representations by a feature extraction function, and training a color constancy…
Who is the assignee on this patent?
Mediatek Inc
What technology area does this patent fall under?
Primary CPC classification G06V10/56. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 03 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).