Electronic device for estimating camera illuminant and method of the same
US-2022156899-A1 · May 19, 2022 · US
US12079306B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12079306-B2 |
| Application number | US-202117529539-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 18, 2021 |
| Priority date | Nov 20, 2020 |
| Publication date | Sep 3, 2024 |
| Grant date | Sep 3, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A contrastive learning method for color constancy employs a fully-supervised construction of contrastive pairs, driven by a novel data augmentation. The contrastive learning method includes receiving two training images, constructing positive and negative contrastive pairs by the novel data augmentation, extracting representations by a feature extraction function, and training a color constancy model by contrastive learning representations in the positive contrastive pair are closer than representations in the negative contrastive pair. The positive contrastive pair contains images having an identical illuminant while negative contrastive pair contains images having different illuminants. The contrastive learning method improves the performance without additional computational costs. The desired contrastive pairs allow the color constancy model to learn better illuminant feature that are particular robust to worse-cases in data sparse regions.
Opening claim text (preview).
The invention claimed is: 1. A contrastive learning method for color constancy in an image or video processing system, comprising: receiving input data associated with a first training image captured in a first scene under a first illuminant, and a second training image captured in a second scene under a second illuminant; constructing at least a positive contrastive pair and at least a negative contrastive pair by applying a data augmentation to the first and second training images, wherein each positive contrastive pair contains two images having an identical illuminant and each negative contrastive pair contains two images having different illuminants; extracting representations of the images in the positive and negative contrastive pairs by a feature extraction function; and training a color constancy model by contrastive learning, wherein the color constancy model is trained by learning representations in each positive contrastive pair are closer than the representations in each negative contrastive pair. 2. The method of claim 1 , wherein the step of training a color constancy model by contrastive learning further comprises: mapping each representation to a projection in a latent projection space by a feature projection function; measuring a similarity between projections of the positive contrastive pair and a similarity between projections of the negative contrastive pair; and maximizing the similarity between the projections of the positive pair and minimizing the similarity between the projections of the negative pair by a contrastive loss function. 3. The method of claim 1 , wherein the data augmentation augments the first training image to a different view to derive a first augmented image, wherein the first augmented image is label-preserving as the first training image and the first augmented image share a same ground-truth illuminant. 4. The method of claim 1 , wherein the step of constructing positive and negative contrastive pairs further comprises: deriving a novel illuminant by interpolation or extrapolation between the first illuminant and the second illuminant; synthesizing a first augmented image having the first scene and the first illuminant, a second augmented image having the second scene and the first illuminant, a third augmented image having the first scene and the novel illuminant, and a fourth augmented image having the second scene and the novel illuminant by the data augmentation; and constructing an easy positive contrastive pair by including the first training image and the first augmented image, constructing an easy negative contrastive pair by including the first training image and the fourth augmented image, constructing a hard positive contrastive pair by including the first training image and the second augmented image, and constructing a hard negative contrastive pair by including the first training image and the third augmented image. 5. The method of claim 4 , wherein the data augmentation extracts canonical colors from the first and second training images to form color checkers, fits a color mapping matrix and an inverse color mapping matrix to map between the two color checkers, derives two additional color mapping matrices from the color mapping matrix and inverse color mapping matrix for the novel illuminant, applies the color mapping matrix to the second training image to synthesize the second augmented image, and applies the two additional color mapping matrices to the first and second training images to synthesize the third and fourth augmented images respectively. 6. The method of claim 5 , wherein the color mapping matrix and inverse color mapping matrix are full color transformation matrices and the two additional color mapping matrices are full color transformation matrices. 7. The method of claim 5 , wherein the color mapping matrix and inverse color mapping matrix are reduced from full color transformation matrices to diagonal matrices, and the two additional color mapping matrices are derived from an identity matrix, the color mapping matrix, and inverse color mapping matrix, wherein the third and fourth augmented images are synthesized by simplified neutral color mapping using the two additional color mapping matrices. 8. The method of claim 4 , further comprising: mapping each representation to a projection in a latent projection space by a feature projection function; computing a first loss for the representations of the easy positive contrastive pair and easy negative contrastive pair, a second loss for the representations of the easy positive contrastive pair and hard negative contrastive pair, a third loss for the representations of the hard positive contrastive pair and easy negative contrastive pair, and a fourth loss for the representations of the hard positive contrastive pair and hard negative contrastive pair; and computing a contrastive loss by a sum of the first, second, third and fourth losses. 9. The method of claim 1 , wherein the step of constructing positive and negative contrastive pairs further comprises: synthesizing a first augmented image having the second scene and the first illuminant and a second augmented image having the first scene and the second illuminant by the data augmentation; constructing the positive contrastive pair by including the first training image and the first augmented image and constructing the negative contrastive pair by including the first training image and the second augmented image. 10. The method of claim 9 , wherein the data augmentation extracts canonical colors from the first and second training images to form color checkers, fits a color mapping matrix and an inverse color mapping matrix to map between the two color checkers, and applies the color mapping matrix and the inverse color mapping matrix to the first and second training images to synthesize the first and second augmented images. 11. The method of claim 1 , wherein the color constancy model is trained by scene-invariant and illuminant-dependent representations, so that representations of a same scene under different illuminants are far from each other and representations of different scenes under a same illuminant are close to each other. 12. An apparatus conducting contrastive learning for color constancy in an image or video processing system, the apparatus comprising one or more electronic circuits configured for: receiving input data associated with a first training image captured in a first scene under a first illuminant, and a second training image captured in a second scene under a second illuminant; constructing at least a positive contrastive pair and at least a negative contrastive pair by applying a data augmentation to the first and second training images, wherein each positive contrastive pair contains two images having an identical illuminant and each negative contrastive pair contains two images having different illuminants; extracting representations of the images in the positive and negative contrastive pairs by a feature extraction function; and training a color constancy model by contrastive learning, wherein the color constancy model is trained by learning representations in each positive contrastive pair are closer than the representations in each negative contrastive pair.
relating to colour · CPC title
Colour balance circuits, e.g. white balance circuits or colour temperature control (camera processing pipelines for colour balance H04N23/88) · CPC title
Matching criteria, e.g. proximity measures · CPC title
Extraction of image or video features · CPC title
Determination of colour characteristics · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.