What technology area does this patent fall under?

Primary CPC classification G06T5/92. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Jun 04 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).

Remastering lower dynamic range content for higher dynamic range displays

US12002189B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12002189-B2
Application number	US-202217684779-A
Country	US
Kind code	B2
Filing date	Mar 2, 2022
Priority date	Mar 2, 2022
Publication date	Jun 4, 2024
Grant date	Jun 4, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The technology disclosed herein involves using a machine learning model (e.g., CNN) to expand lower dynamic-range image content (e.g., SDR images) into higher dynamic-range image content (e.g., HDR images). The machine learning model can take as input the lower dynamic-range image and can output multiple expansion maps that are used to make the expanded image appear more natural. The expansion maps may be used by image operators to smooth color banding and to dim overexposed regions or user interface elements in the expanded image. The expanded content (e.g., HDR image content) may then be provided to one or more devices for display or storage.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: a memory; a processing device, operatively coupled with the memory, the processing device to: receive an image corresponding to a first pixel value range; process an input corresponding to the image using a trained machine learning model to generate a first output and a second output, wherein the second output comprises an expansion map that indicates pixel classifications for pixels in the image, wherein one or more pixels having a first classification are included in a region of the image for which luminance is to be modified; expand the image to generate image data corresponding to a second pixel value range, wherein the second pixel value range is larger than the first pixel value range; update the image data using a first operator, wherein the first operator applies a filter to reduce one or more banding artifacts in the image data based on the first output; update the image data using a second operator, wherein the second operator modifies the luminance of the region of the expanded image based on the second output; and output the image data to one or more devices. 2. The system of claim 1 , wherein the image comprises a Standard Dynamic Range (SDR) image and the image data that is output comprises a High Dynamic Range (HDR) image. 3. The system of claim 1 , wherein the first operator is a local operator comprising a clamped bilateral filter of variable width and the second operator is a global operator comprising a luminance multiplier. 4. The system of claim 1 , wherein: the first output comprises a first expansion map that indicates, for each pixel in the image, a distance from the pixel to be considered by the first operator. 5. The system of claim 4 , wherein the first classification comprises a user interface classification, and wherein the second operator reduces at least one luminance value corresponding to the region. 6. The system of claim 4 , wherein the first classification comprises a high reflectance classification, and wherein the second operator increases at least one luminance value corresponding to the region. 7. The system of claim 1 , wherein the trained machine learning model comprises a Deep Neural Network (DNN) that uses the image as input and outputs the first output and the second output in two or more different channels, wherein the two or more different channels comprise at least a first channel that comprises a first expansion map and a second channel that comprises a second expansion map. 8. The system of claim 1 , wherein the input further comprises one or more previous outputs of the trained machine learning model. 9. The system of claim 1 , wherein to expand the image, the processing device is to dequantize an 8 bit input image to generate at least one of: 10 bit image data, 12 bit image data, 16 bit image data, or 32 bit image data. 10. The system of claim 1 , wherein the image comprises a frame of a video that comprises a sequence of frames, and the processing device is to output, to a display device, expanded image data for at least one frame of the sequence. 11. A method comprising: receiving, by a processing device, an image corresponding to a first pixel value range; processing, by the processing device, an input corresponding to the image using a trained machine learning model to generate a first output and a second output, wherein the second output comprises an expansion map that indicates pixel classifications for pixels in the image, wherein one or more pixels having a first classification are included in a region of the image for which luminance is to be modified; expanding the image to generate image data corresponding to a second pixel value range, wherein the second pixel value range is larger than the first pixel value range; updating the image data using a first operator, wherein the first operator applies a filter to reduce one or more banding artifacts in the image data based on the first output; updating the image data using a second operator, wherein the second operator modifies the luminance of the region of the expanded image based on the second output; and outputting, by the processing device, the image data. 12. The method of claim 11 , wherein the image comprises a Standard Dynamic Range (SDR) image and the image data that is output comprises a High Dynamic Range (HDR) image. 13. The method of claim 11 , wherein the first operator is a local operator comprising a clamped bilateral filter of variable width and the second operator is a global operator comprising a luminance multiplier. 14. The method of claim 11 , wherein: the first output comprises a first expansion map that indicates, for at least one pixel in the image, a distance from the pixel to be considered by the first operator. 15. The method of claim 14 , wherein the first classification comprises a user interface classification, and wherein the second operator reduces at least one luminance value corresponding to the region. 16. The method of claim 14 , wherein the first classification comprises a high reflectance classification, and wherein the second operator increases at least one luminance value corresponding to the region. 17. A non-transitory machine-readable storage medium storing instructions which, when executed, cause a processing device to perform operations comprising: receiving an image corresponding to a first pixel value range; processing an input corresponding to the image using a trained machine learning model to generate a first output and a second output, wherein the second output comprises an expansion map that indicates pixel classifications for pixels in the image, wherein one or more pixels having a first classification are included in a region of the image for which luminance is to be modified; expanding the image to generate image data corresponding to a second pixel value range, wherein the second pixel value range is larger than the first pixel value range; updating the image data using a first operator, wherein the first operator applies a filter to reduce one or more banding artifacts in the image data based on the first output; updating the image data using a second operator, wherein the second operator modifies the luminance of the region of the expanded image based on the second output; and outputting the image data to one or more devices. 18. The non-transitory machine-readable storage medium of claim 17 , wherein the image comprises a Standard Dynamic Range (SDR) image and the image data that is output comprises a High Dynamic Range (HDR) image. 19. The non-transitory machine-readable storage medium of claim 17 , wherein the first operator is a local operator comprising a clamped bilateral filter of variable width and the second operator is a global operator comprising a luminance multiplier. 20. The non-transitory machine-readable storage medium of claim 17 , wherein the trained machine learning model comprises a Convolutional Neural Network (CNN) that uses the image as input and outputs the first output and the second output in two or more different channels, wherein the two or more different channels comprise at least a first channel that comprises a first expansion map and a second channel that comprises a second expansion map.

Assignees

Nvidia Corp

Inventors

Classifications

G06T5/92Primary
based on global image properties · CPC title
G06N3/08
Learning methods · CPC title
G06T5/20Primary
using local operators · CPC title
G06T5/70
Denoising; Smoothing · CPC title
G06T2200/24
involving graphical user interfaces [GUIs] · CPC title

Patent family

Related publications grouped by family.

View patent family 87850789

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12002189B2 cover?: The technology disclosed herein involves using a machine learning model (e.g., CNN) to expand lower dynamic-range image content (e.g., SDR images) into higher dynamic-range image content (e.g., HDR images). The machine learning model can take as input the lower dynamic-range image and can output multiple expansion maps that are used to make the expanded image appear more natural. The expansion …
Who is the assignee on this patent?: Nvidia Corp
What technology area does this patent fall under?: Primary CPC classification G06T5/92. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Jun 04 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).