Display-side adaptive video processing
US-2015245043-A1 · Aug 27, 2015 · US
US12002189B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12002189-B2 |
| Application number | US-202217684779-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 2, 2022 |
| Priority date | Mar 2, 2022 |
| Publication date | Jun 4, 2024 |
| Grant date | Jun 4, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The technology disclosed herein involves using a machine learning model (e.g., CNN) to expand lower dynamic-range image content (e.g., SDR images) into higher dynamic-range image content (e.g., HDR images). The machine learning model can take as input the lower dynamic-range image and can output multiple expansion maps that are used to make the expanded image appear more natural. The expansion maps may be used by image operators to smooth color banding and to dim overexposed regions or user interface elements in the expanded image. The expanded content (e.g., HDR image content) may then be provided to one or more devices for display or storage.
Opening claim text (preview).
What is claimed is: 1. A system comprising: a memory; a processing device, operatively coupled with the memory, the processing device to: receive an image corresponding to a first pixel value range; process an input corresponding to the image using a trained machine learning model to generate a first output and a second output, wherein the second output comprises an expansion map that indicates pixel classifications for pixels in the image, wherein one or more pixels having a first classification are included in a region of the image for which luminance is to be modified; expand the image to generate image data corresponding to a second pixel value range, wherein the second pixel value range is larger than the first pixel value range; update the image data using a first operator, wherein the first operator applies a filter to reduce one or more banding artifacts in the image data based on the first output; update the image data using a second operator, wherein the second operator modifies the luminance of the region of the expanded image based on the second output; and output the image data to one or more devices. 2. The system of claim 1 , wherein the image comprises a Standard Dynamic Range (SDR) image and the image data that is output comprises a High Dynamic Range (HDR) image. 3. The system of claim 1 , wherein the first operator is a local operator comprising a clamped bilateral filter of variable width and the second operator is a global operator comprising a luminance multiplier. 4. The system of claim 1 , wherein: the first output comprises a first expansion map that indicates, for each pixel in the image, a distance from the pixel to be considered by the first operator. 5. The system of claim 4 , wherein the first classification comprises a user interface classification, and wherein the second operator reduces at least one luminance value corresponding to the region. 6. The system of claim 4 , wherein the first classification comprises a high reflectance classification, and wherein the second operator increases at least one luminance value corresponding to the region. 7. The system of claim 1 , wherein the trained machine learning model comprises a Deep Neural Network (DNN) that uses the image as input and outputs the first output and the second output in two or more different channels, wherein the two or more different channels comprise at least a first channel that comprises a first expansion map and a second channel that comprises a second expansion map. 8. The system of claim 1 , wherein the input further comprises one or more previous outputs of the trained machine learning model. 9. The system of claim 1 , wherein to expand the image, the processing device is to dequantize an 8 bit input image to generate at least one of: 10 bit image data, 12 bit image data, 16 bit image data, or 32 bit image data. 10. The system of claim 1 , wherein the image comprises a frame of a video that comprises a sequence of frames, and the processing device is to output, to a display device, expanded image data for at least one frame of the sequence. 11. A method comprising: receiving, by a processing device, an image corresponding to a first pixel value range; processing, by the processing device, an input corresponding to the image using a trained machine learning model to generate a first output and a second output, wherein the second output comprises an expansion map that indicates pixel classifications for pixels in the image, wherein one or more pixels having a first classification are included in a region of the image for which luminance is to be modified; expanding the image to generate image data corresponding to a second pixel value range, wherein the second pixel value range is larger than the first pixel value range; updating the image data using a first operator, wherein the first operator applies a filter to reduce one or more banding artifacts in the image data based on the first output; updating the image data using a second operator, wherein the second operator modifies the luminance of the region of the expanded image based on the second output; and outputting, by the processing device, the image data. 12. The method of claim 11 , wherein the image comprises a Standard Dynamic Range (SDR) image and the image data that is output comprises a High Dynamic Range (HDR) image. 13. The method of claim 11 , wherein the first operator is a local operator comprising a clamped bilateral filter of variable width and the second operator is a global operator comprising a luminance multiplier. 14. The method of claim 11 , wherein: the first output comprises a first expansion map that indicates, for at least one pixel in the image, a distance from the pixel to be considered by the first operator. 15. The method of claim 14 , wherein the first classification comprises a user interface classification, and wherein the second operator reduces at least one luminance value corresponding to the region. 16. The method of claim 14 , wherein the first classification comprises a high reflectance classification, and wherein the second operator increases at least one luminance value corresponding to the region. 17. A non-transitory machine-readable storage medium storing instructions which, when executed, cause a processing device to perform operations comprising: receiving an image corresponding to a first pixel value range; processing an input corresponding to the image using a trained machine learning model to generate a first output and a second output, wherein the second output comprises an expansion map that indicates pixel classifications for pixels in the image, wherein one or more pixels having a first classification are included in a region of the image for which luminance is to be modified; expanding the image to generate image data corresponding to a second pixel value range, wherein the second pixel value range is larger than the first pixel value range; updating the image data using a first operator, wherein the first operator applies a filter to reduce one or more banding artifacts in the image data based on the first output; updating the image data using a second operator, wherein the second operator modifies the luminance of the region of the expanded image based on the second output; and outputting the image data to one or more devices. 18. The non-transitory machine-readable storage medium of claim 17 , wherein the image comprises a Standard Dynamic Range (SDR) image and the image data that is output comprises a High Dynamic Range (HDR) image. 19. The non-transitory machine-readable storage medium of claim 17 , wherein the first operator is a local operator comprising a clamped bilateral filter of variable width and the second operator is a global operator comprising a luminance multiplier. 20. The non-transitory machine-readable storage medium of claim 17 , wherein the trained machine learning model comprises a Convolutional Neural Network (CNN) that uses the image as input and outputs the first output and the second output in two or more different channels, wherein the two or more different channels comprise at least a first channel that comprises a first expansion map and a second channel that comprises a second expansion map.
Related publications grouped by family.
Answers are generated from the same data shown on this page.