Systems and methods for generating object detection labels using foveated image magnification for autonomous driving

US12340482B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12340482-B2
Application numberUS-202217652348-A
CountryUS
Kind codeB2
Filing dateFeb 24, 2022
Priority dateMar 16, 2021
Publication dateJun 24, 2025
Grant dateJun 24, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for processing high resolution images are disclosed. The methods include generating a saliency map of a received high-resolution image using a saliency model. The saliency map includes a saliency value associated with each of a plurality of pixels of the high-resolution image. The method then includes using the saliency map for generating an inverse transformation function that is representative of an inverse mapping of one or more first pixel coordinates in a warped image to one or more second pixel coordinates in the high-resolution image, and implementing an image warp for converting the high-resolution image to the warped image using the inverse transformation function. The warped image is a foveated image that includes at least one region having a higher resolution than one or more other regions of the warped image.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method of processing a high-resolution image, the method comprising, by a processor: receiving a high-resolution image; generating, using a saliency model, a saliency map of the high-resolution image, the saliency map comprising a saliency value associated with each of a plurality of pixels of the high-resolution image; generating, using the saliency map, an inverse transformation function that is representative of an inverse mapping of one or more first pixel coordinates in a warped image to one or more second pixel coordinates in the high-resolution image; implementing, using the inverse transformation function, an image warp for converting the high-resolution image to the warped image, the warped image being a foveated image that includes at least one region having a higher resolution than one or more other regions of the warped image; and saving the warped image to a data store. 2. The method of claim 1 , further comprising: generating, using an object detection model, one or more bounding box predictions in a frame of reference of the warped image; and transforming, using the inverse transformation function, first coordinates of the one or more bounding box predictions in the warped image to second coordinates of the one or more bounding box predictions in a frame of reference of the high-resolution image. 3. The method of claim 2 , further comprising using the second coordinates of the one or more bounding box predictions for controlling navigation of an autonomous vehicle. 4. The method of claim 1 , further comprising generating the saliency model based on one or more bounding box predictions in at least one prior frame of a video stream, the high-resolution image being a part of the video stream and is captured after the at least one prior frame. 5. The method of claim 1 , further comprising generating the saliency model based on one or more bounding box predictions in a dataset-wide prior comprising a training dataset. 6. The method of claim 1 , wherein the at least one region having the higher resolution in the warped image has a high likelihood of including an object of interest. 7. The method of claim 1 , further comprising reducing a resolution of the one or more other regions of the warped image. 8. The method of claim 1 , wherein implementing, using the inverse transformation function, the image warp for converting the high-resolution image to the warped image comprises for each of a plurality of pixels of the warped image: finding an input pixel in the high-resolution image; and bi-linearly interpolating that pixel's intensity or color from one or more pixels in the high-resolution image adjacent the input pixel. 9. The method of claim 1 , wherein the inverse transformation function is a differentiable function that is trained using backpropagation. 10. The method of claim 1 , further comprising introducing symmetries about each of a plurality of edges of the saliency map for cropping regularization of the warped image. 11. A system for processing a high-resolution image, the system comprising: a processor; and a non-transitory computer readable medium comprising programming instructions that when executed by the processor, will cause the processor to: receive a high-resolution image; generate, using a saliency model, a saliency map of the high-resolution image, the saliency map comprising a saliency value associated with each of a plurality of pixels of the high-resolution image; generate, using the saliency map, an inverse transformation function that is representative of an inverse mapping of one or more first pixel coordinates in a warped image to one or more second pixel coordinates in the high-resolution image; implement, using the inverse transformation function, an image warp for converting the high-resolution image to the warped image, the warped image being a foveated image that includes at least one region having a higher resolution than one or more other regions of the warped image; and save the warped image to a data store. 12. The system of claim 11 , further comprising programming instructions that when executed by the processor, will cause the processor to: generate, using an object detection model, one or more bounding box predictions in a frame of reference of the warped image; and transform, using the inverse transformation function, first coordinates of the one or more bounding box predictions in the warped image to second coordinates of the one or more bounding box predictions in a frame of reference of the high-resolution image. 13. The system of claim 12 , further comprising programming instructions that when executed by the processor, will cause the processor to use the second coordinates of the one or more bounding box predictions for controlling navigation of an autonomous vehicle. 14. The system of claim 11 , further comprising programming instructions that when executed by the processor, will cause the processor to generate the saliency model based on one or more bounding box predictions in at least one prior frame of a video stream, the high-resolution image being a part of the video stream and is captured after the at least one prior frame. 15. The system of claim 11 , further comprising programming instructions that when executed by the processor, will cause the processor to generate the saliency model based on one or more bounding box predictions in a dataset-wide prior comprising a training dataset. 16. The system of claim 11 , wherein the at least one region having the higher resolution in the warped image has a high likelihood of including an object of interest. 17. The system of claim 11 , further comprising programming instructions that when executed by the processor, will cause the processor to reduce a resolution of the one or more other regions of the warped image. 18. The system of claim 11 , wherein the programming instructions that when executed by the processor, will cause the processor to implement, using the inverse transformation function, the image warp for converting the high-resolution image to the warped image further comprise programming instructions to cause the processor to, for each of a plurality of pixels of the warped image: find an input pixel in the high-resolution image; and bi-linearly interpolate that pixel's intensity or color from one or more pixels in the high-resolution image adjacent the input pixel. 19. The system of claim 11 , wherein the inverse transformation function is a differentiable function that is trained using backpropagation. 20. A non-transitory computer program product for processing a high-resolution image, the computer program product comprising a memory that stores programming instructions that are configured to cause a processor to: receive a high-resolution image; generate, using a saliency model, a saliency map of the high-resolution image, the saliency map comprising a saliency value associated with each of a plurality of pixels of the high-resolution image; generate, using the saliency map, an inverse transformation function that is representative of an inverse mapping of one or more first pixel coordinates in a warped image to one or more second pixel coordinates in the high-resolution image; implement, using the inverse transformation function, an image warp for converting the high-resolution image to the warped image, the warped image being a foveated image that includes at least one region having a higher resolution than one or more other regions of the

Assignees

Inventors

Classifications

  • using neural networks · CPC title

  • based on interpolation, e.g. bilinear interpolation (image demosaicing G06T3/4015; edge-driven or edge-based scaling G06T3/403) · CPC title

  • G06T3/18Primary

    Image warping, e.g. rearranging pixels individually · CPC title

  • G06T3/053Primary

    Detail-in-context presentations (fisheye or wide-angle transformations G06T3/047) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12340482B2 cover?
Systems and methods for processing high resolution images are disclosed. The methods include generating a saliency map of a received high-resolution image using a saliency model. The saliency map includes a saliency value associated with each of a plurality of pixels of the high-resolution image. The method then includes using the saliency map for generating an inverse transformation function t…
Who is the assignee on this patent?
Ford Global Tech Llc
What technology area does this patent fall under?
Primary CPC classification G06T3/18. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 24 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).