System for image completion

US12062160B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12062160-B2
Application numberUS-202117478837-A
CountryUS
Kind codeB2
Filing dateSep 17, 2021
Priority dateSep 22, 2020
Publication dateAug 13, 2024
Grant dateAug 13, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system for image completion is disclosed. The system comprises a coordinate generation module configured to receive past frames and a present frame having a first field-of-view and to generate a set of coordinate maps, one for each of the received past frames; and a frame aggregation module configured to receive as input the past frames, the present frame, and the coordinate maps and to synthesize, based on said input, a present frame having a second field-of-view.

First claim

Opening claim text (preview).

What is claimed is: 1. A system for image completion, comprising: a coordinate generation module configured to receive first past frames and a first present frame, the first past frames and the first present frame having a first field-of-view, and to generate a set of coordinate maps, one for each of the received first past frames, wherein the coordinate map corresponding to a first past frame provides a spatial mapping of the first past frame to a coordinate system of the first present frame, wherein the coordinate generation module comprises: a depth network configured to receive the first past frames and to generate a depth map for each of the received first past frames; a pose network configured to receive frame pairs of time-adjacent frames formed from the received first past frames and the first present frame and to generate a relative camera pose for each of the received frame pairs independent from the depth map for each of the received first past frames; and a coordinate calculation module configured to generate the set of coordinate maps based on outputs of the depth network and the pose network; and a frame aggregation module configured to receive as input the first past frames, the first present frame, and the coordinate maps and to synthesize, based on said input, a second present frame having a second field-of-view, wherein a portion of the first past frames received by a depth network and the frame aggregation module are replaced by second past frames having the second field-of-view generated by the frame aggregation module and corresponding to said portion of the first past frames. 2. The system according to claim 1 , wherein the first field-of-view comprises fewer pixels than the second field-of-view. 3. The system according to claim 1 , wherein the frame aggregation module is configured to propagate information contained in the received first past frames to the coordinate system of the first present frame using the set of coordinate maps generated by the coordinate generation module. 4. The system according to claim 1 , wherein the frame aggregation module comprises: an encoder configured to generate a plurality of feature maps based on each of the first past frames and the first present frame; a warping module configured, for each of the first past frames and the first present frame, to warp the plurality of feature maps associated with said each frame, using the respective coordinate map associated with said each frame, to generate a plurality of warped feature maps for said each frame; and an attention-based feature aggregation module configured to aggregate, over all of the first past frames and the first present frame, the generated warped feature maps to generate a set of aggregated feature maps. 5. The system according to claim 4 , wherein the attention-based feature aggregation module is configured in the aggregation to emphasize, for each frame of the first past frames and the first present frame, region-specific features of the frame based on a timing of the frame relative to the first present frame. 6. The system according to claim 4 , wherein the attention-based feature aggregation module is configured to, for each frame of the first past frames and the first present frame: generate a respective frame-wise spatial attention map for each of the warped feature maps associated with said each frame; and multiply each of the warped feature maps associated with said each frame with the respective spatial attention map to generate a respective feature map. 7. The system according to claim 6 , wherein the attention-based feature aggregation module is further configured to sum, over all of the first past frames and the first present frame, the generated respective feature maps to generate the set of aggregated feature maps. 8. The system according to claim 4 , wherein the frame aggregation module further comprises a U-net module configured to generate the second present frame based on the set of aggregated feature maps, the U-net module comprising: a context normalization sub-module configured to receive the set of aggregated feature maps and to out-paint regions of the second present frame falling outside of the field-of-view of the first present frame; a decoder sub-module configured to receive feature maps output by the context normalization sub-module and to process said feature maps output by the context normalization sub-module to in-paint occluded or unobserved regions falling within the field-of-view of the first present frame; and a gated self-attention sub-module configured to receive feature maps output by the decoder sub-module and to spatially aggregate said feature maps output by the decoder sub-module. 9. The system according to claim 8 , wherein the gated self-attention sub-module is configured to spatially aggregate the feature maps output by the decoder sub-module based on weights that are dynamically generated per feature vector based on a spatial location of the feature vector. 10. The system according to claim 1 , comprising: a hallucination uncertainty module configured to generate an uncertainty map associated with the second present frame. 11. The system according to claim 10 , wherein the hallucination uncertainty module is configured to generate the uncertainty map to minimize a loss function incorporating hallucination uncertainty. 12. The system according to claim 10 , wherein the hallucination uncertainty module is configured to generate the uncertainty map based on predicting regions of the second present frame that will have a first hallucinating uncertainty and those that will have a second hallucinating uncertainty, and the first hallucinating uncertainty is higher than the second hallucinating uncertainty. 13. The system according to claim 1 , wherein the second past frames are each concatenated with a respective uncertainty map generated by a hallucination uncertainty module, before providing the second past frames to the frame aggregation module. 14. The system according to claim 1 , wherein the second field-of-view is equal or greater than the first field-of-view. 15. A system for image completion, comprising: a coordinate generation module configured to receive first past frames and a first present frame, the first past frames and the first present frame having a first field-of-view, and to generate a set of coordinate maps, one for each of the received first past frames, wherein the coordinate map corresponding to a first past frame provides a spatial mapping of the first past frame to a coordinate system of the first present frame; and a frame aggregation module configured to receive as input the first past frames, the first present frame, and the coordinate maps and to synthesize, based on said input, a second present frame having a second field-of-view, wherein the frame aggregation module comprises: an encoder configured to generate a plurality of feature maps based on each of the first past frames and the first present frame; a warping module configured, for each of the first past frames and the first present frame, to warp the plurality of feature maps associated with said each frame, using the respective coordinate map associated with said each frame, to generate a plurality of warped feature maps for said each frame; and an attention-based feature aggregation module configured to aggregate, over all of the first past frames and the first present frame, the generated warped feature maps to generate a set of aggregated feature maps. 16. A system for image completion, comprising: a coordinate generation module c

Assignees

Inventors

Classifications

  • Image warping, e.g. rearranging pixels individually · CPC title

  • Context-preserving transformations, e.g. by using an importance map (panospheric to cylindrical image transformations G06T3/12) · CPC title

  • Camera pose · CPC title

  • Image combination · CPC title

  • Video; Image sequence · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12062160B2 cover?
A system for image completion is disclosed. The system comprises a coordinate generation module configured to receive past frames and a present frame having a first field-of-view and to generate a set of coordinate maps, one for each of the received past frames; and a frame aggregation module configured to receive as input the past frames, the present frame, and the coordinate maps and to synth…
Who is the assignee on this patent?
Toyota Motor Co Ltd, Univ Leuven Kath, Eth Zuerich
What technology area does this patent fall under?
Primary CPC classification G06T5/77. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 13 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).