Segmentation mask extrapolation

US12307685B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12307685-B2
Application numberUS-202217931859-A
CountryUS
Kind codeB2
Filing dateSep 13, 2022
Priority dateSep 13, 2022
Publication dateMay 20, 2025
Grant dateMay 20, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and techniques are described herein for processing video data. For instance, a technique can include receiving a first image after a previous image. The process can further include receiving a first segmentation mask associated with the previous image. The process can also include estimating a first set of forward motion vectors between the previous image and the first image. The process can further include estimating a reliability of the first set of forward motion vectors. The process can also include extrapolating a second segmentation mask associated with the first image using the first set of forward motion vectors and first segmentation mask based on the estimated reliability of the first set of forward motion vectors.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for processing video data, the method comprising: receiving a first image after a previous image; receiving a first segmentation mask associated with the previous image; estimating a first set of forward motion vectors between the previous image and the first image; estimating a reliability of the first set of forward motion vectors; and extrapolating a second segmentation mask associated with the first image using the first set of forward motion vectors and the first segmentation mask based on the estimated reliability of the first set of forward motion vectors. 2. The method of claim 1 , wherein estimating the reliability of the first set of forward motion vectors comprises: determining one or more heuristic statistics based on the first set of forward motion vectors; and evaluating the one or more heuristic statistics to estimate the reliability of the first set of forward motion vectors. 3. The method of claim 2 , wherein the one or more heuristic statistics comprise at least one of: gradients for one or more motion vectors of the first set of forward motion vectors, or a percent change of a foreground. 4. The method of claim 3 , wherein evaluating the one or more heuristic statistics comprise at least one of: evaluating the gradients for one or more motion vectors against a gradient threshold value, or estimating the percent change of a foreground based on a threshold value of foreground motion vectors. 5. The method of claim 1 , further comprising: receiving a second image, wherein the second image is a non-key frame, wherein a segmentation mask machine learning model is configured to generate segmentation masks for key frames; estimating a second set of forward motion vectors between the second image and another previous image; estimating the reliability of the second set of forward motion vectors; and determining to apply a segmentation mask machine learning model to the second image based on an estimate that the second set of forward motion vectors is not reliable. 6. The method of claim 1 , further comprising labeling one or more portions of the first segmentation mask as part of a foreground or background. 7. The method of claim 6 , further comprising: detecting an uncovered portion of the first image based on the first set of forward motion vectors; and filling the uncovered portion based on a neighboring background portion of the first segmentation mask and the previous image. 8. The method of claim 7 , wherein filling the uncovered portion comprises: identifying a motion vector associated with a foreground portion of the first segmentation mask closest to a portion of the uncovered portion; copying the motion vector; and associating the copied motion vector with the portion of the uncovered portion. 9. The method of claim 1 , wherein the first image is a non-key frame, and wherein a segmentation mask machine learning model is configured to generate segmentation masks for key frames. 10. The method of claim 1 , further comprising: receiving a third image, wherein the third image is a key frame, wherein a segmentation mask machine learning model is configured to generate segmentation masks for key frames; and applying a segmentation mask machine learning model to the third image. 11. An apparatus for processing video data, the apparatus comprising: at least one memory; and at least one processor coupled to the at least one memory, the at least one processor being configured to: receive a first image after a previous image; receive a first segmentation mask associated with the previous image; estimate a first set of forward motion vectors between the previous image and the first image; estimate a reliability of the first set of forward motion vectors; and extrapolate a second segmentation mask associated with the first image using the first set of forward motion vectors and the first segmentation mask based on the estimated reliability of the first set of forward motion vectors. 12. The apparatus of claim 11 , wherein, to estimate the reliability of the first set of forward motion vectors, the at least one processor is further configured to: determine one or more heuristic statistics based on the first set of forward motion vectors; and evaluate the one or more heuristic statistics to estimate the reliability of the first set of forward motion vectors. 13. The apparatus of claim 12 , wherein the one or more heuristic statistics comprise at least one of: gradients for one or more motion vectors of the first set of forward motion vectors, or a percent change of a foreground. 14. The apparatus of claim 13 , wherein, to evaluate the one or more heuristic statistics, the at least one processor is further configured to: evaluate the gradients for one or more motion vectors against a gradient threshold value, or estimate the percent change of a foreground based on a threshold value of foreground motion vectors. 15. The apparatus of claim 11 , wherein the at least one processor is further configured to: receive a second image, wherein the second image is a non-key frame, wherein a segmentation mask machine learning model is configured to generate segmentation masks for key frames; estimate a second set of forward motion vectors between the second image and another previous image; estimate the reliability of the second set of forward motion vectors; and determine to apply a segmentation mask machine learning model to the second image based on an estimate that the second set of forward motion vectors is not reliable. 16. The apparatus of claim 11 , wherein the at least one processor is further configured to label one or more portions of the first segmentation mask as part of a foreground or background. 17. The apparatus of claim 16 , wherein the at least one processor is further configured to: detect an uncovered portion of the first image based on the first set of forward motion vectors; and fill the uncovered portion based on a neighboring background portion of the first segmentation mask and the previous image. 18. The apparatus of claim 17 , wherein, to fill the uncovered portion, the at least one processor is further configured to: identify a motion vector associated with a foreground portion of the first segmentation mask closest to a portion of the uncovered portion; copy the motion vector; and associate the copied motion vector with the portion of the uncovered portion. 19. The apparatus of claim 17 , wherein the first image is a non-key frame, and wherein a segmentation mask machine learning model is configured to generate segmentation masks for key frames. 20. The apparatus of claim 11 , wherein the at least one processor is further configured to: receive a third image, wherein the third image is a key frame, wherein a segmentation mask machine learning model is configured to generate segmentation masks for key frames; and apply a segmentation mask machine learning model to the third image. 21. A non-transitory computer-readable medium having stored thereon instructions that, when executed by at least one processor, cause the at least one processor to: receive a first image after a previous image; receive a first segmentation mask associated with the previous image; estimate a first set of forward motion vectors between the previous image and the first image; estimate a reliability of the first set of forward motion vectors; and extrapolate a second segmentation mask

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12307685B2 cover?
Systems and techniques are described herein for processing video data. For instance, a technique can include receiving a first image after a previous image. The process can further include receiving a first segmentation mask associated with the previous image. The process can also include estimating a first set of forward motion vectors between the previous image and the first image. The proces…
Who is the assignee on this patent?
Qualcomm Inc
What technology area does this patent fall under?
Primary CPC classification G06T7/215. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 20 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).