Object segmentation from light field data

US10136116B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10136116-B2
Application numberUS-201615253037-A
CountryUS
Kind codeB2
Filing dateAug 31, 2016
Priority dateMar 7, 2016
Publication dateNov 20, 2018
Grant dateNov 20, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A scene is segmented into objects based on light field data for the scene, including based on image pixel values (e.g., intensity) and disparity map(s). In one aspect, the light field data is used to estimate one or more disparity maps for the scene taken from different viewpoints. The scene is then segmented into a plurality of regions that correspond to objects in the scene. Unlike other approaches, the regions can be variable-depth. In one approach, the regions are defined by boundaries. The boundaries are determined by varying the boundary to optimize an objective function for the region defined by the boundary. The objective function is based in part on a similarity function that measures a similarity of image pixel values for pixels within the boundary and also measures a similarity of disparities for pixels within the boundary.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for segmenting a scene into objects based on light field data for the scene, the method implemented on a processor executing instructions and the method comprising: accessing light field data for the scene, the light field data including a plurality of images of the scene taken from different viewpoints, the images comprising pixels having image pixel values; from the light field data, estimating one or more disparity maps for the scene taken from different viewpoints; and segmenting the scene into variable-depth regions that correspond to objects in the scene, the variable-depth regions defined by boundaries, at least one boundary of the boundaries defining at least one variable-depth region of the variable-depth regions determined by varying the at least one boundary to optimize an objective function for the at least one-variable-depth region; wherein: the objective function for the at least one variable-depth region is based in part on a similarity function that measures a similarity of image pixel values for pixels within the at least one boundary defining the at least one variable-depth region; the similarity function takes a form of S ⁡ ( x , L ) = [ D ⁡ ( x , p x ) - 1  L  ⁢ ∑ x ∈ L ⁢ D ⁡ ( x , p x ) ] 2  where x denotes a pixel in the image, x is a disparity of that pixel x, D(x, x ) is a descriptor function, L is the at least one variable-depth region within the at least one boundary, and |L| denotes a cardinality of region L; and the descriptor function D(x, x ) is based on: [μ( x, x )] 2  where μ(x, x ) is a mean image pixel value for all pixels along an epipolar line that passes through the pixel x with the associated disparity x . 2. The method of claim 1 wherein the similarity function also measures a similarity of disparities for pixels within the at least one variable-depth region defined by the at least one boundary. 3. The method of claim 1 wherein the objective function is further based in part on a second similarity function that measures a similarity of pixels outside the at least one variable-depth region defined by the at least one boundary. 4. The method of claim 3 wherein the similarity function for pixels within the at least one variable-depth region has a same functional form as the similarity function for pixels outside the at least one variable-depth region. 5. The method of claim 1 wherein the objective function is further based in part on a factor that measures a length of the at least one boundary defining the at least one variable-depth region. 6. The method of claim 1 wherein the descriptor function D(x, x ) is based on: [ I ( x )−μ( x, x )] 2 where I(x) is the image pixel value for pixel x. 7. The method of claim 1 wherein the similarity function is based on a descriptor function that is calculated for a plurality of different viewpoints. 8. The method of claim 1 wherein the similarity function is based on a descriptor function that is calculated for a plurality of different color channels. 9. The method of claim 1 wherein varying the at least one boundary to optimize an objective function for the at least one variable-depth region comprises: evaluating the similarity function for an initial boundary for the at least one boundary and evolving the at least one boundary based on an active contour framework. 10. The method of claim 1 wherein estimating one or more disparity maps comprises estimating the disparity maps based on epipolar structure in the light field data. 11. The method of claim 1 wherein estimating one or more disparity maps accounts for occlusions in the scene. 12. The method of claim 1 wherein estimating one or more disparity maps is based on a Ray-Gaussian transform. 13. The method of claim 1 further comprising: capturing the light field data for the scene before accessing the light field data. 14. The method of claim 1 wherein the light field data is light field data for an interior of an ear canal, and segmenting the scene into a plurality of regions comprises segmenting the scene into a tympanic membrane region and a background region. 15. A computer-implemented method for segmenting a scene into objects based on light field data for the scene, the method implemented on a processor executing instructions and the method comprising: accessing light field data for the scene, the light field data including a plurality of images of the scene taken from different viewpoints, the images comprising pixels having image pixel values; from the light field data, estimating one or more disparity maps for the scene taken from different viewpoints; and segmenting the scene into variable-depth regions that correspond to objects in the scene, the variable-depth regions defined by boundaries, at least one boundary of the boundaries defining at least one variable-depth region of the variable-depth regions determined by varying the at least one boundary to optimize an objective function for the at least one variable-depth region; wherein: the objective function for the at least one variable-depth region is based in part on a similarity function that measures a similarity of image pixel values for pixels within the at least one boundary defining the at least one variable-depth region; the similarity function takes a form of S ⁡ ( x ,

Assignees

Inventors

Classifications

  • Region-based segmentation · CPC title

  • Hierarchical, coarse-to-fine, multiscale or multiresolution image processing; Pyramid transform · CPC title

  • involving computational photography · CPC title

  • G06T7/557Primary

    from light fields, e.g. from plenoptic cameras · CPC title

  • involving graph-based methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10136116B2 cover?
A scene is segmented into objects based on light field data for the scene, including based on image pixel values (e.g., intensity) and disparity map(s). In one aspect, the light field data is used to estimate one or more disparity maps for the scene taken from different viewpoints. The scene is then segmented into a plurality of regions that correspond to objects in the scene. Unlike other appr…
Who is the assignee on this patent?
Tosic Ivana, Kuruvilla Anupama, Berkner Kathrin, and 3 more
What technology area does this patent fall under?
Primary CPC classification G06T7/557. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 20 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).