Eye contact correction in real time using machine learning

US10664949B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10664949-B2
Application numberUS-201615136621-A
CountryUS
Kind codeB2
Filing dateApr 22, 2016
Priority dateApr 22, 2016
Publication dateMay 26, 2020
Grant dateMay 26, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques related to eye contact correction to provide a virtual user gaze aligned with a camera while the user views a display are discussed. Such techniques may include determining and reducing histogram of oriented gradient features for an eye region of a source image to provide a feature set, applying a pretrained classifier to the feature set to determine a motion vector field for the eye region, and warping and inserting the eye region into the source image to generate an eye contact corrected image.

First claim

Opening claim text (preview).

What is claimed is: 1. A machine implemented method for providing eye contact correction comprising: determining histogram of oriented gradient features for an eye region of a source image; reducing the histogram of oriented gradient features to provide a feature set corresponding to the eye region of the source image by applying principal component analysis to the histogram of oriented gradient features using pretrained scaling and weighting factors trained based on a plurality of pairs of training eye region images comprising first images having a gaze angle difference with respect to second images of the pairs of training eye region images, generation of a likelihood map for each pixel of each of the first images of the pairs of training eye region images, wherein each likelihood map comprises a sum of absolute differences for each of a plurality of candidate motion vectors corresponding to each pixel, determination of training stage histogram of oriented gradient features for the first images of the pairs of training eye region images, and application of principal component analysis to reduce the training stage histogram of oriented gradient features to a plurality of training stage feature sets corresponding to the first images of the pairs of training eye region images and to determine the pretrained scaling and weighting factors of the principal component analysis; applying a pretrained random forest classifier to the entire feature set to determine a leaf of the pretrained random forest classifier that correlates the feature set to a motion vector field for the eye region of the source image; and warping the eye region of the source image based on the motion vector field and integrating the warped eye region into a remaining portion of the source image to generate an eye contact corrected image. 2. The method of claim 1 , further comprising: providing face detection and face landmark detection on the source image; and cropping the source image based on the face detection and the face landmark detection to generate the eye region. 3. The method of claim 1 , further comprising: encoding and transmitting the eye contact corrected image to a remote device for presentment to a user. 4. The method of claim 1 , wherein said reducing the histogram of oriented gradient features and said applying the pretrained random forest classifier to the feature set to determine the leaf of the pretrained random forest classifier that correlates the feature set to the motion vector field for the eye region of the source image are selectively provided based on a camera and a display having a first relative position therebetween, the method further comprising, when the camera and the display have a second relative position therebetween: reducing the histogram of oriented gradient features to provide a second feature set corresponding to the eye region of the source image; applying a second pretrained classifier to the second feature set to determine a second motion vector field for the eye region of the source image; and warping the eye region of the source image based on the second motion vector field and integrating the warped eye region into the remaining portion of the source image to generate the eye contact corrected image. 5. The method of claim 1 , wherein the pretrained random forest classifier is generated based on a training stage pretrained classifier trained based on the likelihood maps and the training stage feature sets. 6. The method of claim 5 , wherein the pretrained random forest classifier is further generated based on compression of the training stage pretrained classifier by parameterized surface fitting to generate the pretrained random forest classifier. 7. A system for providing eye contact correction comprising: a memory configured to store a source image; and a processor coupled to the memory, the processor to: determine histogram of oriented gradient features for an eye region of a source image; reduce the histogram of oriented gradient features to provide a feature set corresponding to the eye region of the source image by the processor to apply principal component analysis to the histogram of oriented gradient features using pretrained scaling and weighting factors trained based on a plurality of pairs of training eye region images comprising first images of the pairs having a gaze angle difference with respect to second images of the pairs of training eye region images, generation of a likelihood map for each pixel of each of the first images of the pairs of training eye region images, wherein each likelihood map comprises a sum of absolute differences for each of a plurality of candidate motion vectors corresponding to each pixel, determination of training stage histogram of oriented gradient features for the first images of the pairs of training eye region images, and application of principal component analysis to reduce the training stage histogram of oriented gradient features to a plurality of training stage feature sets corresponding to the first images of the pairs of training eye region images and to determine the pretrained scaling and weighting factors of the principal component analysis; apply a pretrained random forest classifier to the entire feature set to determine a leaf of the pretrained random forest classifier that correlates the feature set to a motion vector field for the eye region of the source image; and warp the eye region of the source image based on the motion vector field and integrate the warped eye region into a remaining portion of the source image to generate an eye contact corrected image. 8. The system of claim 7 , wherein to reduce the histogram of oriented gradient features and to apply the pretrained random forest classifier to the feature set to determine the leaf of the pretrained random forest classifier that correlates the feature set to the motion vector field for the eye region of the source image are selectively provided based on a camera and a display having a first relative position therebetween, the processor further, when the camera and the display have a second relative position therebetween, to reduce the histogram of oriented gradient features to provide a second feature set corresponding to the eye region of the source image, to apply a second pretrained classifier to the second feature set to determine a second motion vector field for the eye region of the source image, and to warp the eye region of the source image based on the second motion vector field and integrate the warped eye region into the remaining portion of the source image to generate the eye contact corrected image. 9. The system of claim 7 , wherein the pretrained random forest classifier is generated based on a training stage pretrained classifier trained based on the likelihood maps and the training stage feature sets, and compression of the training stage pretrained classifier by parameterized surface fitting to generate the pretrained classifier. 10. At least one non-transitory machine readable medium comprising a plurality of instructions that, in response to being executed on a device, cause the device to provide eye contact correction by: determining histogram of oriented gradient features for an eye region of a source image; reducing the histogram of oriented gradient features to provide a feature set corresponding to the eye region of the source image by applying principal component analysis to the histogram of oriented gradient features using pretrained scaling and weighting factors trained based on a plurality of pairs of training eye region images comprising first images having a gaze angle difference with respect to second images of the pairs of training eye region

Assignees

Inventors

Classifications

  • Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals (selecting H04Q) · CPC title

  • Physics · mapped topic

  • Physics · mapped topic

  • Physics · mapped topic

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10664949B2 cover?
Techniques related to eye contact correction to provide a virtual user gaze aligned with a camera while the user views a display are discussed. Such techniques may include determining and reducing histogram of oriented gradient features for an eye region of a source image to provide a feature set, applying a pretrained classifier to the feature set to determine a motion vector field for the eye…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06T3/0093. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 26 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).