System and method for processing video to provide facial de-identification

US9799096B1 · US · B1

Patent metadata
Publication number: US-9799096-B1
Application number: US-201514794165-A
Country: US
Kind code: B1
Filing date: Jul 8, 2015
Priority date: Jul 8, 2014
Publication date: Oct 24, 2017
Grant date: Oct 24, 2017

Abstract

Official abstract text for this publication.

A system and method for real-time image and video face de-identification that removes the identity of the subject while preserving the facial behavior is described. The facial features of the source face are replaced with those of the target face while preserving the facial actions of the source face on the target face. The facial actions of the source face are transferred to the target face using personalized Facial Action Transfer (FAT), and the color and illumination are adapted. Finally, the source image or video containing the target facial features is outputted for display. Alternatively, the system can run in real-time.
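The expression transfer summarized in the abstract is recited in claim 1 as a triangle-based deformation: the change of the source face relative to its neutral pose is computed per triangle and then applied to the target's neutral face. The following is a minimal 2D sketch of that idea for a single triangle, not the patented implementation. The function names (`edge_matrix`, `triangle_deformation`, `transfer_to_target`) are hypothetical, and handling each triangle independently with its first vertex held fixed is a simplifying assumption; a full mesh would also need to keep shared vertices consistent.

```python
def edge_matrix(tri):
    """Return the 2x2 matrix whose columns are the edges v1-v0 and v2-v0."""
    (x0, y0), (x1, y1), (x2, y2) = tri
    return [[x1 - x0, x2 - x0],
            [y1 - y0, y2 - y0]]


def inv2(m):
    """Invert a 2x2 matrix; raises on a degenerate (zero-area) triangle."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    if abs(det) < 1e-12:
        raise ValueError("degenerate triangle")
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]


def matmul2(a, b):
    """Multiply two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def triangle_deformation(neutral_tri, expr_tri):
    """Matrix Q with Q @ E_neutral = E_expr for one source triangle."""
    return matmul2(edge_matrix(expr_tri), inv2(edge_matrix(neutral_tri)))


def transfer_to_target(q, target_neutral_tri):
    """Apply Q to the target's neutral triangle, keeping its first vertex fixed
    (an assumption of this sketch)."""
    e = matmul2(q, edge_matrix(target_neutral_tri))
    x0, y0 = target_neutral_tri[0]
    return [(x0, y0),
            (x0 + e[0][0], y0 + e[1][0]),
            (x0 + e[0][1], y0 + e[1][1])]


# Illustrative data: the source triangle stretches vertically by a factor of 2.
source_neutral = [(0, 0), (1, 0), (0, 1)]
source_expression = [(0, 0), (1, 0), (0, 2)]
target_neutral = [(5, 5), (7, 5), (5, 6)]

q = triangle_deformation(source_neutral, source_expression)
target_expression = transfer_to_target(q, target_neutral)
```

In this toy case the target triangle keeps its own width but its height doubles, which mirrors the source's change in shape rather than the source's shape itself.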

First claim

Opening claim text (preview).

The invention claimed is:

1. A method for facial de-identification in an image or video comprising: obtaining an image of a source face represented as a plurality of source pixels; obtaining an image of a target face represented as a plurality of target pixels; computing the shape deformations of the source face with respect to a neutral face using triangle-based deformation and applying the same triangle-based deformation to the neutral face to deform the target face; and adapting illumination and color characteristics of the source face to the target face by: establishing a plurality of 2D landmarks on the source face and target face; for each of the source face and target face, estimating a plurality of 3D landmarks corresponding to the 2D landmarks and representing a surface over the 3D landmarks as a spherical surface; aligning source pixels to target pixels using triangle warping and calculating the corresponding pixels between the source pixels and target pixels; projecting the source pixels and target pixels onto their respective spherical surfaces and computing, for each source and target pixel, a harmonic basis; calculating a plurality of harmonic coefficients representing an approximation of the intensity of source pixels in each of the RGB color channels; and constructing the pixel intensity for the target pixels as the product of the harmonic basis for the target pixels and the harmonic coefficients calculated with respect to the source pixels.

2. The method of claim 1 wherein the step of computing the shape deformations comprises: computing a mapping between a plurality of triangles modelling said source face between a neutral expression and another facial expression; and applying said mapping to said target face to produce a substantially similar facial expression on said target face.

3. The method of claim 2 wherein said mapping is generated by learning a regression between said neutral expression and said other facial expressions.

4. The method of claim 1 further comprising: encoding a password in a de-identified image; and permitting access to an identified image showing said source face when said password is input by a user.

5. The method of claim 1 comprising a prior step of performing facial feature tracking on said source face until a change in facial expression is detected.

6. A system for providing personalized publicity to a user comprising: a display, for displaying one or more advertisements to said user; a processor, running a program for providing the following functions: obtaining an image of said user; replacing the face of a person appearing in said advertisements with said image of said user; and transferring facial expressions of said person in said advertisements to said image of said user; wherein the functions of replacing and transferring use the method of claim 1.

7. A system for de-identifying a face appearing in a video stream comprising: a source of a stream of video; a video processor running a program which provides the functions of: detecting said face in said stream of video; replacing said face with an image of an anonymous face; and transferring facial expressions of said face to said anonymous face; wherein the functions of replacing and transferring use the method of claim 1.

8. A system for de-identifying faces in surveillance data comprising: one or more cameras providing a stream of surveillance data; and a processor running a program which provides the functions of: detecting a face in said stream of surveillance data; replacing said face with an image of an anonymous face; and transferring facial expressions of said face to said anonymous face; wherein the functions of replacing and transferring use the method of claim 1.

9. A system for enhancing a video game comprising: a video game, rendered locally or online; a processor running in conjunction with said video game, said processor running a program which provides the functions of: obtaining an image of a user of said video game; replacing the face of a person appearing in said video game with said image of said user; and transferring facial expressions of said person in said video game to said image of said user; wherein the functions of replacing and transferring use the method of claim 1.

10. A system for providing communication to users for purposes of long distance learning comprising: a camera, for obtaining a stream of video containing the face of a user; a video processor running a program which provides the functions of: detecting the face of said user in said stream of video data; replacing said face with an image of an anonymous face; and transferring the facial expressions of said face to said anonymous face; and a display, for displaying said stream of video data; wherein the functions of replacing and transferring use the method of claim 1.

11. An apparatus comprising: a processor; and memory containing logic for execution by the processor, the logic configured to: obtain an image of a source face represented as a plurality of source pixels; obtain an image of a target face represented as a plurality of target pixels; compute the shape deformations of the source face with respect to a neutral face using triangle-based deformation and apply the same triangle-based deformation to the neutral face to deform the target face; and adapt illumination and color characteristics of the source face to the target face by: establishing a plurality of 2D landmarks on the source face and target face; for each of the source face and target face, estimating a plurality of 3D landmarks corresponding to the 2D landmarks and representing a surface over the 3D landmarks as a spherical surface; aligning source pixels to target pixels using triangle warping and calculating the corresponding pixels between the source pixels and target pixels; projecting the source pixels and target pixels onto their respective spherical surfaces and computing, for each source and target pixel, a harmonic basis; calculating a plurality of harmonic coefficients representing an approximation of the intensity of source pixels in each of the RGB color channels; and constructing the pixel intensity for the target pixels as the product of the harmonic basis for the target pixels and the harmonic coefficients calculated with respect to the source pixels.

12. The apparatus of claim 11 wherein computing the shape deformations comprises: computing a mapping between a plurality of triangles modelling said source face between a neutral expression and another facial expression; and applying said mapping to said target face to produce a substantially similar facial expression on said target face; wherein said mapping is generated by learning a regression between said neutral expression and said other facial expressions.

13. The apparatus of claim 11 wherein the logic is further configured to: encode a password in a de-identified image; and permit access to an identified image showing said source face when said password is input by a user.
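The illumination and color adaptation recited in claims 1 and 11 can be illustrated with a small sketch: pixels are projected onto a sphere, a harmonic basis is evaluated at each projected point, coefficients are fitted to the source intensities, and target intensities are rebuilt as the product of the target basis and those coefficients. This is only an illustration under stated assumptions, not the patented implementation. The claims do not state the order of the harmonic basis, so a first-order basis `[1, nx, ny, nz]` with constant factors dropped is assumed here; the least-squares fit via normal equations, the function names, and the synthetic data are likewise hypothetical. The 2D/3D landmark estimation and the triangle-warping correspondence step are omitted.

```python
import math


def sphere_normal(x, y, cx, cy, r):
    """Project pixel (x, y) onto a sphere of radius r centred at (cx, cy)
    and return the unit surface normal at that point."""
    nx, ny = (x - cx) / r, (y - cy) / r
    nz2 = 1.0 - nx * nx - ny * ny
    if nz2 < 0.0:
        raise ValueError("pixel lies outside the sphere")
    return (nx, ny, math.sqrt(nz2))


def harmonic_basis(n):
    """First-order harmonic basis (assumed order, constants dropped)."""
    nx, ny, nz = n
    return [1.0, nx, ny, nz]


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system; need more varied pixels")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def fit_coefficients(bases, intensities):
    """Least-squares coefficients for one colour channel (normal equations)."""
    k = len(bases[0])
    ata = [[sum(b[i] * b[j] for b in bases) for j in range(k)] for i in range(k)]
    atb = [sum(b[i] * v for b, v in zip(bases, intensities)) for i in range(k)]
    return solve(ata, atb)


def relight(target_bases, coeffs):
    """Target intensity = target harmonic basis times source coefficients."""
    return [sum(bi * ci for bi, ci in zip(b, coeffs)) for b in target_bases]


# Synthetic single-channel example; repeat the fit for each of R, G and B.
source_pixels = [(0, 0), (5, 0), (0, 5), (-5, 0), (0, -5), (3, 4)]
source_bases = [harmonic_basis(sphere_normal(x, y, 0, 0, 10))
                for x, y in source_pixels]
true_coeffs = [0.5, 0.1, -0.2, 0.3]                 # made-up lighting
source_red = relight(source_bases, true_coeffs)     # synthetic intensities

coeffs = fit_coefficients(source_bases, source_red)
target_bases = [harmonic_basis(sphere_normal(0, 0, 0, 0, 8))]
target_red = relight(target_bases, coeffs)
```

Because the coefficients are fitted on the source and evaluated on the target's own basis, the target pixels take on the source's lighting while keeping their own geometry on the sphere.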

Assignees

Univ Carnegie Mellon

Inventors

Classifications

  • Output arrangements for video game devices

  • A63F13/655 (Primary): by importing photos, e.g. of the player

  • Morphological image processing

  • Morphing

  • user representation in the game field, e.g. avatar

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9799096B1 cover?
A system and method for real-time image and video face de-identification that removes the identity of the subject while preserving the facial behavior is described. The facial features of the source face are replaced with those of the target face while preserving the facial actions of the source face on the target face. The facial actions of the source face are transferred to the target face usin…
Who is the assignee on this patent?
Univ Carnegie Mellon
What technology area does this patent fall under?
Primary CPC classification A63F13/655. Mapped technology areas include Human Necessities.
When was this patent published?
Publication date Oct 24, 2017 (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).