Automated facial action coding system

US9530048B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9530048-B2
Application numberUS-201414311814-A
CountryUS
Kind codeB2
Filing dateJun 23, 2014
Priority dateAug 26, 2008
Publication dateDec 27, 2016
Grant dateDec 27, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An automatic facial action coding system and method can include processing an image to identify a face in the image, to detect and align one or more facial features shown in the image, and to define one or more windows on the image. One or more distributions of pixels and color intensities can be quantified in each of the one or more windows to derive one or more two-dimensional intensity distributions of one or more colors within the window. The one or more two-dimensional intensity distributions can be processed to select image features appearing in the one or more windows and to classify one or more predefined facial actions on the face in the image. A facial action code score that includes a value indicating a relative amount of the predefined facial action occurring in the face in the image can be determined for the face in the image for each of the one or more predefined facial actions.

First claim

Opening claim text (preview).

What is claimed: 1. A method comprising: processing a plurality of image windows located on a face in an image to calculate, for each image window of the plurality, a quantity that is informative of an occurrence of a facial action unit, the quantity comprising a first relative likelihood of the occurrence of the facial action unit, and the processing comprising estimating, for each of the plurality of image windows, the first relative likelihood for the image window, the estimating comprising applying a plurality of image filters to each image window to produce characteristics representing the image window, and processing the characteristics to generate the first relative likelihood for the image window; and combining the quantities calculated for the plurality of windows to estimate a relative likelihood of the facial action unit occurring in the face; wherein the processing and the combining are implemented on computer hardware. 2. The method as in claim 1 , wherein the estimating of the plurality of first relative likelihoods comprises use of a machine-learned technique. 3. The method as in claim 1 , wherein the applying of the plurality of image filters to the image window produces image-based representations of the image window. 4. The method as in claim 1 , wherein the processing of the characteristics comprises use of a machine-learned technique. 5. The method as in claim 4 , wherein the machine-learned technique has been trained on other images that include other faces. 6. The method as in claim 4 , wherein the machine-learned technique processes a subset of the characteristics to produce the first relative likelihood for the image window. 7. The method as in claim 1 , wherein the processing of the plurality of image windows further comprises: for each image window of the plurality, also estimating an additional first relative likelihood of an additional facial action unit; and wherein the method further comprises: also combining the plurality of additional first relative likelihoods to estimate an additional relative likelihood of the additional facial action unit occurring in the face. 8. The method as in claim 7 , wherein the also estimating and the also combining for the additional facial action unit are performed in parallel with the estimating and combining for the facial action unit. 9. The method as in claim 7 , wherein the facial action unit and the additional facial action unit are defined by the Facial Action Coding System (FACS). 10. The method as in claim 7 , further comprising: producing a score vector, the score vector comprising the relative likelihood of the facial action unit and the additional relative likelihood of the additional facial action unit. 11. The method as in claim 7 , further comprising: classifying expressions of emotion based on the relative likelihood of the facial action unit and the additional relative likelihood of the additional facial action unit. 12. The method as in claim 1 , wherein the image comprises an image from a video stream, and wherein the method further comprises: also processing an additional plurality of image windows located on the face in a second image from the video stream to calculate, for each additional image window of the additional plurality, an additional quantity that is informative of the occurrence of the facial action unit; also estimating an additional plurality of first relative likelihoods of the facial action unit occurring in each of the additional plurality of image windows; and also combining the additional plurality of first relative likelihoods to estimate an additional second relative likelihood of the facial action unit occurring in the face. 13. The method as in claim 1 , wherein the plurality of image filters comprises at least one of motion detectors and spatio-temporal filters. 14. The method as in claim 1 , wherein the plurality of image windows comprise image windows of different sizes and located to include different locations on the face. 15. The method as in claim 1 , wherein the plurality of image windows is defined relative to a predefined size and orientation of the face in the image. 16. A non-transitory machine readable medium storing instructions that, when executed by at least one processor, cause the at least one processor to perform operations comprising: processing a plurality of image windows located on a face in an image to calculate, for each image window of the plurality, a quantity that is informative of an occurrence of a facial action unit, the quantity comprising a first relative likelihood of the occurrence of the facial action unit, and the processing comprising estimating, for each of the plurality of image windows, the first relative likelihood for the image window, the estimating comprising applying a plurality of image filters to each image window to produce characteristics representing the image window, and processing the characteristics to generate the first relative likelihood for the image window; and combining the quantities calculated for the plurality of windows to estimate a relative likelihood of the facial action unit occurring in the face. 17. A system comprising: computer hardware configured to perform operations comprising: processing a plurality of image windows located on a face in an image to calculate, for each image window of the plurality, a quantity that is informative of an occurrence of a facial action unit, the quantity comprising a first relative likelihood of the occurrence of the facial action unit, and the processing comprising estimating, for each of the plurality of image windows, the first relative likelihood for the image window, the estimating comprising applying a plurality of image filters to each image window to produce characteristics representing the image window, and processing the characteristics to generate the first relative likelihood for the image window; and combining the quantities calculated for the plurality of windows to estimate a relative likelihood of the facial action unit occurring in the face.

Assignees

Inventors

Classifications

  • Movements or behaviour, e.g. gesture recognition (recognition of facial expressions G06V40/16) · CPC title

  • G06V10/56Primary

    relating to colour · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Physics · mapped topic

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9530048B2 cover?
An automatic facial action coding system and method can include processing an image to identify a face in the image, to detect and align one or more facial features shown in the image, and to define one or more windows on the image. One or more distributions of pixels and color intensities can be quantified in each of the one or more windows to derive one or more two-dimensional intensity distr…
Who is the assignee on this patent?
Univ California, The Res Found Of State Univ Of New York
What technology area does this patent fall under?
Primary CPC classification G06V10/56. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 27 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).