Apparatus and method for video sensor-based human activity and facial expression modeling and recognition

US9489568B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9489568-B2
Application numberUS-201414307342-A
CountryUS
Kind codeB2
Filing dateJun 17, 2014
Priority dateJun 5, 2009
Publication dateNov 8, 2016
Grant dateNov 8, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus and method for human activity and facial expression modeling and recognition are based on feature extraction techniques from time sequential images. The human activity modeling includes determining principal components of depth and/or binary shape images of human activities extracted from video clips. Independent Component Analysis (ICA) representations are determined based on the principal components. Features are determined through Linear Discriminant Analysis (LDA) based on the ICA representations. A codebook is determined using vector quantization, Observation symbol sequences in the video clips am determined. And human activities are learned using the Hidden Markov Model (HMM) based on status transition and an observation matrix.

First claim

Opening claim text (preview).

What is claimed is: 1. A human activity recognition method comprising: determining, by a controller, principal components of depth images and binary shape images of human activities extracted from a video clip; performing, by the controller, a human activity modeling based on the principal components; and determining, by the controller, a human activity model which maximizes a likelihood probability among activity models in a dictionary according to an output value of the human activity modeling, wherein the depth images of the human activities are different images than the binary shape images of the human activities, and the video clip comprises successive images that are based on a video sensor. 2. The human activity recognition method of claim 1 , wherein determining the principal components of depth images and binary shape images of human activities extracted from the video clip comprises: determining at least one principal component representing an entire image based on a human body representation of the depth images and binary shape images. 3. The human activity recognition method of claim 1 , wherein the video clip is represented using a Hidden Markov Model (HMM) based on a status transition and a symbol observation matrix. 4. The human activity recognition method of claim 1 , wherein determining the human activity model which maximizes the likelihood probability among the activity models in the dictionary according to an output value of the human activity modeling comprises: determining the human activity model which maximizes the likelihood probability from a human activity HMM dictionary. 5. The human activity recognition method of claim 1 , wherein performing the human activity modeling based on the principal components comprises; determining Independent Component Analysis (ICA) representations based on the principal components; determining features through Linear Discriminant Analysis (LDA) based on the ICA representations; determining a codebook using vector quantization; determining observation symbol sequences in the video clip; and identifying human activities using a HMM based on a status transition and an observation matrix. 6. The human activity recognition method of claim 5 , wherein determining ICA representations based on the principal components comprises: determining independent ICA representations in the principal components. 7. The human activity recognition method of claim 5 , wherein determining features through LDA based on the ICA representations comprises: emphasizing features of the ICA representations to distinguish the ICA representations in the depth images and binary shape images. 8. The human activity recognition method of claim 5 , wherein determining the codebook using vector quantization comprises: classifying the features determined through the LDA into at least one group using the vector quantization; and assigning a code to the at least one classified group. 9. The human activity recognition method of claim 5 , wherein determining observation symbol sequences in the video clip comprises: determining codes of the video clip and a code arrangement order using codes of the codebook. 10. A human activity recognition apparatus comprising: an input interface configured to receive a video clip; a controller configured to determine principal components of depth images and binary shape images of human activities extracted from the video clip, and for performing a human activity modeling based on the principal components, and determine a human activity model which maximizes a likelihood probability among activity models in a dictionary according to an output value of the human activity modeling; and an output interface configured to output the human activity model, wherein the depth images of the human activities are different images than the binary shape images of the human activities, and the video clip comprises successive images that are based on a video sensor. 11. The human activity recognition apparatus of claim 10 , wherein the controller is configured to determine the principal components of the depth images and binary shape images of the human activities extracted from the video clip by determining at least one principal component representing an entire image based on a human body representation of the depth images and binary shape images. 12. The human activity recognition apparatus of claim 10 , wherein the video clip is represented using a HMM based on the status transition and a symbol observation matrix. 13. The human activity recognition apparatus of claim 10 , wherein the controller is configured to determine the human activity model which maximizes the likelihood probability from a human activity HMM dictionary. 14. The human activity recognition apparatus of claim 10 , wherein the controller is configured to determine ICA representations based on the principal components, determine features through LDA based on the ICA representations, determine a codebook using vector quantization, determine observation symbol sequences in the video clip, and identify human activities using a HMM based on a status transition and an observation matrix.

Assignees

Inventors

Classifications

  • Markov-related models; Markov random fields · CPC title

  • Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods · CPC title

  • Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries · CPC title

  • Markov models or related models, e.g. semi-Markov models; Markov random fields; Networks embedding Markov models · CPC title

  • G06V40/176Primary

    Dynamic expression · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9489568B2 cover?
An apparatus and method for human activity and facial expression modeling and recognition are based on feature extraction techniques from time sequential images. The human activity modeling includes determining principal components of depth and/or binary shape images of human activities extracted from video clips. Independent Component Analysis (ICA) representations are determined based on the …
Who is the assignee on this patent?
Samsung Electronics Co Ltd, Univ-Industry Coop Group Of Kyung Hee Univ
What technology area does this patent fall under?
Primary CPC classification G06V40/176. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 08 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).