Authenticating a user by correlating speech and corresponding lip shape

US9754193B2 · US · B2

Patent metadata
Publication number: US-9754193-B2
Application number: US-201314892011-A
Country: US
Kind code: B2
Filing date: Jun 27, 2013
Priority date: Jun 27, 2013
Publication date: Sep 5, 2017
Grant date: Sep 5, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided is a method of authenticating a user by correlating speech and corresponding lip shape. An audiovisual of a user requesting authentication is captured. The audiovisual is processed to generate a speech vector quantization sequence and a corresponding lip vector quantization sequence of the user. A likelihood of the speech vector quantization sequence and the corresponding lip vector quantization sequence with probability distributions of speech vector quantization code words corresponding to different lip shape vector quantization code words of the user requesting authentication weighed by probabilities of speech and lip vector quantization indices of the user requesting authentication is evaluated. If upon evaluation, a likelihood of the user requesting authentication being an authentic user is more than a predefined threshold, the user is authenticated.
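
The abstract's "vector quantization sequence" step can be illustrated with a minimal sketch. It assumes nearest-neighbour quantization under Euclidean distance, which the abstract does not specify; the codebooks, feature dimensions, and the `quantize` helper below are hypothetical toy values, not taken from the patent.

```python
import math

def quantize(frames, codebook):
    """Map each feature vector to the index of its nearest code word.

    Euclidean distance is an assumption; the abstract does not name a metric.
    """
    indices = []
    for frame in frames:
        nearest = min(range(len(codebook)),
                      key=lambda k: math.dist(frame, codebook[k]))
        indices.append(nearest)
    return indices

# Hypothetical toy codebooks: 2-D speech features, 1-D lip-shape features.
speech_codebook = [(0.0, 0.0), (1.0, 1.0), (2.0, 0.0)]
lip_codebook = [(0.0,), (1.0,)]

# Hypothetical captured features: three speech frames per lip frame.
speech_frames = [(0.1, -0.1), (0.9, 1.2), (1.8, 0.1),
                 (0.2, 0.1), (1.1, 0.9), (2.1, -0.2)]
lip_frames = [(0.1,), (0.8,)]

speech_vq = quantize(speech_frames, speech_codebook)  # speech VQ sequence
lip_vq = quantize(lip_frames, lip_codebook)           # lip VQ sequence
print(speech_vq, lip_vq)
```

Each sequence is just a list of codebook indices, which is what the likelihood evaluation described in the abstract operates on.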

First claim

Opening claim text (preview).

The invention claimed is:

1. A method of authentication by correlating speech and corresponding lip shape of a user, comprising: capturing an audiovisual of a user requesting authentication; processing the audiovisual to generate a speech vector quantization sequence and a corresponding lip vector quantization sequence of the user; evaluating a likelihood of the speech vector quantization sequence and the corresponding lip vector quantization sequence to be from an authentic user based on co-occurrence probability distributions of consecutive speech vector quantization code words within a lip frame in the lip vector quantization sequence at a particular time, and weighted by probability distribution of vector quantization indices of speech and lip features corresponding to the user as indexed in a codebook; and authenticating the user, if upon evaluation, a likelihood of the user requesting authentication being the authentic user is more than a predefined threshold.

2. The method of claim 1, wherein the likelihood of the user requesting authentication being the authentic user is determined in accordance with the following equation:

\[ \mathrm{Like}_N = \prod_{t=1}^{T} \prod_{i=1}^{3} P(S_{t \ldots i} / L_t)_N \, P(S_{t \ldots i} / N) \, P(L_i / N) \]

where, “N” is the user requesting authentication, “t” indexes time of the lip vector quantization sequence and “i” indexes three consecutive speech vector quantization code words within each lip frame in the lip vector quantization sequence, P(S_i/L_i)_N represents co-occurrence probability distribution, and P(S_i/N), and P(L_i/N) represent probability distributions of vector quantization indices of speech and lip features correspondingly, of the user requesting authentication ‘N’.

3. The method of claim 1, further comprising generating the probability distributions of speech vector quantization code words corresponding to different lip shapes of the authentic user.

4. The method of claim 3, wherein generating the probability distributions of speech vector quantization code words corresponding to different lip shapes of the authentic user comprises: capturing an audiovisual of the authentic user; processing the audiovisual to generate a speech vector quantization sequence and a corresponding lip vector quantization sequence of the authentic user; and determining probability distributions of the speech vector quantization code words corresponding to different lip shapes of the authentic user.

5. The method of claim 1, wherein the speech vector quantization sequence and the corresponding lip vector quantization sequence are compared using a different sampling interval.

6. The method of claim 1, further comprising generating a universal vector quantization codebook of speech signals for a plurality of users including the authentic user.

7. The method of claim 1, further comprising generating a universal vector quantization codebook of lip shapes for a plurality of users including the authentic user.

8. The method of claim 1, further comprising generating the probability distributions of speech vector quantization code words and the probability distributions of lip shape vector quantization code words for each enrolled user.

9. A system, comprising: a processor; and a memory coupled to the processor, wherein the memory includes an authentication module to: process an audiovisual of a user requesting authentication to generate a speech vector quantization sequence and a corresponding lip vector quantization sequence of the user; evaluate a likelihood of the speech vector quantization sequence and the corresponding lip vector quantization sequence to be from an authentic user based on co-occurrence probability distributions of consecutive speech vector quantization code words within a lip frame in the lip vector quantization sequence at a particular time, and weighted by probability distribution of vector quantization indices of speech and lip features corresponding to the user as indexed in a codebook; and authenticate the user, if upon evaluation, a likelihood of the user requesting authentication being the authentic user is more than a predefined threshold.
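
The enrollment step (claims 3 and 4) and the likelihood test (claims 1 and 2) can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the add-alpha smoothing, the use of log-probabilities instead of a raw product, the function names, the toy sequences, and the threshold value are all hypothetical. The one detail taken from claim 2 is that three consecutive speech code words are scored against each lip frame, with the speech and lip prior terms multiplied in for each of the three.

```python
import math
from collections import Counter

SPEECH_PER_LIP = 3  # claim 2: three speech code words per lip frame

def enroll(speech_vq, lip_vq, n_speech, n_lip, alpha=1.0):
    """Estimate P(S|L), P(S) and P(L) for one user from VQ index sequences.

    Add-alpha smoothing is an assumption made so unseen pairs keep a
    non-zero probability; the claims do not describe smoothing.
    """
    if len(speech_vq) != SPEECH_PER_LIP * len(lip_vq):
        raise ValueError("expected three speech indices per lip frame")
    joint, s_count, l_count = Counter(), Counter(), Counter()
    for t, lip in enumerate(lip_vq):
        l_count[lip] += 1
        for s in speech_vq[SPEECH_PER_LIP * t:SPEECH_PER_LIP * (t + 1)]:
            joint[(s, lip)] += 1
            s_count[s] += 1
    p_s_given_l = {
        (s, l): (joint[(s, l)] + alpha)
                / (SPEECH_PER_LIP * l_count[l] + alpha * n_speech)
        for s in range(n_speech) for l in range(n_lip)
    }
    p_s = {s: (s_count[s] + alpha) / (len(speech_vq) + alpha * n_speech)
           for s in range(n_speech)}
    p_l = {l: (l_count[l] + alpha) / (len(lip_vq) + alpha * n_lip)
           for l in range(n_lip)}
    return p_s_given_l, p_s, p_l

def log_likelihood(speech_vq, lip_vq, model):
    """Log of the claim 2 product: sum over lip frames t and i = 1..3."""
    p_s_given_l, p_s, p_l = model
    total = 0.0
    for t, lip in enumerate(lip_vq):
        for s in speech_vq[SPEECH_PER_LIP * t:SPEECH_PER_LIP * (t + 1)]:
            total += (math.log(p_s_given_l[(s, lip)])
                      + math.log(p_s[s]) + math.log(p_l[lip]))
    return total

def authenticate(speech_vq, lip_vq, model, threshold):
    """Accept only if the (log) likelihood exceeds the threshold."""
    return log_likelihood(speech_vq, lip_vq, model) > threshold

# Hypothetical enrollment data: 4 lip frames, 12 speech indices.
model = enroll([0, 0, 1, 2, 2, 1, 0, 1, 0, 2, 1, 2], [0, 1, 0, 1],
               n_speech=3, n_lip=2)
THRESHOLD = -20.0  # hypothetical log-domain threshold

genuine_score = log_likelihood([0, 0, 1, 2, 2, 1], [0, 1], model)
impostor_score = log_likelihood([2, 2, 2, 0, 0, 0], [0, 1], model)
print(authenticate([0, 0, 1, 2, 2, 1], [0, 1], model, THRESHOLD))  # True
print(authenticate([2, 2, 2, 0, 0, 0], [0, 1], model, THRESHOLD))  # False
```

Working in the log domain avoids numerical underflow when the product in claim 2 runs over many lip frames; comparing a log-likelihood against a log-domain threshold is equivalent to comparing the raw product against the corresponding raw threshold.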

Assignees

Hewlett Packard Development Co Lp

Inventors

Ramachandrula Sitaram, Ravishankar Hariharan

Classifications

  • G10L17/10 · Primary

    Multimodal systems, i.e. based on the integration of multiple recognition engines or fusion of expert systems · CPC title

  • of extracted features · CPC title

  • of results relating to different input data, e.g. multimodal recognition · CPC title

  • of extracted features · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9754193B2 cover?
Provided is a method of authenticating a user by correlating speech and corresponding lip shape. An audiovisual of a user requesting authentication is captured. The audiovisual is processed to generate a speech vector quantization sequence and a corresponding lip vector quantization sequence of the user. A likelihood of the speech vector quantization sequence and the corresponding lip vector qu…
Who is the assignee on this patent?
Hewlett Packard Development Co Lp, Ramachandrula Sitaram, Ravishankar Hariharan
What technology area does this patent fall under?
Primary CPC classification G10L17/10. Mapped technology areas include Physics.
When was this patent published?
Publication date Sep 5, 2017 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).