System and method for generating challenge utterances for speaker verification

US10121476B2 · US · B2

Patent metadata
Publication number: US-10121476-B2
Application number: US-201615075333-A
Country: US
Kind code: B2
Filing date: Mar 21, 2016
Priority date: Nov 24, 2010
Publication date: Nov 6, 2018
Grant date: Nov 6, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are systems, methods, and non-transitory computer-readable storage media relating to speaker verification. In one aspect, a system receives a first user identity from a second user, and, based on the identity, accesses voice characteristics. The system randomly generates a challenge sentence according to a rule and/or grammar, based on the voice characteristics, and prompts the second user to speak the challenge sentence. The system verifies that the second user is the first user if the spoken challenge sentence matches the voice characteristics. In an enrollment aspect, the system constructs an enrollment phrase that covers a minimum threshold of unique speech sounds based on speaker-distinctive phonemes, phoneme clusters, and prosody. The user then utters the enrollment phrase, and the system extracts voice characteristics for the user from the uttered enrollment phrase. The system generates a user profile, based on the voice characteristics, for generating random challenge sentences according to a grammar.
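
The enrollment aspect in the abstract (a phrase that covers a minimum threshold of unique speech sounds) can be illustrated with a short sketch. The abstract does not say how the phrase is constructed, so the greedy selection below is only one possible approach. The toy phoneme annotations and the names `CANDIDATES` and `build_enrollment_phrase` are hypothetical and introduced purely for illustration.

```python
# Hypothetical candidate sentences mapped to the phonemes they contain
# (ARPAbet-style symbols; a real system would derive these from a lexicon).
CANDIDATES = {
    "she sells sea shells": {"SH", "IY", "S", "EH", "L", "Z"},
    "the quick brown fox": {"DH", "AH", "K", "W", "IH", "B", "R", "AW", "N", "F", "AA", "S"},
    "pay the toll now": {"P", "EY", "DH", "AH", "T", "OW", "L", "N", "AW"},
    "see the sea": {"S", "IY", "DH", "AH"},
}


def build_enrollment_phrase(candidates, min_unique):
    """Greedily pick sentences until at least min_unique distinct phonemes are covered."""
    covered = set()
    chosen = []
    remaining = dict(candidates)
    while len(covered) < min_unique and remaining:
        # Pick the sentence that adds the most not-yet-covered phonemes.
        # Iterating in sorted order makes tie-breaking deterministic.
        best = max(sorted(remaining), key=lambda s: len(remaining[s] - covered))
        gain = remaining.pop(best) - covered
        if not gain:
            break  # no remaining sentence adds coverage
        covered |= gain
        chosen.append(best)
    if len(covered) < min_unique:
        raise ValueError("candidates cannot reach the requested coverage")
    return chosen, covered


sentences, covered = build_enrollment_phrase(CANDIDATES, min_unique=15)
print(sentences, len(covered))
```

Greedy selection keeps the enrollment phrase short, but it does not guarantee the shortest possible phrase. It also treats all phonemes as equally useful, whereas the abstract bases coverage on speaker-distinctive phonemes, phoneme clusters, and prosody.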

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising: collecting user speech uttered while a user is using a speech service associated with a multi-platform automatic speech recognition engine to yield collected user speech; identifying speech of interest from the collected user speech, wherein the speech of interest has high speaker discriminatory ability; identifying, from the speech of interest, voice characteristics for the user, the voice characteristics comprising one or more of speaker-distinctive phonemes, phoneme clusters, and prosody, to yield identified voice characteristics; creating a user profile for the user by flagging the identified voice characteristics in the user profile as important for generating challenge sentences to yield flagged voice characteristics; generating, by a system comprising a processor and based on the flagged voice characteristics in the user profile, a random challenge sentence according to a challenge grammar; and prompting the user to utter the random challenge sentence to verify an identity of the user.

2. The method of claim 1, further comprising: collecting additional user speech; identifying additional voice characteristics from the additional user speech; and updating the user profile based on the additional voice characteristics.

3. The method of claim 1, wherein the voice characteristics are associated with different intonation patterns.

4. The method of claim 1, wherein the random challenge sentence is generated according to one of a plurality of sentence frames.

5. The method of claim 1, further comprising adding, by the system, personal information associated with the user to the user profile.

6. The method of claim 1, further comprising adding, by the system, personal contacts associated with the user to the user profile.

7. A non-transitory computer-readable storage medium storing instructions which, responsive to being executed by a processor, cause the processor to perform operations comprising: collecting user speech uttered while a user is using a speech service associated with a multi-platform automatic speech recognition engine to yield collected user speech; identifying speech of interest from the collected user speech, wherein the speech of interest has high speaker discriminatory ability; identifying, from the speech of interest, voice characteristics for the user, the voice characteristics comprising one or more of speaker-distinctive phonemes, phoneme clusters, and prosody, to yield identified voice characteristics; creating a user profile for the user by flagging the identified voice characteristics in the user profile as important for generating challenge sentences to yield flagged voice characteristics; generating, based on the flagged voice characteristics of interest in the user profile, a random challenge sentence according to a challenge grammar; and prompting the user to utter the random challenge sentence to verify an identity of the user.

8. The non-transitory computer-readable storage medium of claim 7, wherein the voice characteristics are associated with different intonation patterns.

9. The non-transitory computer-readable storage medium of claim 7, wherein the random challenge sentence is generated according to one of a plurality of sentence frames.

10. A system comprising: a processor; and a computer-readable storage medium having instructions stored which, responsive to being executed by the processor, result in the processor performing operations comprising: collecting user speech uttered while a user is using a speech service associated with a multi-platform automatic speech recognition engine to yield collected user speech; identifying speech of interest from the collected user speech, wherein the speech of interest has high speaker discriminatory ability; identifying, from the speech of interest, voice characteristics for the user, the voice characteristics comprising one or more of speaker-distinctive phonemes, phoneme clusters, and prosody, to yield identified voice characteristics; creating a user profile for the user by flagging the identified voice characteristics in the user profile as important for generating challenge sentences to yield flagged voice characteristics; generating, based on the flagged voice characteristics in the user profile, a random challenge sentence according to a challenge grammar; and prompting the user to utter the random challenge sentence to verify an identity of the user.

11. The system of claim 10, wherein the operations further comprise: collecting additional user speech; identifying additional voice characteristics from the additional user speech; and updating the user profile based on the additional voice characteristics.

12. The system of claim 10, wherein the voice characteristics are associated with different intonation patterns.

13. The system of claim 10, wherein the random challenge sentence is generated according to one of a plurality of sentence frames.

14. The system of claim 10, wherein the operations further comprise adding personal information associated with the user to the user profile.

15. The system of claim 10, wherein the operations further comprise adding personal contacts associated with the user to the user profile.

16. The system of claim 10, wherein the generating of the random challenge sentence comprises: identifying, using the voice characteristics, a plurality of more valuable phonemes determined to be more valuable than second phonemes for verifying an asserted identity; and generating the random challenge sentence, based on the voice characteristics, wherein the random challenge sentence comprises the plurality of more valuable phonemes.
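
Claims 1, 4, and 16 together describe generating a random challenge sentence from a sentence frame so that it contains the phonemes flagged as most useful for verification. A minimal sketch of that idea follows. The claims do not prescribe a particular grammar, lexicon, or acceptance rule, so the single frame, the phoneme-annotated word lists, the rejection-sampling loop, and every name in the code are hypothetical.

```python
import random

# Hypothetical sentence frame: an ordered list of slots, each filled from a word class.
FRAME = ["name", "verb", "number", "noun"]

# Hypothetical lexicon: word class -> {word: phonemes contained in that word}.
LEXICON = {
    "name": {"Sheila": {"SH", "IY", "L", "AH"}, "Tom": {"T", "AA", "M"}},
    "verb": {"chooses": {"CH", "UW", "Z", "IH"}, "buys": {"B", "AY", "Z"}},
    "number": {"three": {"TH", "R", "IY"}, "nine": {"N", "AY"}},
    "noun": {"shoes": {"SH", "UW", "Z"}, "pens": {"P", "EH", "N", "Z"}},
}


def generate_challenge(flagged, rng, min_flagged=2, max_tries=100):
    """Randomly fill the frame and return the first sentence that
    contains at least min_flagged of the flagged phonemes."""
    for _ in range(max_tries):
        words = [rng.choice(sorted(LEXICON[slot])) for slot in FRAME]
        phonemes = set().union(*(LEXICON[slot][w] for slot, w in zip(FRAME, words)))
        if len(phonemes & flagged) >= min_flagged:
            return " ".join(words)
    raise RuntimeError("no sentence met the flagged-phoneme requirement")


# Example: a profile in which SH, TH, and CH are flagged as speaker-distinctive.
profile_flags = {"SH", "TH", "CH"}
print(generate_challenge(profile_flags, random.Random()))
```

Because the sentence is drawn at random on every call, a recording of an earlier challenge is unlikely to match a later one, while the phoneme requirement keeps each challenge informative about the claimed speaker. A production system would need a far larger grammar and a cryptographically secure random source, neither of which is shown here.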

Assignees

Nuance Communications Inc

Classifications

  • Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices

  • Training, enrolment or model building

  • Phonemes, fenemes or fenones being the recognition units

  • Speech classification or search

  • Speaker identification or verification techniques

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10121476B2 cover?
Disclosed herein are systems, methods, and non-transitory computer-readable storage media relating to speaker verification. In one aspect, a system receives a first user identity from a second user, and, based on the identity, accesses voice characteristics. The system randomly generates a challenge sentence according to a rule and/or grammar, based on the voice characteristics, and prompts the…
Who is the assignee on this patent?
Nuance Communications Inc
What technology area does this patent fall under?
Primary CPC classification G10L17/24. Mapped technology areas include Physics.
When was this patent published?
Publication date Nov 6, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).