System and method for detecting synthetic speaker verification

US9412382B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9412382-B2
Application numberUS-201514859449-A
CountryUS
Kind codeB2
Filing dateSep 21, 2015
Priority dateApr 11, 2008
Publication dateAug 9, 2016
Grant dateAug 9, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are systems, methods, and tangible computer readable-media for detecting synthetic speaker verification. The method comprises receiving a plurality of speech samples of the same word or phrase for verification, comparing each of the plurality of speech samples to each other, denying verification if the plurality of speech samples demonstrate little variance over time or are the same, and verifying the plurality of speech samples if the plurality of speech samples demonstrates sufficient variance over time. One embodiment further adds that each of the plurality of speech samples is collected at different times or in different contexts. In other embodiments, variance is based on a pre-determined threshold or the threshold for variance is adjusted based on a need for authentication certainty. In another embodiment, if the initial comparison is inconclusive, additional speech samples are received.

First claim

Opening claim text (preview).

I claim: 1. A method comprising: receiving, from a user of a speech verification system, a plurality of speech samples of a same word, wherein the plurality of speech samples of the same word comprise: a current speech response; and a plurality of previously recorded speech samples, wherein each of the previously recorded speech samples is associated with a distinct location; generating, via a processor, a sample similarity from the plurality of speech samples; making a decision, via the processor, whether to enroll the user in the speech verification system according to a comparison of the sample similarity with a threshold, wherein the decision is a first decision if the sample similarity is above the threshold and the decision is a second decision different from and mutually exclusive with the first decision if the sample similarity is below the threshold; and enrolling the user in the speech verification system responsively to the decision's being the second decision. 2. The method of claim 1 , wherein the sample similarity has a range which varies based on a job title of the user. 3. The method of claim 1 , further comprising: verifying speech received from the user as authentic using the speech verification system. 4. The method of claim 1 , wherein the speech verification system provides access to a restricted location. 5. The method of claim 1 , wherein the speech verification system unlocks a cellphone. 6. The method of claim 1 , wherein each of the plurality of speech samples of the same word is collected in a distinct context. 7. The method of claim 1 , further comprising prompting the user to say the same word as part of a user authentication using the speech verification system, to yield the current speech response. 8. A system comprising: a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, result in the processor performing operations comprising: receiving, from a user of a speech verification system, a plurality of speech samples of a same word, wherein the plurality of speech samples of the same word comprise: a current speech response; and a plurality of previously recorded speech samples, wherein each of the previously recorded speech samples is associated with a distinct location; generating a sample similarity from the plurality of speech samples; making a decision whether to enroll the user in the speech verification system according to a comparison of the sample similarity with a threshold, wherein the decision is a first decision if the sample similarity is above the threshold and the decision is a second decision different from and mutually exclusive with the first decision if the sample similarity is below the threshold; and enrolling the user in the speech verification system responsively to the decision's being the second decision. 9. The system of claim 8 , wherein the sample similarity has a range which varies based on a job title of the user. 10. The system of claim 8 , the computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising: verifying speech received from the user as authentic using the speech verification system. 11. The system of claim 8 , wherein the speech verification system provides access to a restricted location. 12. The system of claim 8 , wherein the speech verification system unlocks a cellphone. 13. The system of claim 8 , wherein each of the plurality of speech samples of the same word is collected in a distinct context. 14. The system of claim 8 , the computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising prompting the user to say the same word as part of a user authentication using the speech verification system, to yield the current speech response. 15. A computer-readable storage device having instructions stored which, when executed by a computing device, result in the computing device performing operations comprising: receiving, from a user of a speech verification system, a plurality of speech samples of a same word, wherein the plurality of speech samples of the same word comprise: a current speech response; and a plurality of previously recorded speech samples, wherein each of the previously recorded speech samples is associated with a distinct location; generating a sample similarity from the plurality of speech samples; making a decision whether to enroll the user in the speech verification system according to a comparison of the sample similarity with a threshold, wherein the decision is a first decision if the sample similarity is above the threshold and the decision is a second decision different from and mutually exclusive with the first decision if the sample similarity is below the threshold; and enrolling the user in the speech verification system responsively to the decision's being the second decision. 16. The computer-readable storage device of claim 15 , wherein the sample similarity has a range which varies based on a job title of the user. 17. The computer-readable storage device of claim 15 , having instructions stored which, when executed by the computing device, cause the computing device to perform operations comprising: verifying speech received from the user as authentic using the speech verification system. 18. The computer-readable storage device of claim 15 , wherein the speech verification system provides access to a restricted location. 19. The computer-readable storage device of claim 15 , wherein the speech verification system unlocks a cellphone. 20. The computer-readable storage device of claim 15 , wherein each of the plurality of speech samples of the same word is collected in a distinct context.

Assignees

Inventors

Classifications

  • G10L17/20Primary

    Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions · CPC title

  • Physics · mapped topic

  • G10L17/24Primary

    the user being prompted to utter a password or a predefined phrase · CPC title

  • Speaker identification or verification techniques · CPC title

  • Training, enrolment or model building · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9412382B2 cover?
Disclosed herein are systems, methods, and tangible computer readable-media for detecting synthetic speaker verification. The method comprises receiving a plurality of speech samples of the same word or phrase for verification, comparing each of the plurality of speech samples to each other, denying verification if the plurality of speech samples demonstrate little variance over time or are the…
Who is the assignee on this patent?
At & T Ip I Lp
What technology area does this patent fall under?
Primary CPC classification G10L17/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 09 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).