Method and apparatus for speaker-calibrated speaker detection

US9564134B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9564134-B2
Application numberUS-201514868226-A
CountryUS
Kind codeB2
Filing dateSep 28, 2015
Priority dateDec 21, 2011
Publication dateFeb 7, 2017
Grant dateFeb 7, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present invention relates to a method and apparatus for speaker-calibrated speaker detection. One embodiment of a method for generating a speaker model for use in detecting a speaker of interest includes identifying one or more speech features that best distinguish the speaker of interest from a plurality of impostor speakers and then incorporating the speech features in the speaker model.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for generating a speaker model for use in detecting a speaker of interest, the method implemented by instructions embodied in one or more non-transitory computer accessible storage media and executable by a processor, the method comprising: identifying one or more speech features that best distinguish the speaker of interest from a plurality of impostor speakers by: obtaining a plurality of speech samples, the plurality of speech samples comprising a set of speech samples from the speaker of interest and a plurality of additional speech samples from the plurality of impostor speakers; extracting a plurality of speech features from the plurality of speech samples; and ranking the plurality of speech features according to an ability to distinguish the speaker of interest from the plurality impostor speakers, wherein the ranking comprises: modeling one or more speech features within one or more regions; and assigning a performance measure to the one or more speech features subsequent to the modeling, wherein the performance measure for an associated one of the one or more speech features represents a strength with which the associated one of the one or more speech features accurately distinguishes the speaker of interest from the plurality of impostor speakers; wherein at least one signal indicative of the identified one or more speech features is provided to a software application to generate the speaker model; and detecting, by the software application, the speaker of interest based on the generated speaker model. 2. The method of claim 1 , wherein at least one of the one or more speech features comprises a combination of two or more individual speech features. 3. The method of claim 1 , wherein the incorporating comprises: assigning a weight to each of the one or more speech features, based on the performance measure associated with the each of the one or more speech features. 4. The method of claim 3 , wherein a weight of zero excludes an associated one of the one or more speech features from the speaker model. 5. The method of claim 3 , wherein the assigning is performed by a classifier. 6. The method of claim 1 , wherein the one or more speech features includes at least one of: a cepstral feature, a prosodic feature, or a signal processing-based feature. 7. The method of claim 6 , wherein the cepstral feature is constrained by one of: a lexical feature, a phonetic feature, a state-level feature, a prosodic feature, a pause feature, a turn feature, or a speaking-rate feature. 8. The method of claim 1 , wherein the one or more speech features vary from one speaker of interest to another speaker of interest.

Assignees

Inventors

Classifications

  • G10L17/26Primary

    Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices · CPC title

  • G10L17/02Primary

    Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction · CPC title

  • Training, enrolment or model building · CPC title

  • the user being prompted to utter a password or a predefined phrase · CPC title

  • Decision making techniques; Pattern matching strategies · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9564134B2 cover?
The present invention relates to a method and apparatus for speaker-calibrated speaker detection. One embodiment of a method for generating a speaker model for use in detecting a speaker of interest includes identifying one or more speech features that best distinguish the speaker of interest from a plurality of impostor speakers and then incorporating the speech features in the speaker model.
Who is the assignee on this patent?
Stanford Res Inst Int
What technology area does this patent fall under?
Primary CPC classification G10L17/26. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 07 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).