Speaker identification device, speaker identification method, and recording medium

US10249306B2 · US · B2

Patent metadata
Field                Value
Publication number   US-10249306-B2
Application number   US-201414760617-A
Country              US
Kind code            B2
Filing date          Jan 16, 2014
Priority date        Jan 17, 2013
Publication date     Apr 2, 2019
Grant date           Apr 2, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A speaker identification device includes: a primary speaker identification unit that computes, for each pre-stored registered speaker, a score that indicates the similarity between input speech and speech of the registered speakers; a similar speaker selection unit that selects a plurality of the registered speakers as similar speakers according to the height of the scores thereof; a learning unit that creates a classifier for each similar speaker by sorting the speech of a certain similar speaker among the similar speakers as a positive instance and the speech of the other similar speakers as negative instances; and a secondary speaker identification unit that computes, for each classifier, a score of the classifier with respect to the input speech, and outputs an identification result.
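The two-stage flow in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: utterances are assumed to be already reduced to fixed-length feature vectors, the primary score is assumed to be cosine similarity to each speaker's mean vector, and a simple perceptron stands in for the classifier. All function names (`primary_scores`, `select_similar`, `train_linear`, `identify`) are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def mean_vector(vectors):
    """Component-wise mean of a list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def primary_scores(x, enrolled):
    """Primary identification: one similarity score per registered speaker.
    Assumption: cosine similarity to the speaker's mean enrolled vector."""
    return {spk: cosine(x, mean_vector(utts)) for spk, utts in enrolled.items()}

def select_similar(scores, n):
    """Similar speaker selection: the n highest-scoring registered speakers."""
    return sorted(scores, key=scores.get, reverse=True)[:n]

def train_linear(pos, neg, epochs=50):
    """Learning: perceptron stand-in for the classifier. Returns (w, b)."""
    w, b = [0.0] * len(pos[0]), 0.0
    data = [(v, 1) for v in pos] + [(v, -1) for v in neg]
    for _ in range(epochs):
        for v, y in data:
            if y * (sum(wi * vi for wi, vi in zip(w, v)) + b) <= 0:
                w = [wi + y * vi for wi, vi in zip(w, v)]
                b += y
    return w, b

def identify(x, enrolled, n_similar=2):
    """Run both stages and return (best speaker, similar speakers)."""
    similar = select_similar(primary_scores(x, enrolled), n_similar)
    results = {}
    for spk in similar:
        # Positive instances: this speaker. Negative: the other similar speakers.
        pos = enrolled[spk]
        neg = [u for other in similar if other != spk for u in enrolled[other]]
        w, b = train_linear(pos, neg)
        # Secondary identification: classifier score for the input speech.
        results[spk] = sum(wi * xi for wi, xi in zip(w, x)) + b
    return max(results, key=results.get), similar
```

The point of the second stage is that each classifier is trained only on the speakers that the first stage could not separate well, so it can focus on the differences among them rather than on the whole registered population.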

First claim

Opening claim text (preview).

What is claimed is:

1. A speaker identification device comprising: a primary speaker identification unit which computes, for each registered speaker stored in advance, a score that indicates similarity between input speech and speech of the registered speakers; a similar speaker selection unit which selects a plurality of the registered speakers as similar speakers according to height of the scores; a learning unit which creates a plurality of classifiers, each classifier corresponding to a different speaker of similar speakers, wherein for each classifier, the classifier corresponds speech of the different speaker to which the classifier corresponds as a positive instance and speech of other speakers of the similar speakers as negative instances; and a secondary speaker identification unit which computes, for each classifier, a score of the classifier with respect to the input speech and outputs an identification result.

2. The speaker identification device according to claim 1, wherein the learning unit stores in advance pairs of the similar speakers selected by the similar speaker selection unit in the past and classifiers created by the learning unit in the past as history, and creates a classifier only when there is a difference between the similar speakers in the history and the similar speakers selected by the similar speaker selection unit.

3. The speaker identification device according to claim 1, wherein the similar speaker selection unit selects a preset number of the similar speakers.

4. The speaker identification device according to claim 1, wherein the similar speaker selection unit selects the similar speakers based on a preset score threshold.

5. The speaker identification device according to claim 1, wherein the classifier is a Support Vector Machine (SVM), and the score of the classifier is a distance from a feature point of the input speech to a classification plane.

6. A speaker identification method comprising: computing, for each registered speaker stored in advance, a score that indicates similarity between input speech and speech of the registered speakers; selecting a plurality of the registered speakers as similar speakers according to height of the scores; creating a plurality of classifiers, each classifier corresponding to a different speaker of similar speakers; for each classifier, corresponding, by the classifier, speech of the different speaker to which the classifier corresponds as a positive instance and speech of other speakers of the similar speakers as negative instances; and computing, for each classifier, a score of the classifier with respect to the input speech and outputting an identification result.

7. A non-transitory computer readable medium that stores therein a speaker identification program that causes a computer to execute: primary speaker identification processing of computing, for each registered speaker stored in advance, a score that indicates similarity between input speech and speech of the registered speakers; similar speaker selection processing of selecting a plurality of the registered speakers as similar speakers according to height of the scores; learning processing of creating a plurality of classifiers, each classifier corresponding to a different speaker of similar speakers; for each classifier, corresponding, by the classifier, speech of the different speaker to which the classifier corresponds as a positive instance and speech of other speakers of the similar speakers as negative instances; and secondary speaker identification processing of computing, for each classifier, a score of the classifier with respect to the input speech and outputs an identification result.
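The dependent device claims (2 to 5) each narrow one step, and a small sketch may make the differences easier to see. The code below is an illustration only. The helper names are hypothetical, the history is assumed to be keyed on the unordered set of similar speakers (the claim text does not say how the comparison is done), and the distance function is the standard signed point-to-hyperplane distance for a linear decision boundary, which is an assumption about how the "distance to a classification plane" would be computed.

```python
import math

def select_top_n(scores, n):
    """Claim 3 style: keep a preset number of highest-scoring speakers."""
    return sorted(scores, key=scores.get, reverse=True)[:n]

def select_by_threshold(scores, threshold):
    """Claim 4 style: keep every speaker whose score reaches a preset threshold."""
    kept = [s for s in scores if scores[s] >= threshold]
    return sorted(kept, key=scores.get, reverse=True)

class ClassifierHistory:
    """Claim 2 style: retrain only when the similar-speaker set has changed.
    Assumption: two selections match when they contain the same speakers."""

    def __init__(self, train_fn):
        self._train_fn = train_fn   # hypothetical callback: list of speakers -> classifiers
        self._cache = {}
        self.train_calls = 0

    def get(self, similar):
        key = frozenset(similar)
        if key not in self._cache:
            self.train_calls += 1
            self._cache[key] = self._train_fn(sorted(key))
        return self._cache[key]

def signed_distance(w, b, x):
    """Claim 5 style: signed distance from feature point x to the plane w.x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return (sum(wi * xi for wi, xi in zip(w, x)) + b) / norm
```

A preset count gives a fixed training cost per query, while a threshold lets the number of similar speakers vary with how ambiguous the input is. Caching by speaker set avoids retraining when consecutive inputs fall among the same confusable speakers.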

Assignees

NEC Corp

Inventors

Classifications

  • Decision making techniques; Pattern matching strategies · CPC title

  • G10L17/12 · Primary

    Score normalisation · CPC title

  • G10L17/04 · Primary

    Training, enrolment or model building · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10249306B2 cover?
A speaker identification device includes: a primary speaker identification unit that computes, for each pre-stored registered speaker, a score that indicates the similarity between input speech and speech of the registered speakers; a similar speaker selection unit that selects a plurality of the registered speakers as similar speakers according to the height of the scores thereof; a learning u…
Who is the assignee on this patent?
NEC Corp
What technology area does this patent fall under?
Primary CPC classification G10L17/12. Mapped technology areas include Physics.
When was this patent published?
Publication date: Apr 2, 2019 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).