Who is the assignee on this patent?

Blouet Raphael, Daon Holdings Ltd

What technology area does this patent fall under?

Primary CPC classification G10L15/04. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue May 09 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Methods and systems for splitting a digital signal

US9646613B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9646613-B2
Application number	US-201314093200-A
Country	US
Kind code	B2
Filing date	Nov 29, 2013
Priority date	Nov 29, 2013
Publication date	May 9, 2017
Grant date	May 9, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for splitting a digital signal using prosodic features included in the signal is provided that includes calculating onset value locations in the signal. The onset values correspond to stress accents in the signal. Moreover, the method includes splitting, using a processor, the signal into a prosodic unit candidate sequence by superimposing the stress accent locations on the signal, and processing the sequence to include only true prosodic units.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for authenticating users comprising: calculating signal features for an audio digital signal representing a user utterance, the signal features including onset values, the onset values correspond to stress accents in the signal; splitting, by a processor, the signal into a prosodic unit candidate sequence based on the stress accent locations; modifying, by the processor, the prosodic unit candidate sequence based on the signal features, the modified prosodic unit sequence including only true prosodic units calculating a first probability that the true prosodic units match a prosodic unit sequence used to create a hidden Markov model for the user; calculating a second probability that the true prosodic units match prosodic unit sequences used to create a Universal Background Model; and successfully authenticating the user when the difference between the first probability and the second probability is greater than a decision threshold. 2. A method in accordance with claim 1 , said modifying step comprising: calculating the energy level for each prosodic unit candidate in the prosodic unit candidate sequence; and removing from the prosodic unit candidate sequence prosodic unit candidates having a low energy level. 3. A method in accordance with claim 1 , said modifying step comprising: identifying borders, shared by prosodic unit candidates, located in a high level region of the signal; and moving the identified borders to a low level energy region of the signal. 4. A method in accordance with claim 1 , further comprising: creating a hidden Markov model in which each true prosodic unit defines a state in the model; and optimizing the model. 5. A computing system for authenticating users comprising: a processor; and a memory configured to store data, said computing device being associated with a network and said memory being in communication with said processor and having instructions stored therein which, when read and executed by said processor cause said computing device to: calculate signal features for an audio digital signal representing a user utterance, the signal features including onset values, the onset values correspond to stress accents in the signal; split the signal into a prosodic unit candidate sequence based on the stress accent locations; modify the prosodic unit candidate sequence based on the signal features, the modified prosodic unit sequence including only true prosodic units; calculate a first probability that the true prosodic units match a prosodic unit sequence used to create a hidden Markov model for the user; calculate a second probability that the true prosodic units match prosodic unit sequences used to create a Universal Background Model; and successfully authenticate the user when the difference between the first probability and the second probability is greater than a decision threshold. 6. A computing system in accordance with claim 5 , wherein the instructions when executed by said processor further cause said computing device to: calculate the energy level for each prosodic unit candidate in the prosodic unit candidate sequence; and remove from the prosodic unit candidate sequence prosodic unit candidates having a low energy level. 7. A computing system in accordance with claim 5 , wherein the instructions when executed by said processor further cause said computing device to: identify borders, shared by prosodic unit candidates, located in a high level region of the signal; and move the identified borders to a low level energy region of the signal. 8. A computing system in accordance with claim 5 , wherein the instructions when executed by said processor further cause said computing device to: create a hidden Markov model in which each true prosodic unit defines a state in the model; and optimize the model. 9. A computing system in accordance with claim 5 , wherein the instructions when executed by said processor further cause said computing device to compute a decision score using the hidden Markov model. 10. A computing system in accordance with claim 5 , said computing device being a smart phone or a tablet computer. 11. A computing system in accordance with claim 5 , said computing device being an authentication computer system. 12. A computer program recorded on a non-transitory computer-readable recording medium included in a computing device for generating trustworthy authentication transaction results, the computer program being comprised of instructions, which when read and executed by the computing device, cause the computing device to: calculate signal features for an audio digital signal representing a user utterance, the signal features including onset values, the onset values correspond to stress accents in the signal; split the signal into a prosodic unit candidate sequence based on the stress accent locations; modify the prosodic unit candidate sequence based on the signal features, the modified prosodic unit sequence including only true prosodic units; calculate a first probability that the true prosodic units match a prosodic unit sequence used to create a hidden Markov model for the user; calculate a second probability that the true prosodic units match prosodic unit sequences used to create a Universal Background Model; and successfully authenticate the user when the difference between the first probability and the second probability is greater than a decision threshold. 13. A computer program in accordance with claim 12 wherein the instructions which when read and executed by the computing device, further cause the computing device to: calculate the energy level for each prosodic unit candidate in the prosodic unit candidate sequence; and remove from the prosodic unit candidate sequence prosodic unit candidates having a low energy level. 14. A computer program in accordance with claim 12 wherein the instructions which when read and executed by the computing device, further cause the computing device to: identify borders, shared by prosodic unit candidates, located in a high level region of the signal; and move the identified borders to a low level energy region of the signal. 15. A computer program in accordance with claim 12 wherein the instructions which when read and executed by the computing device, further cause the computing device to: create a hidden Markov model in which each true prosodic unit defines a state in the model; and optimize the model. 16. A computer program in accordance with claim 12 wherein the instructions which when read and executed by the computing device, further cause the computing device to compute a decision score using the hidden Markov model.

Assignees

Inventors

Blouet Raphael

Classifications

H04M3/385
using speech signals · CPC title
G10L15/1807
using prosody or stress · CPC title
G10L15/04Primary
Segmentation; Word boundary detection · CPC title
G10L17/04
Training, enrolment or model building · CPC title
G10L17/22Primary
Interactive procedures; Man-machine interfaces · CPC title

Patent family

Related publications grouped by family.

View patent family 51900226

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9646613B2 cover?: A method for splitting a digital signal using prosodic features included in the signal is provided that includes calculating onset value locations in the signal. The onset values correspond to stress accents in the signal. Moreover, the method includes splitting, using a processor, the signal into a prosodic unit candidate sequence by superimposing the stress accent locations on the signal, and…
Who is the assignee on this patent?: Blouet Raphael, Daon Holdings Ltd
What technology area does this patent fall under?: Primary CPC classification G10L15/04. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue May 09 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).