Who is the assignee on this patent?

Nat Inst Inf & Comm Tech, Nat Inst Inf & Comm Tech

What technology area does this patent fall under?

Primary CPC classification G10L13/10. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Jun 30 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Quantitative f0 contour generating device and method, and model learning device and method for f0 contour generation

US2016189705A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2016189705-A1
Application number	US-201414911189-A
Country	US
Kind code	A1
Filing date	Aug 13, 2014
Priority date	Aug 23, 2013
Publication date	Jun 30, 2016
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

[Object] An object is to provide an F0 contour synthesizing device based on statistic model, to clarify correspondence between linguistic information and F0 contour while maintaining accuracy. [Solution] An HMM learning device includes: a parameter estimating unit representing an F0 contour 133 fitting a continuous F0 contour 132 as a sum of phrase components and accent components and estimating target points of these; and an HMM learning means conducting learning of HMM 139 using the fitted F0 contour as training data. The continuous F0 contour may be decomposed to accent components 134 , phrase components 136 and micro-prosody components 138 , and separate HMMs 140, 142 and 144 may be trained. Using results of text analysis, accent components, phrase components and micro-prosody components are separately synthesized from HMMs 140, 142 and 144 and the results are synthesized to obtain an F0 contour.

First claim

Opening claim text (preview).

1 . A quantitative F0 contour generating device, comprising: means for generating, for an accent phrase of an utterance obtained by text analysis, accent components of an F0 contour using a given number of target points; means for generating phrase components of the F0 contour using a limited number of target points, by dividing the utterance to groups each including one or more accent phrases, in accordance with linguistic information including an utterance structure; and means for generating an F0 contour based on said accent components and said phrase components. 2 . A quantitative F0 contour generating method, comprising the steps of: generating, for an accent phrase of an utterance obtained by text analysis, accent components of an F0 contour using a given number of target points; generating phrase components of the F0 contour using a limited number of target points, by dividing the utterance to groups each including one or more accent phrases, in accordance with linguistic information including an utterance structure; and generating an F0 contour based on said accent components and said phrase components. 3 .- 4 . (canceled) 5 . A model learning device for F0 contour generation, comprising: F0 contour extracting means for extracting an F0 contour from a speech data signal; parameter estimating means for estimating target parameters representing phrase components and target parameters representing accent components, for representing an F0 contour fitting the extracted F0 contour by superposition of phrase components and accent components; and model learning means, performing F0 generation model learning, using a continuous F0 contour represented by the target parameters of phrase components and the target parameters of accent components estimated by said parameter estimating means as training data. 6 . The model learning device according to claim 5 , wherein said F0 generation model includes a generation model for generating phrase components and a generation model for generating accent components; and said model learning means includes means for performing learning of said generation model for generating phrase components and said generation model for generating accent components, respectively using, as training data, a time change contour of phrase components represented by target parameters of the phrase components and a time change contour of accent components represented by target parameters of the accent components, estimated by said parameter estimating means. 7 . A model learning method for F0 contour generation, comprising the steps of: F0 contour extracting step of extracting an F0 contour from a speech data signal; parameter estimating step of estimating target parameters representing phrase components and target parameters representing accent components, for representing an F0 contour fitting the extracted F0 contour by superposition of phrase components and accent components; and model learning step of performing F0 generation model learning, using a continuous F0 contour represented by the target parameters of phrase components and the target parameters of accent components estimated by said parameter estimating means as training data. 8 . The model learning method according to claim 7 , wherein said F0 generation model includes a generation model for generating phrase components and a generation model for generating accent components; and said model learning step includes the step of performing learning of said generation model for generating phrase components and said generation model for generating accent components, respectively using, as training data, a time change contour of phrase components represented by target parameters of the phrase components and a time change contour of accent components represented by target parameters of the accent components, estimated at said parameter estimating step.

Assignees

Inventors

Classifications

G10L25/18
the extracted parameters being spectral information of each sub-band · CPC title
G10L13/10Primary
Prosody rules derived from text; Stress or intonation · CPC title
G10L13/027
Concept to speech synthesisers; Generation of natural phrases from machine-based concepts (generation of parameters for speech synthesis out of text G10L13/08) · CPC title
G10L13/086
Detection of language · CPC title
G10L21/0205
Physics · mapped topic

Patent family

Related publications grouped by family.

View patent family 52483564

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016189705A1 cover?: [Object] An object is to provide an F0 contour synthesizing device based on statistic model, to clarify correspondence between linguistic information and F0 contour while maintaining accuracy. [Solution] An HMM learning device includes: a parameter estimating unit representing an F0 contour 133 fitting a continuous F0 contour 132 as a sum of phrase components and accent components and…
Who is the assignee on this patent?: Nat Inst Inf & Comm Tech, Nat Inst Inf & Comm Tech
What technology area does this patent fall under?: Primary CPC classification G10L13/10. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Jun 30 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Prosodic and lexical addressee detection

Acoustic model training

Sub-lexical language models with word level pronunciation lexicons

Targeted detection of regions in speech processing data streams

Training punctuation models

Generating prosodic contours for synthesized speech

Frequently asked questions