US8930183B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-8930183-B2 |
| Application number | US-201113217628-A |
| Country | US |
| Kind code | B2 |
| Filing date | Aug 25, 2011 |
| Priority date | Mar 29, 2011 |
| Publication date | Jan 6, 2015 |
| Grant date | Jan 6, 2015 |
Official abstract text for this publication.
A method of converting speech from the characteristics of a first voice to the characteristics of a second voice, the method comprising: receiving a speech input from a first voice, dividing said speech input into a plurality of frames; mapping the speech from the first voice to a second voice; and outputting the speech in the second voice, wherein mapping the speech from the first voice to the second voice comprises, deriving kernels demonstrating the similarity between speech features derived from the frames of the speech input from the first voice and stored frames of training data for said first voice, the training data corresponding to different text to that of the speech input and wherein the mapping step uses a plurality of kernels derived for each frame of input speech with a plurality of stored frames of training data of the first voice.
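The framing and kernel steps named in the abstract can be illustrated with a minimal sketch. The abstract does not specify a frame length, a feature type, or a kernel function, so the `frame_len` and `hop` parameters, the use of raw samples as features, and the squared-exponential kernel below are all assumptions chosen for illustration, not the patented method.

```python
import math


def split_into_frames(samples, frame_len, hop):
    """Divide a sequence of samples into frames of frame_len, advancing by hop."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def sq_exp_kernel(a, b, length_scale=1.0):
    """Similarity between two feature vectors (assumed squared-exponential kernel)."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-d2 / (2.0 * length_scale ** 2))


def kernel_vectors(input_frames, training_frames, length_scale=1.0):
    """For each input frame, compute one kernel value per stored training frame."""
    return [[sq_exp_kernel(f, g, length_scale) for g in training_frames]
            for f in input_frames]


if __name__ == "__main__":
    speech = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5]   # hypothetical input samples
    stored = [[0.0, 0.1], [0.4, 0.5]]         # hypothetical stored training frames
    frames = split_into_frames(speech, frame_len=2, hop=2)
    for row in kernel_vectors(frames, stored):
        print(row)
```

Each input frame yields a row of kernel values, one per stored training frame, which matches the abstract's "plurality of kernels derived for each frame of input speech".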
Opening claim text (preview).
The invention claimed is:

1. A method of converting speech from the characteristics of a first voice to the characteristics of a second voice, the method comprising: receiving a speech input from a first voice, dividing said speech input into a plurality of frames; in a processor, mapping the speech from the first voice to a second voice using a Gaussian process; and outputting the speech in the second voice, wherein mapping the speech from the first voice to the second voice comprises, deriving kernels demonstrating the similarity between speech features derived from the frames of the speech input from the first voice and stored frames of training data for said first voice, the training data corresponding to different text to that of the speech input and wherein the mapping step uses a plurality of kernels derived for each frame of input speech with a plurality of stored frames of training data of the first voice and using said plurality of kernels to define a non-parametric Gaussian process prior for said mapping.

2. A method according to claim 1, wherein kernels are derived for both static and dynamic speech features.

3. A method according to claim 1, wherein the speech to be output is determined according to a Gaussian Process predictive distribution:

   $$p(y_t \mid x_t, x^*, y^*, \mathcal{M}) = \mathcal{N}(\mu(x_t), \Sigma(x_t)),$$

   where $y_t$ is the speech vector for frame $t$ to be output, $x_t$ is the speech vector for the input speech for frame $t$, $x^*, y^*$ is $\{x_1^*, y_1^*\}, \ldots, \{x_N^*, y_N^*\}$, where $x_t^*$ is the t-th frame of training data for the first voice and $y_t^*$ is the t-th frame of training data for the second voice, $\mathcal{M}$ denotes the model, $\mu(x_t)$ and $\Sigma(x_t)$ are the mean and variance of the predictive distribution for given $x_t$.

4. A method according to claim 3, wherein

   $$\mu(x_t) = m(x_t) + k_t^T \left[ K^* + \sigma^2 I \right]^{-1} (y^* - \mu^*),$$

   $$\Sigma(x_t) = k(x_t, x_t) + \sigma^2 - k_t^T \left[ K^* + \sigma^2 I \right]^{-1} k_t,$$

   where

   $$\mu^* = \left[ m(x_1^*) \;\; m(x_2^*) \;\; \ldots \;\; m(x_N^*) \right]^T$$
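As a worked illustration of the predictive mean and variance in claim 4, the following NumPy sketch evaluates both expressions for a single input frame. The claim preview is cut off before it defines $K^*$ and $k_t$, so the sketch assumes the usual Gaussian process reading: $K^*$ is the kernel matrix over the training frames of the first voice and $k_t$ holds the kernel values between $x_t$ and each training frame. The squared-exponential kernel, the zero mean function, the `noise_var` default, and all function names are likewise assumptions, and the target `y_train` is reduced to one feature dimension for brevity.

```python
import numpy as np


def sq_exp(a, b, length_scale=1.0):
    """Kernel matrix between rows of a and rows of b (assumed squared-exponential)."""
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-d2 / (2.0 * length_scale ** 2))


def zero_mean(x):
    """Assumed mean function m(x) = 0 for every row of x."""
    return np.zeros(len(x))


def gp_predict(x_t, x_train, y_train, noise_var=1e-2, mean_fn=zero_mean):
    """Predictive mean and variance for one input frame x_t.

    x_train: (N, D) array of first-voice training frames (x*)
    y_train: (N,) array, one second-voice feature per training frame (y*)
    """
    x_t = np.atleast_2d(np.asarray(x_t, dtype=float))
    x_train = np.asarray(x_train, dtype=float)
    y_train = np.asarray(y_train, dtype=float)

    K = sq_exp(x_train, x_train)            # K* (assumed: training Gram matrix)
    k_t = sq_exp(x_train, x_t)[:, 0]        # k_t (assumed: input-to-training kernels)
    A = K + noise_var * np.eye(len(x_train))
    mu_star = mean_fn(x_train)              # mu* = [m(x_1*), ..., m(x_N*)]^T

    mu = mean_fn(x_t)[0] + k_t @ np.linalg.solve(A, y_train - mu_star)
    var = sq_exp(x_t, x_t)[0, 0] + noise_var - k_t @ np.linalg.solve(A, k_t)
    return mu, var


if __name__ == "__main__":
    x_star = np.array([[0.0], [1.0], [2.0]])  # hypothetical source frames
    y_star = np.array([0.0, 1.0, 4.0])        # hypothetical target features
    print(gp_predict([1.5], x_star, y_star))
```

Using `np.linalg.solve` rather than forming the explicit inverse is a numerical convenience only; it evaluates the same expressions as the claim.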
characterised by the process used · CPC title
Voice editing, e.g. manipulating the voice of the synthesiser · CPC title
Voice conversion or morphing · CPC title
Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility (G10L19/00 takes precedence) · CPC title
Training · CPC title