Voice data playback speed conversion method and voice data playback speed conversion device

US9361905B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9361905-B2
Application numberUS-201414763303-A
CountryUS
Kind codeB2
Filing dateJan 21, 2014
Priority dateJan 28, 2013
Publication dateJun 7, 2016
Grant dateJun 7, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present invention addresses the problems of enabling a process of converting voice data playback speed even in a voice data playback device alone. The solution is a voice data playback speed conversion method and a voice data playback speed conversion device, comprising: a step of setting a reference zero cross point from any arbitrary zero cross point; a step of selecting a zero cross point temporally after the reference zero cross point within a first predetermined time range; a step of calculating a reference correlation function in a waveform from the reference zero cross point until a second predetermined time; and a step of calculating a correlation function in a waveform from a plurality of previously selected zero cross points until the second predetermined time, wherein a second reference zero cross point is the zero cross point of the waveform having a correlation function in which a concordance rate of the correlation value between the reference correlation function and the correlation function is the highest value, the difference between the reference zero cross point and the second reference zero cross point is calculated as a basic cycle, and the expansion and contraction of voice data is executed in basic cycle units so as to perform a process of converting the playback speed of the voice data.

First claim

Opening claim text (preview).

What is claimed is: 1. A voice data playback speed conversion method for converting voice data playback speed, comprising: a step of removing DC components, wherein DC components of original voice data being a playback object are removed; a step of extracting basic voice signals constituted by a basic frequency of the voice data, from which DC components have been removed, by setting a cutoff frequency at an intermediate value of the basic frequency and low-pass filtering so as to extract the basic frequency; a step of extracting rising zero cross points of the basic voice signals; a step of setting a reference zero cross point, which is an arbitrary reference zero cross point selected from the rising zero cross points; a step of selecting a plurality of the rising zero cross points temporally after the reference zero cross point within a first predetermined time range; a step of selecting a reference waveform temporally after the reference zero cross point until a second predetermined time; a step of selecting comparison object waveforms from each of the zero cross points, which has been selected in said step of selecting the rising zero cross points, until the second predetermined time; a step of calculating an autocorrelation value between the reference waveform and the reference waveform by using a correlation function; a step of calculating correlation values between the reference waveform and the comparison object waveforms by using a correlation function; a step of calculating voice blocks each of which is segmented by a start point of the voice data and an end point thereof, wherein the autocorrelation value is compared with the correlation values, the zero cross point of the comparison object waveform which is used for calculating the correlation value whose concordance rate with respect to the autocorrelation value is highest is defined as a second reference zero cross point, the start point of the voice data corresponds to the reference zero cross point, and the end point of the voice data corresponds to the second reference zero cross; and a step of expanding and contracting the voice data in basic cycle units so as to convert the playback speed of the voice data. 2. A voice data playback speed conversion device for converting voice data playback speed, comprising: means for removing DC components, wherein DC components of original voice data being a playback object are removed; means for extracting basic voice signals constituted by a basic frequency of the original voice data, from which DC components have been removed, by setting a cutoff frequency at an intermediate value of the basic frequency and low-pass filtering so as to extract the basic frequency; means for extracting rising zero cross points of the basic voice signals; means for setting a reference zero cross point, which is an arbitrary zero cross point selected from the rising zero cross points; means for selecting a plurality of the rising zero cross points temporally after the reference zero cross point within a first predetermined time range; means for selecting a reference waveform temporally after the reference zero cross point until a second predetermined time; means for selecting comparison object waveforms from each of the zero cross points, which has been selected by the means for selecting the rising zero cross points, until the second predetermined time; means for calculating an autocorrelation value between the reference waveform and the reference waveform by using a correlation function; means for calculating correlation values between the reference waveform and the comparison object waveforms by using a correlation function; means for calculating voice blocks each of which is segmented by a start point of the voice data and an end point thereof, wherein the autocorrelation value is compared with the correlation values, the zero cross point of the comparison object waveform which is used for calculating the correlation value whose concordance rate with respect to the autocorrelation value is highest is defined as a second reference zero cross point, the start point of the voice data corresponds to the reference zero cross point, and the end point of the voice data corresponds to the second reference zero cross point; and means for expanding and contracting the voice data in basic cycle units so as to convert the playback speed of the voice data.

Assignees

Inventors

Classifications

  • characterised by the analysis technique · CPC title

  • the extracted parameters being zero crossing rates · CPC title

  • G10L21/047Primary

    characterised by the type of waveform to be thinned out or inserted · CPC title

  • characterised by the interconnection of waveforms · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9361905B2 cover?
The present invention addresses the problems of enabling a process of converting voice data playback speed even in a voice data playback device alone. The solution is a voice data playback speed conversion method and a voice data playback speed conversion device, comprising: a step of setting a reference zero cross point from any arbitrary zero cross point; a step of selecting a zero cross poin…
Who is the assignee on this patent?
Shinano Kenshi Co
What technology area does this patent fall under?
Primary CPC classification G10L21/047. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 07 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).