Long-term prediction and frequency domain pitch period based encoding and decoding

US10096327B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10096327-B2
Application numberUS-201815904159-A
CountryUS
Kind codeB2
Filing dateFeb 23, 2018
Priority dateMay 23, 2012
Publication dateOct 9, 2018
Grant dateOct 9, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A frequency-domain sample interval corresponding to a time-domain pitch period L corresponding to a time-domain pitch period code of an audio signal in a given time period is obtained as a converted interval T1, a frequency-domain pitch period T is chosen from among candidates including the converted interval T1 and integer multiples U×T1 of the converted interval T1, and a frequency-domain pitch period code indicating how many times the frequency-domain pitch period T is greater than the converted interval T1 is obtained. The frequency-domain pitch period code is output so that a decoding side can identify the frequency-domain pitch period T.

First claim

Opening claim text (preview).

What is claimed is: 1. An encoding method comprising: a long-term prediction analysis step of receiving an audio signal in a given time period, performing time-domain long-term prediction analysis of the audio signal in the given time period to obtain a time-domain pitch period L and a time-domain pitch period code corresponding to the time-domain pitch period L, and outputting the time-domain pitch period code to a decoder; a long-term prediction residual generation step of using the time-domain pitch period L to obtain a long-term prediction residual signal of the audio signal; a frequency-domain sample string generation step of obtaining an N-points frequency-domain sample string which is derived from the long-term prediction residual signal or an N-points frequency-domain sample string which is derived from the audio signal; a period conversion step of obtaining, as a converted interval T 1 , a sample interval in the N-points frequency-domain sample string, the sample interval corresponding to the time-domain pitch period L; a frequency-domain pitch period analysis step of receiving the N-points frequency-domain sample string, choosing a first frequency-domain pitch period T from among a plurality of candidates including integer multiples U×T 1 of the converted interval T 1 , where U is an integer in a predetermined first range, the first frequency-domain pitch period T being a pitch period in the N-points frequency-domain sample string, obtaining a first frequency-domain pitch period code indicating how many times the first frequency-domain pitch period T is greater than the converted interval T 1 , and outputting the first frequency-domain pitch period code to the decoder; and a frequency-domain-pitch-period-based encoding step of encoding a first sample group of all or some of one or a plurality of successive samples including a sample corresponding to the first frequency-domain pitch period T in the N-points frequency-domain sample string and one or a plurality of successive samples including a sample corresponding to an integer multiple of the first frequency-domain pitch period T in the N-points frequency-domain sample string in accordance with a first criterion corresponding to magnitudes of amplitudes or estimated magnitudes of amplitudes of samples included in the first sample group and encoding a second sample group of samples in the sample string that are not included in the first sample group in accordance with a second criterion corresponding to magnitudes of amplitudes or estimated magnitudes of amplitudes of samples included in the second sample group, to obtain a code string, and outputting the code string which is obtained by encoding the first sample group and the second sample group to the decoder, wherein the first sample group is a part of the N-points frequency-domain sample string. 2. A non-transitory computer-readable recording medium storing a program for causing a computer to execute the encoding method according to claim 1 . 3. A decoding method comprising: a long-term prediction information decoding step of receiving a time-domain pitch period code which is output from an encoder, and decoding the received time-domain pitch period code to obtain a time-domain pitch period L; a period converting step of obtaining, as a converted interval T 1 , a sample interval in an N-points frequency-domain sample string, the sample interval corresponding to the time-domain pitch period L, receiving a first frequency-domain pitch period code which is output from the encoder, decoding the received first frequency-domain pitch period code to obtain a multiple value indicating how many times a first frequency-domain pitch period T is greater than the converted interval T 1 , and obtaining, as the first frequency-domain pitch period T, the converted interval T 1 multiplied by the multiple value; a frequency-domain-pitch-period-based decoding step of receiving a code string which is output from the encoder, and decoding the code string by a decoding method in which a first sample group of all or some of one or a plurality of successive samples including a sample corresponding to the first frequency-domain pitch period T in the N-points frequency-domain sample string and one or a plurality of successive samples including a sample corresponding to an integer multiple of the first frequency-domain pitch period T in the N-points frequency-domain sample string is obtained by decoding processes according to a first criterion corresponding to magnitudes of amplitudes or estimated magnitudes of amplitudes of samples included in the first sample group and a second sample group of samples in the N-points frequency-domain sample string that are not included in the first sample group is obtained by decoding processes according to a second criterion corresponding to magnitudes of amplitudes or estimated magnitudes of amplitudes of samples included in the second sample group, to obtain and output the first sample group and the second sample group of the N-points frequency-domain sample string, wherein the first sample group is a part of the N-points frequency-domain sample string; a time-domain signal string generation step of obtaining a time-domain signal string derived from the N-points frequency-domain sample string; and a long-term prediction combining step of using the time-domain signal string, the time-domain pitch period L and a previous decoded audio signal string to obtain and output a decoded audio signal string. 4. A non-transitory computer-readable recording medium storing a program for causing a computer to execute the decoding method according to claim 3 . 5. An encoder comprising: a long-term prediction analyzer receiving an audio signal in a given time period, performing time-domain long-term prediction analysis of the audio signal in the given time period to obtain a time-domain pitch period L and a time-domain pitch period code corresponding to the time-domain pitch period L, and outputting the time-domain pitch period code to a decoder; a long-term prediction residual arithmetic unit using the time-domain pitch period L to obtain a long-term prediction residual signal of the audio signal; a frequency-domain transformer obtaining an N-points frequency-domain sample string which is derived from the long-term prediction residual signal or an N-points frequency-domain sample string which is derived from the audio signal; a period converter obtaining, as a converted interval T 1 , a sample interval in the N-points frequency-domain sample string, the sample interval corresponding to the time-domain pitch period L; a frequency-domain pitch period analyzer receiving the N-points frequency-domain sample string, choosing a first frequency-domain pitch period T from among a plurality of candidates including integer multiples U×T 1 of the converted interval T 1 , where U is an integer in a predetermined first range, the first frequency-domain pitch period T being a pitch period in the N-points frequency-domain sample string, obtaining a first frequency-domain pitch period code indicating how many times the first frequency-domain pitch period T is greater than the converted interval T 1 , and outputting the first frequency-domain pitch period code to the decoder; and a frequency-domain-pitch-period-based encoder encoding a first sample group of all or some of one or a plurality of successive samples including a sample corresponding to the first frequency-domain pitch period T in the N-points frequency-domain sample string and one or a plurality of successive samples including a sample corresponding to an integer multiple of the first frequency-domain pitch period T in the N-points frequency-domain sample string in accordance with a first criterion corresponding to magnitudes of amplitudes or estimated magnitude

Assignees

Inventors

Classifications

  • G10L19/09Primary

    Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor · CPC title

  • Pitch determination of speech signals · CPC title

  • using orthogonal transformation · CPC title

  • G10L19/032Primary

    Quantisation or dequantisation of spectral components · CPC title

  • Pitch tracking · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10096327B2 cover?
A frequency-domain sample interval corresponding to a time-domain pitch period L corresponding to a time-domain pitch period code of an audio signal in a given time period is obtained as a converted interval T1, a frequency-domain pitch period T is chosen from among candidates including the converted interval T1 and integer multiples U×T1 of the converted interval T1, and a frequency-domain pit…
Who is the assignee on this patent?
Nippon Telegraph & Telephone
What technology area does this patent fall under?
Primary CPC classification G10L19/09. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 09 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).