Harmonicity-dependent controlling of a harmonic filter tool

US11581003B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11581003-B2
Application numberUS-202016885109-A
CountryUS
Kind codeB2
Filing dateMay 27, 2020
Priority dateJul 28, 2014
Publication dateFeb 14, 2023
Grant dateFeb 14, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The coding efficiency of an audio codec using a controllable—switchable or even adjustable—harmonic filter tool is improved by performing the harmonicity-dependent controlling of this tool using a temporal structure measure in addition to a measure of harmonicity in order to control the harmonic filter tool. In particular, the temporal structure of the audio signal is evaluated in a manner which depends on the pitch. This enables to achieve a situation-adapted control of the harmonic filter tool so that in situations where a control made solely based on the measure of harmonicity would decide against or reduce the usage of this tool, although using the harmonic filter tool would, in that situation, increase the coding efficiency, the harmonic filter tool is applied, while in other situations where the harmonic filter tool may be inefficient or even destructive, the control reduces the appliance of the harmonic filter tool appropriately.

First claim

Opening claim text (preview).

The invention claimed is: 1. An apparatus for performing a harmonicity-dependent controlling of a harmonic filter tool of an audio codec, comprising a harmonicity measurer configured to determine a measure of harmonicity of the audio signal, a temporal structure analyzer configured to determine at least one temporal structure measure measuring a characteristic of a temporal structure of the audio signal; a controller configured to control the harmonic filter tool depending on the temporal structure measure and the measure of harmonicity. 2. The apparatus according to claim 1 , wherein the harmonicity measurer is configured to determine the measure of harmonicity by computing a normalized correlation of the audio signal or a pre-modified version thereof at or around a pitch-lag of the audio signal. 3. The apparatus according to claim 1 , further comprising a pitch estimator configured to determine a pitch of the audio signal. 4. The apparatus according to claim 3 , wherein the pitch estimator is configured to, within a first stage, determine a preliminary estimation of the pitch at a down-sampled domain of a first sample rate and, within a second stage, refine the preliminary estimation of the pitch at a second sample rate, higher than the first sample rate. 5. The apparatus according to claim 3 , wherein the pitch estimator is configured to determine the pitch using autocorrelation. 6. The apparatus according to claim 3 , wherein the temporal structure analyzer is configured to determine the at least one temporal structure measure within a temporal region temporally placed depending on the pitch. 7. The apparatus according to claim 6 , wherein the temporal structure analyzer is configured to position a temporally past-heading end of the temporal region, or of a region of higher influence onto the determination of the temporal structure measure, depending on the pitch. 8. The apparatus according to claim 3 , wherein the temporal structure analyzer is configured to position the temporal past-heading end of the temporal region or, of the region of higher influence onto the determination of the temporal structure measure, such that the temporally past-heading end of the temporal region or, of the region of higher influence onto the determination of the temporal structure measure, is displaced into past direction by a temporal amount monotonically increasing with a decrease of the pitch. 9. The apparatus according to claim 7 , wherein the temporal structure analyzer is configured to position a temporally future-heading end of the temporal region or, of the region of higher influence onto the determination of the temporal structure measure, depending on the temporal structure of the audio signal within a temporal candidate region extending from the temporally past-heading end of the temporal region, or of the region of higher influence onto the determination of the temporal structure measure, to a temporally future-heading end of a current frame. 10. The apparatus according to claim 9 , wherein the temporal structure analyzer is configured to use an amplitude or ratio between maximum and minimum energy samples within the temporal candidate region in order to position the temporally future-heading end of the temporal region or, of the region of higher influence onto the determination of the temporal structure measure. 11. The apparatus according to claim 1 , wherein the controller comprises a logic configured to check whether a predetermined condition is met by the at least one temporal structure measure and the measure of harmonicity so as to achieve a check result; and a switch configured to switch between enabling and disabling the harmonic filter tool depending on the check result. 12. The apparatus according to claim 11 , wherein the at least one temporal structure measure measures an average or maximum energy variation of the audio signal within the temporal region and the logic is configured such that the predetermined condition is met if both the at least one temporal structure measure is smaller than a predetermined first threshold and the measure of harmonicity is, for a current frame and/or a previous frame, above a second threshold. 13. The apparatus according to claim 12 , wherein the logic is configured such that the predetermined condition is also met if the measure of harmonicity is, for a current frame, above a third threshold, and the measure of harmonicity is, for a current frame and/or a previous frame, above a fourth threshold which decreases with an increase of a pitch lag of the audio signal. 14. The apparatus according to claim 1 , wherein the controller is configured to control the harmonic filter tool by explicitly signaling a control signal via an audio codec's data stream to a decoding side; or explicitly signaling a control signal via an audio codec's data stream to a decoding side for controlling a post-filter at the decoding side and, in line with the control of the post-filter at the decoding side, controlling a pre-filter at an encoder side. 15. The apparatus according to claim 1 , wherein the temporal structure analyzer is configured to determine the at least one temporal structure measure in a spectrally discriminating manner so as to acquire one value of the at least one temporal structure measure per spectral band of a plurality of spectral bands. 16. The apparatus according to claim 1 , wherein the controller is configured to control the harmonic filter tool at units of frames, and the temporal structure analyzer is configured to sample an energy of the audio signal at a sample rate higher than a frame rate of the frames so as to acquire energy samples of the audio signal and to determine the at least one temporal structure measure on the basis of the energy samples. 17. The apparatus according to claim 16 , wherein the temporal structure analyzer is configured to determine the at least one temporal structure measure within a temporal region temporally placed depending on a pitch of the audio signal and the temporal structure analyzer is configured to determine the at least one temporal structure measure on the basis of the energy samples by computing a set of energy change values measuring a change between pairs of immediately consecutive energy samples of the energy samples within the temporal region and subjecting the set of energy change values to a scalar function comprising a maximum operator or a sum over addends each of which depends on exactly one of the set of energy change values. 18. The apparatus according to claim 16 , wherein the temporal spectrum analyzer is configured to perform the sampling of the energy of the audio signal within a high-pass filtered domain. 19. The apparatus according to claim 3 , wherein the pitch estimator, the harmonicity measurer and the temporal structure analyzer perform its determination based on different versions of the audio signal comprising the original audio signal and some pre-modified version thereof. 20. The apparatus according to claim 1 , wherein the controller is configured to, in controlling the harmonic filter tool, depending on the temporal structure measure and the measure of harmonicity switch between enabling and disabling a pre-filter and/or a post-filter of the harmonic filter tool, or gradually adapt a filter strength of the pre-filter and/or the post-filter of the harmonic filter tool, wherein the harmonic filter tool is of a pre-filter plus post-filter approach and the pre-filter of the harmonic filter tool is configure

Assignees

Inventors

Classifications

  • G10L19/26Primary

    Pre-filtering or post-filtering · CPC title

  • the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders · CPC title

  • Noise substitution, i.e. substituting non-tonal spectral components by noisy source (comfort noise for discontinuous speech transmission G10L19/012) · CPC title

  • Mode decision, i.e. based on audio signal content versus external parameters · CPC title

  • G10L19/025Primary

    Detection of transients or attacks for time/frequency resolution switching · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11581003B2 cover?
The coding efficiency of an audio codec using a controllable—switchable or even adjustable—harmonic filter tool is improved by performing the harmonicity-dependent controlling of this tool using a temporal structure measure in addition to a measure of harmonicity in order to control the harmonic filter tool. In particular, the temporal structure of the audio signal is evaluated in a manner whic…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification G10L19/26. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 14 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).