Apparatus and method for selecting one of a first encoding algorithm and a second encoding algorithm using harmonics reduction
US-10706865-B2 · Jul 7, 2020 · US
US11581003B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11581003-B2 |
| Application number | US-202016885109-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 27, 2020 |
| Priority date | Jul 28, 2014 |
| Publication date | Feb 14, 2023 |
| Grant date | Feb 14, 2023 |
Official abstract text for this publication.
The coding efficiency of an audio codec using a controllable (switchable or even adjustable) harmonic filter tool is improved by controlling this tool based on a temporal structure measure in addition to a measure of harmonicity. In particular, the temporal structure of the audio signal is evaluated in a manner that depends on the pitch. This allows situation-adapted control of the harmonic filter tool: in situations where control based solely on the measure of harmonicity would decide against or reduce the use of the tool, although using it would increase the coding efficiency, the harmonic filter tool is applied, while in other situations where the tool may be inefficient or even destructive, the control reduces its application appropriately.
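The two inputs named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes a normalized correlation at the pitch lag as the harmonicity measure (the option named in claim 2) and a maximum log-energy change as the temporal structure measure, evaluated over a region whose past end moves further back as the pitch lag grows. The function names, the log-ratio form of the energy change, the block size `hop`, and the region placement rule are all assumptions made for illustration.

```python
import math


def normalized_correlation(x, lag):
    """Normalized correlation of x[n] with x[n - lag] over their overlap."""
    if not 0 < lag < len(x):
        raise ValueError("lag must satisfy 0 < lag < len(x)")
    cur, past = x[lag:], x[:-lag]
    num = sum(a * b for a, b in zip(cur, past))
    den = math.sqrt(sum(a * a for a in cur) * sum(b * b for b in past))
    return num / den if den > 0.0 else 0.0


def energy_samples(x, hop):
    """Mean-square energy of consecutive non-overlapping blocks of `hop` samples."""
    return [sum(v * v for v in x[i:i + hop]) / hop
            for i in range(0, len(x) - hop + 1, hop)]


def temporal_structure_measure(x, frame_len, pitch_lag, hop, eps=1e-12):
    """Largest absolute log-energy change between consecutive energy samples.

    Assumption: the analysed region ends at the end of the current frame (the
    last `frame_len` samples of x) and starts `pitch_lag` samples before the
    frame start, so a lower pitch (longer lag) reaches further into the past.
    """
    start = max(0, len(x) - frame_len - pitch_lag)
    energies = energy_samples(x[start:], hop)
    changes = [abs(math.log((b + eps) / (a + eps)))
               for a, b in zip(energies, energies[1:])]
    return max(changes, default=0.0)


# Hypothetical test signals: a steady tone and the same tone after silence.
period = 50
tone = [math.sin(2.0 * math.pi * n / period) for n in range(500)]
onset = [0.0] * 300 + tone[:200]

harmonicity = normalized_correlation(tone, period)
flat = temporal_structure_measure(tone, 200, period, 50)
attack = temporal_structure_measure(onset, 200, period, 50)
```

For the steady tone the harmonicity is close to 1 and the temporal measure is close to 0. For the onset signal the temporal measure is large, which is the kind of situation where harmonicity alone would be a poor guide.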
Opening claim text (preview).
The invention claimed is:

1. An apparatus for performing a harmonicity-dependent controlling of a harmonic filter tool of an audio codec, comprising a harmonicity measurer configured to determine a measure of harmonicity of the audio signal, a temporal structure analyzer configured to determine at least one temporal structure measure measuring a characteristic of a temporal structure of the audio signal; a controller configured to control the harmonic filter tool depending on the temporal structure measure and the measure of harmonicity.
2. The apparatus according to claim 1, wherein the harmonicity measurer is configured to determine the measure of harmonicity by computing a normalized correlation of the audio signal or a pre-modified version thereof at or around a pitch-lag of the audio signal.
3. The apparatus according to claim 1, further comprising a pitch estimator configured to determine a pitch of the audio signal.
4. The apparatus according to claim 3, wherein the pitch estimator is configured to, within a first stage, determine a preliminary estimation of the pitch at a down-sampled domain of a first sample rate and, within a second stage, refine the preliminary estimation of the pitch at a second sample rate, higher than the first sample rate.
5. The apparatus according to claim 3, wherein the pitch estimator is configured to determine the pitch using autocorrelation.
6. The apparatus according to claim 3, wherein the temporal structure analyzer is configured to determine the at least one temporal structure measure within a temporal region temporally placed depending on the pitch.
7. The apparatus according to claim 6, wherein the temporal structure analyzer is configured to position a temporally past-heading end of the temporal region, or of a region of higher influence onto the determination of the temporal structure measure, depending on the pitch.
8. The apparatus according to claim 3, wherein the temporal structure analyzer is configured to position the temporal past-heading end of the temporal region or, of the region of higher influence onto the determination of the temporal structure measure, such that the temporally past-heading end of the temporal region or, of the region of higher influence onto the determination of the temporal structure measure, is displaced into past direction by a temporal amount monotonically increasing with a decrease of the pitch.
9. The apparatus according to claim 7, wherein the temporal structure analyzer is configured to position a temporally future-heading end of the temporal region or, of the region of higher influence onto the determination of the temporal structure measure, depending on the temporal structure of the audio signal within a temporal candidate region extending from the temporally past-heading end of the temporal region, or of the region of higher influence onto the determination of the temporal structure measure, to a temporally future-heading end of a current frame.
10. The apparatus according to claim 9, wherein the temporal structure analyzer is configured to use an amplitude or ratio between maximum and minimum energy samples within the temporal candidate region in order to position the temporally future-heading end of the temporal region or, of the region of higher influence onto the determination of the temporal structure measure.
11. The apparatus according to claim 1, wherein the controller comprises a logic configured to check whether a predetermined condition is met by the at least one temporal structure measure and the measure of harmonicity so as to achieve a check result; and a switch configured to switch between enabling and disabling the harmonic filter tool depending on the check result.
12. The apparatus according to claim 11, wherein the at least one temporal structure measure measures an average or maximum energy variation of the audio signal within the temporal region and the logic is configured such that the predetermined condition is met if both the at least one temporal structure measure is smaller than a predetermined first threshold and the measure of harmonicity is, for a current frame and/or a previous frame, above a second threshold.
13. The apparatus according to claim 12, wherein the logic is configured such that the predetermined condition is also met if the measure of harmonicity is, for a current frame, above a third threshold, and the measure of harmonicity is, for a current frame and/or a previous frame, above a fourth threshold which decreases with an increase of a pitch lag of the audio signal.
14. The apparatus according to claim 1, wherein the controller is configured to control the harmonic filter tool by explicitly signaling a control signal via an audio codec's data stream to a decoding side; or explicitly signaling a control signal via an audio codec's data stream to a decoding side for controlling a post-filter at the decoding side and, in line with the control of the post-filter at the decoding side, controlling a pre-filter at an encoder side.
15. The apparatus according to claim 1, wherein the temporal structure analyzer is configured to determine the at least one temporal structure measure in a spectrally discriminating manner so as to acquire one value of the at least one temporal structure measure per spectral band of a plurality of spectral bands.
16. The apparatus according to claim 1, wherein the controller is configured to control the harmonic filter tool at units of frames, and the temporal structure analyzer is configured to sample an energy of the audio signal at a sample rate higher than a frame rate of the frames so as to acquire energy samples of the audio signal and to determine the at least one temporal structure measure on the basis of the energy samples.
17. The apparatus according to claim 16, wherein the temporal structure analyzer is configured to determine the at least one temporal structure measure within a temporal region temporally placed depending on a pitch of the audio signal and the temporal structure analyzer is configured to determine the at least one temporal structure measure on the basis of the energy samples by computing a set of energy change values measuring a change between pairs of immediately consecutive energy samples of the energy samples within the temporal region and subjecting the set of energy change values to a scalar function comprising a maximum operator or a sum over addends each of which depends on exactly one of the set of energy change values.
18. The apparatus according to claim 16, wherein the temporal spectrum analyzer is configured to perform the sampling of the energy of the audio signal within a high-pass filtered domain.
19. The apparatus according to claim 3, wherein the pitch estimator, the harmonicity measurer and the temporal structure analyzer perform its determination based on different versions of the audio signal comprising the original audio signal and some pre-modified version thereof.
20. The apparatus according to claim 1, wherein the controller is configured to, in controlling the harmonic filter tool, depending on the temporal structure measure and the measure of harmonicity switch between enabling and disabling a pre-filter and/or a post-filter of the harmonic filter tool, or gradually adapt a filter strength of the pre-filter and/or the post-filter of the harmonic filter tool, wherein the harmonic filter tool is of a pre-filter plus post-filter approach and the pre-filter of the harmonic filter tool is configure
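The enable/disable condition described in claims 11 to 13 can be sketched as a small Python predicate. This is only an illustration under stated assumptions. The claims give no numeric thresholds, so every value in `Thresholds` is hypothetical. The "current frame and/or previous frame" wording is read here as "either frame". The linear form of the fourth threshold is one arbitrary way to make it decrease as the pitch lag increases.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Thresholds:
    """Hypothetical values; the claims do not specify any numbers."""
    first: float = 3.5           # upper bound on the temporal structure measure
    second: float = 0.6          # harmonicity bound used together with `first`
    third: float = 0.75          # harmonicity bound for the current frame
    fourth_base: float = 0.95    # fourth threshold at zero pitch lag
    fourth_slope: float = 0.001  # decrease of the fourth threshold per lag sample


def fourth_threshold(pitch_lag, t):
    """Fourth threshold: decreases as the pitch lag increases (assumed linear)."""
    return max(0.0, t.fourth_base - t.fourth_slope * pitch_lag)


def enable_harmonic_filter(temporal_measure, harm_cur, harm_prev, pitch_lag,
                           t=Thresholds()):
    """Return True if the harmonic filter tool should be enabled for this frame."""
    # "current and/or previous frame" is interpreted as "either one" (assumption).
    harm_either = max(harm_cur, harm_prev)
    # Claim 12: temporally flat signal with sufficient harmonicity.
    flat_and_harmonic = temporal_measure < t.first and harm_either > t.second
    # Claim 13: strong harmonicity, with a bound that relaxes for long pitch lags.
    strongly_harmonic = (harm_cur > t.third
                         and harm_either > fourth_threshold(pitch_lag, t))
    return flat_and_harmonic or strongly_harmonic


# Flat temporal structure, moderate harmonicity: enabled via the claim 12 branch.
case_flat = enable_harmonic_filter(1.0, 0.7, 0.5, 100)
# Strong energy variation, moderate harmonicity: disabled.
case_transient = enable_harmonic_filter(10.0, 0.7, 0.5, 100)
# Strong energy variation, high harmonicity, long lag: enabled via claim 13.
case_long_lag = enable_harmonic_filter(10.0, 0.8, 0.5, 400)
```

With these hypothetical numbers, the same harmonicity of 0.8 under strong energy variation enables the tool at a pitch lag of 400 samples but not at 100 samples, because the fourth threshold is lower for the longer lag.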
Pre-filtering or post-filtering · CPC title
the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders · CPC title
Noise substitution, i.e. substituting non-tonal spectral components by noisy source (comfort noise for discontinuous speech transmission G10L19/012) · CPC title
Mode decision, i.e. based on audio signal content versus external parameters · CPC title
Detection of transients or attacks for time/frequency resolution switching · CPC title