Estimation of background noise in audio signals

US11164590B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11164590-B2
Application numberUS-202016738504-A
CountryUS
Kind codeB2
Filing dateJan 9, 2020
Priority dateDec 19, 2013
Publication dateNov 2, 2021
Grant dateNov 2, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The disclosure relates to a background noise estimator and a method therein, for supporting sound activity detection in an audio signal segment. The method comprises reducing a current background noise estimate when the audio signal segment is determined to comprise music and the current background noise estimate exceeds a minimum value. This is to be performed when an energy level of an audio signal segment is more than a threshold higher than a long term minimum energy level, lt_min, which is determined over a plurality of preceding audio signal segments, or, when the energy level of the audio signal segment is less than a threshold higher than lt_min, but no pause is detected in the audio signal segment.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method by an apparatus, the method comprising: performing by at least one processor of the apparatus: when a current energy level of an audio signal segment is higher than a current sub-band noise estimate and no pause is in the audio signal segment, reducing the current sub-band noise estimate by reducing the current sub-band noise estimate by a defined amount for at least one sub-band of the audio signal segment. 2. The method according to claim 1 , wherein reducing the current sub-band noise estimate when the current energy level of the audio signal segment is higher than the current sub-band noise estimate and no pause is in the audio signal segment comprises reducing the current sub-band noise estimate by the defined amount for the at least one sub-band of the audio signal segment responsive to the audio signal segment comprising music when the current energy level of the audio signal segment is higher than the current sub-band noise estimate, and no pause is in the audio signal segment. 3. The method according to claim 1 , wherein no pause is in the audio signal segment when one or both of the following is fulfilled: a predefined number of consecutive preceding audio signal segments have been determined to comprise an active signal; a dynamic of an audio signal comprising an audio signal segment does not exceed a signal dynamics threshold. 4. The method according to claim 1 , further comprising responsive to the current sub-band noise estimate satisfying a defined rule, reducing the current sub-band noise estimate by the defined amount for the at least one sub-band of the audio signal segment. 5. The method according to claim 4 , wherein the current sub-band noise estimate satisfies the defined rule when the current sub-band noise estimate exceeds a predefined value. 6. The method according to claim 4 , wherein the current sub-band noise estimate satisfies the defined rule when an energy level of an audio signal segment is less than a threshold level higher than a long term minimum energy level (lt_min) determined over a plurality of preceding audio signal segments. 7. The method according to claim 6 , wherein an energy level of an audio signal segment is determined to be less than the threshold level higher than lt_min based on information derived from an input audio signal and not based on use of information from a sound activity detector. 8. The method according to claim 1 , wherein reducing the current sub-band noise estimate by the defined amount comprises: determining a step size to reduce the current sub-band noise estimate; and reducing the current sub-band noise estimate by the step size. 9. An apparatus comprising: at least one processor configured to perform operations comprising: when a current energy level of an audio signal segment is higher than a current sub-band noise estimate and no pause is in the audio signal segment, reducing the current sub-band noise estimate by reducing the current sub-band noise estimate by a defined amount for at least one sub-band of the audio signal segment. 10. The apparatus according to claim 9 , wherein reducing the current sub-band noise estimate when the current energy level of the audio signal segment is higher than the current sub-band noise estimate and no pause is in the audio signal segment comprises reducing the current sub-band noise estimate by the defined amount for the at least one sub-band of the audio signal segment responsive to the audio signal segment comprising music when the current energy level of the audio signal segment is higher than the current sub-band noise estimate, and no pause is in the audio signal segment. 11. The apparatus according to claim 9 , wherein no pause is considered to in the audio signal segment when one or both of the following is fulfilled: a predefined number of consecutive preceding audio signal segments have been determined to comprise an active signal; a dynamic of an audio signal comprising an audio signal segment does not exceed a signal dynamics threshold. 12. The apparatus according to claim 9 , wherein the at least one processor configured to perform further operations comprising: responsive to the current sub-band noise estimate satisfying a defined rule, reducing the current sub-band noise estimate by the defined amount for the at least one sub-band of the audio signal segment. 13. The apparatus according to claim 12 , wherein the current sub-band noise estimate satisfies the defined rule when the current sub-band noise estimate exceeds a predefined value. 14. The apparatus according to claim 12 , wherein the at least one processor performs further operations comprising setting a flag to indicate that the energy level of the audio signal segment is close to a long term minimum energy level (lt_min). 15. The apparatus according to claim 12 , wherein the current sub-band noise estimate satisfies the defined rule when an energy level of an audio signal segment is less than a threshold level higher than a long term minimum energy level (lt_min) determined over a plurality of preceding audio signal segments, wherein an energy level of an audio signal segment is determined to be less than the threshold level higher than lt_min based on information derived from an input audio signal, and not based on use of information from a sound activity detector. 16. The apparatus according to claim 12 , wherein in reducing the current sub-band noise estimate responsive to the current sub-band noise estimate satisfying the defined rule, the at least one processor performs operations comprising: determining a step size to reduce the current sub-band noise estimate; and reducing the current sub-band noise estimate by the step size. 17. A computer program product comprising a non-transitory computer readable storage medium storing instructions which, when executed on at least one processor, cause the at least one processor to perform operations comprising: when a current energy level of an audio signal segment is higher than a current sub-band noise estimate and no pause is detected in the audio signal segment, reducing the current sub-band noise estimate by reducing the current sub-band noise estimate by a defined amount for at least one sub-band of the audio signal segment. 18. The computer program product of claim 17 , wherein reducing the current sub-band noise estimate when the current energy level of the audio signal segment is higher than the current sub-band noise estimate and no pause is detected in the audio signal segment comprises reducing the current sub-band noise estimate by the defined amount for the at least one sub-band of the audio signal segment responsive to the audio signal segment comprising music when the current energy level of the audio signal segment is higher than the current sub-band noise estimate, and no pause is detected in the audio signal segment. 19. The computer program product of claim 17 , wherein no pause is considered to in the audio signal segment when one or both of the following is fulfilled: a predefined number of consecutive preceding audio signal segments have been determined to comprise an active signal; a dynamic of an audio signal comprising an audio signal segment does not exceed a signal dynamics threshold. 20. The computer program product of claim 17 , wherein the non-transitory computer readable storage medium stores further instructions which, when executed on the at least one processor, cause the at least one processor to perform fur

Assignees

Inventors

Classifications

  • Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title

  • Constructional arrangements · CPC title

  • the extracted parameters being correlation coefficients · CPC title

  • Processing or transfer of terminal data, e.g. status or physical capabilities · CPC title

  • G10L25/21Primary

    the extracted parameters being power information · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11164590B2 cover?
The disclosure relates to a background noise estimator and a method therein, for supporting sound activity detection in an audio signal segment. The method comprises reducing a current background noise estimate when the audio signal segment is determined to comprise music and the current background noise estimate exceeds a minimum value. This is to be performed when an energy level of an audio …
Who is the assignee on this patent?
Ericsson Telefon Ab L M
What technology area does this patent fall under?
Primary CPC classification G10L25/21. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 02 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).