Detector and method for voice activity detection

US9773511B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9773511-B2
Application numberUS-201013121305-A
CountryUS
Kind codeB2
Filing dateOct 18, 2010
Priority dateOct 19, 2009
Publication dateSep 26, 2017
Grant dateSep 26, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The embodiments of the present invention relates to a voice activity detector and a method thereof. The voice activity detector is configured to detect voice activity in a received input signal comprising an input section configured to receive a signal from a primary voice detector of said VAD indicative of a primary VAD decision and at least one signal from at least one external VAD indicative of a voice activity decision from the at least one external VAD, a processor configured to combine the voice activity decisions indicated in the received signals to generate a modified primary VAD decision, and an output section configured to send the modified primary VAD decision to a hangover addition unit of said VAD.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method in a first voice activity detector, VAD, for detecting voice activity in a received input signal, the method comprising: receiving a signal from a primary voice detector of said first VAD indicative of a primary voice activity decision made by the primary voice detector regarding voice activity in said input signal, wherein the primary voice activity decision is an intermediate voice activity decision of said first VAD in the sense that the primary voice activity decision is made by the first VAD without having been processed by a hangover addition unit of said first VAD, receiving one or more signals from one or more second VADs external to the first VAD each indicative of a voice activity decision made by a respective second VAD regarding voice activity in said input signal, each second VAD comprising its own primary voice detector and hangover addition unit distinct from that of said first VAD, combining the voice activity decisions indicated in the signal received from the primary voice detector of said first VAD and the one or more signals received from the one or more second VADs to generate a modified primary voice activity decision, and sending the modified primary voice activity decision to a hangover addition unit of said first VAD that is configured to make a final voice activity decision of said first VAD. 2. The method according to claim 1 , wherein the voice activity decisions in the signals received from the primary voice detector and the one or more second VADs are combined by a logical AND, the modified primary voice activity decision thereby indicating voice only if the signal from the primary voice detector and each signal from the one or more second VADs indicate voice. 3. The method according to claim 1 , wherein the voice activity decisions in the signals received from the primary voice detector and the one or more second VADs are combined by a logical OR, the modified primary voice activity decision thereby indicating voice if at least one signal of the signal from the primary voice detector and the one or more signals from the one or more second VADs indicate voice. 4. The method according to claim 1 , wherein at least one signal from a second VAD is a final voice activity decision made by that second VAD in the sense that the final voice activity decision is made by the second VAD after having been processed by the hangover addition unit of said second VAD. 5. The method according to claim 1 , wherein at least one signal from a second VAD is a primary voice activity decision made by a primary voice detector of that second VAD, the primary voice activity decision being an intermediate voice activity decision of the second VAD in the sense that the primary voice activity decision is made by the second VAD without having been processed by the hangover addition unit of said second VAD. 6. The method according to claim 1 , comprising receiving only one signal from one of said second VADs. 7. The method according to claim 1 , comprising receiving a plurality of signals from a plurality of said second VADs. 8. The method according to claim 1 , wherein the voice activity decisions indicated in the signals received from the primary voice detector and the one or more second VADs are combined in dependence on input signal properties. 9. The method according to claim 8 , wherein the input signal properties comprise at least one of estimated signal-to-noise-ratio and background characteristics. 10. A first voice activity detector, VAD, configured to detect voice activity in a received input signal, the first VAD comprising: an input circuit configured to: receive a signal from a primary voice detector of said first VAD indicative of a primary voice activity decision regarding voice activity in said input signal, wherein the primary voice activity decision is an intermediate voice activity decision of said first VAD in the sense that the primary voice activity decision is made by the first VAD without having been processed by a hangover addition unit of said first VAD, and receive one or more signals from one or more second VADs external to the first VAD each indicative of a voice activity decision made by a respective second VAD regarding voice activity in said input signal, each second VAD comprising its own primary voice detector and hangover addition unit distinct from that of said first VAD, a processor circuit configured to combine the voice activity decisions indicated in the signal received from the primary voice detector of said first VAD and the one or more signals received from the one or more second VADs to generate a modified primary voice activity decision, and an output circuit configured to send the modified primary voice activity decision to a hangover addition unit of said first VAD that is configured to make a final voice activity decision of said first VAD. 11. The first VAD according to claim 10 , wherein the processor circuit is configured to combine the voice activity decisions in the signals received from the primary voice detector and the one or more second VADs by a logical AND, the modified primary voice activity decision thereby indicating voice only if the signal from the primary voice detector and each signal from the one or more second VADs indicate voice. 12. The first VAD according to claim 10 , wherein the processor circuit is configured to combine the voice activity decisions in the signals received from the primary voice detector and the one or more second VADs by a logical OR, the modified primary voice activity decision thereby indicating voice if at least one signal of the signal from the primary voice detector and the one or more signals from the one or more second VADs indicate voice. 13. The first VAD according to claim 10 , wherein at least one signal from a second VAD is a final voice activity decision made by that second VAD in the sense that the final voice activity decision is made by the second VAD after having been processed by the hangover addition unit of said second VAD. 14. The first VAD according to claim 10 , wherein at least one signal from a second VAD is a primary voice activity decision made by a primary voice detector of that second VAD, the primary voice activity decision being an intermediate voice activity decision of the second VAD in the sense that the primary voice activity decision is made by the second VAD without having been processed by the hangover addition unit of said second VAD. 15. The first VAD according to claim 10 , wherein the input circuit is configured to receive only one signal from one of said second VADs. 16. The first VAD according to claim 10 , wherein the input circuit is configured to receive a plurality of signals from a plurality of said second VADs. 17. The first VAD according to claim 10 , wherein the voice activity decisions indicated in the signals received from the primary voice detector and the one or more second VADs are combined in dependence on input signal properties. 18. The first VAD according to claim 17 , wherein the input signal properties comprise at least one of estimated signal-to-noise-ratio and background characteristics. 19. The method according to claim 1 , wherein at least one of the one or more second VADs is configured to generate lower activity or introduce less speech clipping than the first VAD under certain input conditions comprising one or more of a certain noise level, a certain signal-to-noise ratio, and a certain noise characteristic. 20. The method accor

Assignees

Inventors

Classifications

  • G10L25/78Primary

    Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title

  • the extracted parameters being power information · CPC title

  • for discriminating voice from noise · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9773511B2 cover?
The embodiments of the present invention relates to a voice activity detector and a method thereof. The voice activity detector is configured to detect voice activity in a received input signal comprising an input section configured to receive a signal from a primary voice detector of said VAD indicative of a primary VAD decision and at least one signal from at least one external VAD indicative…
Who is the assignee on this patent?
Sehlstedt Martin, ERICSSON TELEFON AB L M (publ)
What technology area does this patent fall under?
Primary CPC classification G10L25/78. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 26 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).