VAD detection apparatus and method of operation the same

US9830913B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9830913-B2
Application numberUS-201514861113-A
CountryUS
Kind codeB2
Filing dateSep 22, 2015
Priority dateOct 29, 2013
Publication dateNov 28, 2017
Grant dateNov 28, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A microphone assembly includes an acoustic sensor and a voice activity detector on an integrated circuit coupled to an external-device interface. The acoustic sensor produces an electrical signal representative of acoustic energy detected by the sensor. A filter bank separates data representative of the acoustic energy into a plurality of frequency bands. A power tracker obtains a power estimate for at least one band, including a first estimate based on relatively fast changes in a power metric of the data and a second estimate based on relatively slow changes in a power metric of the data. The presence of voice activity in the electrical signal is based upon the power estimate.

First claim

Opening claim text (preview).

What is claimed is: 1. A method in a microphone assembly including an acoustic sensor and a voice activity detector on an integrated circuit coupled to an external-device interface of the microphone assembly, the method comprising: receiving acoustic energy at the acoustic sensor; filtering data representative of the acoustic energy into a plurality of bands; obtaining a power estimate for at least one of the plurality of bands, the power estimate including a first estimate based on relatively fast changes in a power metric of the data representative of the acoustic energy and a second estimate based on relatively slow changes in a power metric of the data representative of the acoustic energy; determining whether voice activity is present in the acoustic energy based on the power estimate for the at least one band. 2. The method of claim 1 , further comprising, determining a ratio of the first estimate and the second estimate of a corresponding band; and determining whether voice activity is present in the acoustic energy based on a comparison of the ratio to a predetermined threshold. 3. The method of claim 1 , obtaining a power estimate for each of the plurality of bands, each power estimate including a first estimate based on relatively fast changes in a power metric of the data representative of the acoustic energy and a second estimate based on relatively slow changes in a power metric of the data representative of the acoustic energy; determining multiple ratios based on the first estimate and the second estimate of the plurality of bands; determining whether voice activity is present in the acoustic energy based on a comparison of the multiple ratios to predetermined thresholds. 4. The method of claim 3 , further comprising summing results of the comparisons and determining whether voice activity is present in the acoustic energy based on the summation of results. 5. The method of claim 3 , determining the multiple ratios includes determining at least one ratio using the first estimate and the second estimate obtained for the same band. 6. The method of claim 3 , determining the multiple ratios includes determining at least one ratio using the first estimate obtained for one band and the second estimate obtained for another band. 7. The method of claim 1 , providing an interrupt signal at the external-device interface upon determining that voice activity is present in the acoustic energy. 8. A microphone assembly having an external-device interface, the microphone assembly comprising: an acoustic sensor having an acoustic input and an electrical output; a filter bank having an input coupled to the electrical output of the transducer, the filter bank configured to filter data representative of energy detected by the acoustic sensor into a plurality of frequency bands; a power tracker having an input coupled to an output of the filter bank, the power tracker configured to obtain a power estimate for at least one of the plurality of frequency bands, the power estimate including a first estimate based on relatively fast changes in a power metric of the data representative of the acoustic energy and a second estimate based on relatively slow changes in a power metric of the data representative of the acoustic energy; a comparison entity coupled to the output of the power tracker, the comparison entity configured to determine whether voice activity is present in the data representative of acoustic energy based upon the power estimate; and a signal generator configured to generate a wake up signal upon determining that voice activity is present in the data representative of acoustic energy. 9. The microphone assembly of claim 8 , the power tracker configured to determine a ratio of the first estimate and the second estimate of a corresponding frequency band, and the comparison entity configured to determine whether voice activity is present in the acoustic energy based on a comparison of the ratio to a predetermined threshold. 10. The microphone assembly of claim 8 , the power tracker configured to obtain a power estimate for each of the plurality of frequency bands, each power estimate including a first estimate based on relatively fast changes in a power metric of the data representative of the acoustic energy and a second estimate based on relatively slow changes in a power metric of the data representative of the acoustic energy, the power tracker configured to determine multiple ratios based on the first estimate and the second estimate of the plurality of frequency bands, the comparison entity configured to determine whether voice activity is present in the acoustic energy based on a comparison of the multiple ratios to predetermined thresholds. 11. The microphone assembly of claim 10 , the comparison entity configured to sum results of the comparisons and to determine whether voice activity is present in the acoustic energy based on the summation of results. 12. The microphone assembly of claim 10 , at least one of the multiple ratios includes a ratio of the first estimate and the second estimate for the same frequency band. 13. The microphone assembly of claim 10 , at least one of the multiple ratios includes a ratio of the first estimate obtained for one frequency band and the second estimate obtained for another frequency band. 14. The microphone assembly of claim 8 , a signal generator configured to provide the wake up signal at the external-device interface upon determining that voice activity is present in the acoustic energy. 15. The microphone assembly of claim 8 , wherein the filter bank, the power tracker, the comparison entity, and the signal generator are implemented on an integrated circuit of the microphone assembly. 16. A microphone assembly having an external-device interface, the microphone assembly comprising: an acoustic sensor having an acoustic input and an electrical output; an analog to digital (A/D) converter coupled to the acoustic sensor, the A/D converter configured to generate a data representative of an electrical signal generated by the acoustic sensor; a processor coupled to the A/D converter, the processor configured to: filter the data representative of the electrical signal into a plurality of bands; obtain a power estimate for at least one of the plurality of bands, the power estimate including a first estimate based on relatively fast changes in a power metric of the data representative of the acoustic energy and a second estimate based on relatively slow changes in a power metric of the data representative of the acoustic energy; determine whether voice activity is present in the data representative of the electrical signal based upon the power estimate; and generate a wake up signal upon determining that voice activity is present in the data representative of the electrical signal. 17. The microphone assembly of claim 16 , the processor further configured to determine a ratio of the first estimate and the second estimate and to determine whether voice activity is present in the data representative of the electrical signal based on a comparison of the ratio to a predetermined threshold. 18. The microphone assembly of claim 16 , the processor configured to obtain a power estimate for each of the plurality of bands, each power estimate including a first estimate based on relatively fast changes in a power metric of the data representative of the acoustic energy and a second estimate based on relatively slow changes in a power metric of the data representative of the acoustic energy,

Assignees

Inventors

Classifications

  • for combining the signals of two or more microphones (specially adapted for hearing aids H04R25/407) · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • using subband decomposition · CPC title

  • the extracted parameters being power information · CPC title

  • Word spotting · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9830913B2 cover?
A microphone assembly includes an acoustic sensor and a voice activity detector on an integrated circuit coupled to an external-device interface. The acoustic sensor produces an electrical signal representative of acoustic energy detected by the sensor. A filter bank separates data representative of the acoustic energy into a plurality of frequency bands. A power tracker obtains a power estimat…
Who is the assignee on this patent?
Knowles Electronics Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 28 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).