Speech Intelligibility

US2016372133A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016372133-A1
Application numberUS-201615180202-A
CountryUS
Kind codeA1
Filing dateJun 13, 2016
Priority dateJun 17, 2015
Publication dateDec 22, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A device including a processor and a memory is disclosed. The memory includes a noise spectral estimator to calculate noise spectral estimates from a sampled environmental noise, a speech spectral estimator to calculate speech spectral estimates from the input speech, a formant signal to noise ratio (SNR) estimator to calculate SNR estimates using the noise spectral estimates and speech spectral estimates within each formant detected in a speech spectrum. The memory also includes a formant boost estimator to calculate and apply a set of gain factors to each frequency component of the input speech such that the resulting SNR within each formant reaches a pre-selected target value.

First claim

Opening claim text (preview).

1 . A device, comprising: a processor; a memory, wherein the memory includes: a noise spectral estimator to calculate noise spectral estimates from a sampled environmental noise; a speech spectral estimator to calculate speech spectral estimates from a input speech; a formant signal to noise ratio (SNR) estimator to calculate SNR estimates using the noise spectral estimates and speech spectral estimates within each formant detected in the input speech; and a formant boost estimator to calculate and apply a set of gain factors to each frequency component of the input speech such that the resulting SNR within each formant reaches a pre-selected target value. 2 . The device of claim 1 , wherein the noise spectral estimator is configured to calculate noise spectral estimates through averaging, using a smoothing parameter and past spectral magnitude values obtained through a Discrete Fourier Transform of the sampled noise. 3 . The device of claim 1 , wherein the speech spectral estimator is configured to calculate the speech spectral estimates using a low order linear prediction filter. 4 . The device of claim 3 , wherein the low order linear prediction filter uses Levinson-Durbin algorithm. 5 . The device of claim 1 , wherein the formant SNR estimator is configured to calculate the formant SNR estimates using a ratio of speech and noise sums of squared spectral magnitudes estimates over a critical band centered on a formant center frequency, wherein the critical band is a frequency bandwidth of an auditory filter. 6 . The device of claim 1 , wherein the set of gain factors is calculated by multiplying each formant segment in the input speech by a pre-selected factor. 7 . The device of claim 1 , further including an output limiting mixer, wherein the formant boost estimator produces a filter to filter the input speech and an output of the filter combined with the input speech is passed through the output limiting mixer. 8 . The device of claim 7 , further including a formant unmasking filter to filter the input speech and inputting an output of the formant unmasking filter to the output limiting mixer. 9 . The device of claim 6 , wherein the each formant in the speech input is detected by a formant segmentation module, wherein the formant segmentation module segments the speech spectral estimates into formants. 10 . A method for performing an operation of improving speech intelligibility, comprising: receiving an input speech signal; calculating noise spectral estimates from a sampled environmental noise; calculating speech spectral estimates from the input speech; calculating formant signal to noise ratio (SNR) in the calculated noise spectral estimates and the speech spectral estimates; segmenting formants in the speech spectral estimates; and calculating formant boost factor for each of the formants based on the calculated formant boost estimates. 11 . The method of claim 10 , wherein the noise spectral estimates are calculated through a process of averaging, using a smoothing parameter and past spectral magnitude values obtained through a Discrete Fourier Transform of the sampled environmental noise. 12 . The method of claim 10 , wherein the calculating the noise spectral estimates includes calculating the speech spectral estimates using a low order linear prediction filter. 13 . The method of claim 12 , wherein the low order linear prediction filter uses Levinson-Durbin algorithm. 14 . The method of claim 10 , wherein the calculating the formant SNR estimates includes using a ratio of speech and noise sums of squared spectral magnitudes estimates over a critical band centered on a formant center frequency, wherein the critical band is a frequency bandwidth of an auditory filter. 15 . The method of claim 10 , wherein the set of gain factors is calculated by multiplying each formant segment in the input speech by a pre-selected factor. 16 . A computer program product comprising instructions which, when being executed by a processor, cause said processor to carry out or control the method of claim 10 .

Assignees

Inventors

Classifications

  • for improving intelligibility · CPC title

  • the extracted parameters being formant information · CPC title

  • Codebook for LPC parameters · CPC title

  • Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients · CPC title

  • characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016372133A1 cover?
A device including a processor and a memory is disclosed. The memory includes a noise spectral estimator to calculate noise spectral estimates from a sampled environmental noise, a speech spectral estimator to calculate speech spectral estimates from the input speech, a formant signal to noise ratio (SNR) estimator to calculate SNR estimates using the noise spectral estimates and speech spectra…
Who is the assignee on this patent?
Nxp Bv
What technology area does this patent fall under?
Primary CPC classification G10L21/0364. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Dec 22 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).