Voice processing method, apparatus, electronic device, and storage medium

US11587574B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11587574-B2
Application number  US-202117404175-A
Country             US
Kind code           B2
Filing date         Aug 17, 2021
Priority date       Feb 22, 2019
Publication date    Feb 21, 2023
Grant date          Feb 21, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided in the present disclosure are a voice processing method, an apparatus, an electronic device, and a storage medium, the method comprising: detecting the working state of a current call system, and when the working state is a two-end speaking state or a remote-end speaking state, performing compression processing on a subsequent remote-end voice signal, acquiring a near-end voice signal by means of a microphone, performing echo processing on the basis of the near-end voice signal and the compression-processed remote-end voice signal to obtain an echo-processed near-end voice signal and a remaining echo signal, performing non-linear suppression processing on the near-end voice signal and the remaining echo signal, and performing gain control on the suppression-processed near-end voice signal.
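
The order of operations in the abstract can be sketched as a small per-frame pipeline. This is only an illustration under stated assumptions: the names `State`, `compress`, `process_frame`, and the `toy_*` stages are hypothetical, the clamp used for compression is a stand-in, and the patent does not prescribe any of these implementations.

```python
from enum import Enum


class State(Enum):
    NEAR_END = "near_end"      # only the local talker is active
    REMOTE_END = "remote_end"  # only the far-end talker is active
    DUAL_END = "dual_end"      # both ends talk at once


def compress(frame, limit):
    """Hypothetical compressor: clamp each sample to [-limit, +limit]."""
    return [max(-limit, min(limit, s)) for s in frame]


def process_frame(state, far_frame, mic_frame, limit,
                  echo_stage, suppress_stage, gain_stage):
    """Run one frame through the stages in the order the abstract lists them.

    Returns (far-end frame after optional compression, processed near-end frame).
    """
    # Compress the far-end signal only in the two states named in the abstract.
    if state in (State.DUAL_END, State.REMOTE_END):
        far_frame = compress(far_frame, limit)
    # Echo processing yields the cleaned near-end signal and a residual echo.
    near, residual = echo_stage(mic_frame, far_frame)
    # Non-linear suppression uses both signals; gain control runs last.
    suppressed = suppress_stage(near, residual)
    return far_frame, gain_stage(suppressed, state)


# Toy stand-ins (assumptions) so the sketch runs end to end. A real system
# would plug in an adaptive echo canceller, a suppressor, and a gain control.
def toy_echo(mic, far):
    near = [m - 0.5 * f for m, f in zip(mic, far)]
    return near, [0.0] * len(mic)


def toy_suppress(near, residual):
    return [n - r for n, r in zip(near, residual)]


def toy_gain(frame, state):
    return [1.0 * s for s in frame]


played, sent = process_frame(State.REMOTE_END, [0.9, -0.8, 0.1],
                             [0.5, 0.0, 0.2], 0.5,
                             toy_echo, toy_suppress, toy_gain)
```

Passing the stages in as callables keeps the sketch neutral about how each stage is built; only the ordering and the state check come from the abstract.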

First claim

Opening claim text (preview).

What is claimed is:

1. A method for processing a voice, comprising: detecting a working state of a communication system currently; calculating a detection probability of a dual-end talk state; determining a second reference attenuation threshold based on a first reference attenuation threshold and the detection probability, wherein the first reference attenuation threshold is a default; attenuating a remote-end voice signal by the second reference attenuation threshold based on the working state being a dual-end talk state or a remote-end talk state, wherein the remote-end voice signal is subsequently received, and the subsequent remote-end voice signal, whose value is larger than the second reference attenuation threshold, is attenuated by adopting the second reference attenuation threshold; obtaining a first near-end voice signal by a microphone; obtaining a second near-end voice signal and a residual echo signal by performing echo processing on the first near-end voice signal and the attenuated remote-end voice signal, wherein the second near-end voice signal is subjected to echo processing; performing non-linear suppression processing on the second near-end voice signal and the residual echo signal; and performing gain control on a third near-end voice signal, wherein the third near-end voice signal is subjected to suppression processing, and a gain value of the gain control is different from the first reference attenuation threshold and the second reference attenuation threshold.

2. The method for processing the voice according to claim 1, wherein prior to attenuating the remote-end voice signal, the method further comprises: calculating an energy value of the remote-end voice signal; and determining whether the energy value is greater than the first reference attenuation threshold; and attenuating the remote-end voice signal based on the first reference attenuation threshold when the energy value is greater than the first reference attenuation threshold.

3. The method for processing the voice according to claim 1, wherein when the working state is the remote-end talk state, said performing gain control on the third near-end voice signal comprises: in response to original gain value being larger than 1, gradually decreasing the original gain value to approach 1 at a certain rate; in response to the original gain value being smaller than 1, maintaining the original gain value not changed.

4. The method for processing the voice according to claim 1, further comprising: performing equalization filtering processing on a low-frequency signal in the remote-end voice signal, prior to attenuating the remote-end voice signal; said attenuating the remote-end voice signal comprises: attenuating the remote-end voice signal subjected to equalization filtering processing.

5. Electronic equipment, comprising: a processor; and a memory configured to store instructions that are executed by the processor; wherein the processor is configured to: detect a working state of a communication system currently; calculate a detection probability of a dual-end talk state; determine a second reference attenuation threshold based on a first reference attenuation threshold and the detection probability, wherein the first reference attenuation threshold is a default; attenuate a remote-end voice signal by the second reference attenuation threshold based on the working state being a dual-end talk state or a remote-end talk state, wherein the remote-end voice signal is subsequently received, and the subsequent remote-end voice signal, whose value is larger than the second reference attenuation threshold, is attenuated by adopting the second reference attenuation threshold; obtain a first near-end voice signal by a microphone; obtain a second near-end voice signal and a residual echo signal by performing echo processing on the first near-end voice signal and the attenuated remote-end voice signal, wherein the second near-end voice signal is subjected to echo processing; perform non-linear suppression processing on the second near-end voice signal and the residual echo signal; and perform gain control on a third near-end voice signal, wherein the third near-end voice signal is subjected to suppression processing, and a gain value of the gain control is different from the first reference attenuation threshold and the second reference attenuation threshold.

6. The electronic equipment according to claim 5, wherein the processor is further configured to: calculate an energy value of the remote-end voice signal; and determine whether the energy value is greater than the first reference attenuation threshold; and attenuate the remote-end voice signal based on the first reference attenuation threshold when the energy value is greater than the first reference attenuation threshold.

7. The electronic equipment according to claim 5, wherein when the working state is the remote-end talk state, the processor is further configured to: in response to original gain value being larger than 1, gradually decrease the original gain value to approach 1 at a certain rate; in response to the original gain value being smaller than 1, maintain the original gain value not changed.

8. The electronic equipment according to claim 5, wherein the processor is further configured to: perform equalization filtering processing on a low-frequency signal in the remote-end voice signal, prior to attenuating the remote-end voice signal; attenuate the remote-end voice signal subjected to equalization filtering processing.

9. A non-transitory computer readable storage medium, wherein when instructions in the storage medium are executed by a processor of electronic equipment, the electronic equipment is caused to perform the method for processing the voice according to claim 1.

10. The method for processing the voice according to claim 1, wherein when the working state is the dual-end talk state, said performing gain control on the third near-end voice signal comprises: maintaining original gain value unchanged until the dual-end talk state is exited.

11. The electronic equipment according to claim 5, wherein the processor is further configured to: maintain original gain value unchanged until the dual-end talk state is exited.
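
The threshold and gain rules in claims 1, 3, and 10 can be made concrete with a short sketch. Several parts are assumptions and not taken from the claims: the linear formula in `second_threshold` (the claims say only that the second threshold is determined from the first threshold and the detection probability), the reading of "attenuated by adopting the threshold" as a clamp, the `step` decay rate, and all function names.

```python
import math


def second_threshold(first_threshold, dual_talk_prob):
    """Derive the second threshold from the default one and the probability.

    ASSUMPTION: a linear rule that lowers the threshold as dual-end talk
    becomes more likely. The claims do not give a formula.
    """
    p = min(1.0, max(0.0, dual_talk_prob))
    return first_threshold * (1.0 - 0.5 * p)


def attenuate(frame, threshold):
    """Limit samples whose magnitude exceeds `threshold` to the threshold.

    ASSUMPTION: one possible reading of claim 1; smaller samples pass through.
    """
    return [math.copysign(threshold, s) if abs(s) > threshold else s
            for s in frame]


def update_gain(gain, state, step=0.05):
    """Per-frame gain update following claims 3 and 10.

    `state` is "dual_end", "remote_end", or "near_end"; `step` is a
    hypothetical decay rate standing in for "a certain rate".
    """
    if state == "dual_end":
        return gain                       # claim 10: hold until state exits
    if state == "remote_end":
        if gain > 1.0:
            return max(1.0, gain - step)  # claim 3: decay toward 1
        return gain                       # claim 3: leave gains below 1 alone
    return gain  # near-end only: not covered by these claims, left unchanged


limit = second_threshold(0.8, 1.0)
clipped = attenuate([0.9, -0.7, 0.2], 0.5)
```

Clamping the decayed gain with `max(1.0, ...)` keeps it from dropping below 1, which matches the claim wording that the gain should approach 1.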

Assignees

Beijing Dajia Internet Information Tech Co Ltd

Inventors

Classifications

  • Arrangements for preventing acoustic feed-back (H04M1/62 takes precedence) · CPC title

  • Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic · CPC title

  • characterised by the method used for estimating noise · CPC title

  • H04M9/082 (Primary) · using echo cancellers (echo cancellers per se H04B3/23) · CPC title

  • the noise being echo, reverberation of the speech · CPC title


Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11587574B2 cover?
Provided in the present disclosure are a voice processing method, an apparatus, an electronic device, and a storage medium, the method comprising: detecting the working state of a current call system, and when the working state is a two-end speaking state or a remote-end speaking state, performing compression processing on a subsequent remote-end voice signal, acquiring a near-end voice signal …
Who is the assignee on this patent?
Beijing Dajia Internet Information Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L21/0216. Mapped technology areas include Physics.
When was this patent published?
Publication date Feb 21, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).