User selectable noise suppression in a voice communication

US12537012B2 · US · B2

Patent metadata
Publication number: US-12537012-B2
Application number: US-202218146842-A
Country: US
Kind code: B2
Filing date: Dec 27, 2022
Priority date: Dec 27, 2022
Publication date: Jan 27, 2026
Grant date: Jan 27, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for audio communication includes a memory configured to receive audio data from a user of a voice communication, and one or more processors in communication with the memory. The one or more processors are configured to receive the audio data for the voice communication, which includes voice data of the user and background audio data. The processors are further configured to classify the background audio data into a plurality of types of background audio data, determine to not suppress a subset of the plurality of types of background audio data, process the audio data to not suppress the subset of the plurality of types of background audio data to generate output audio data, and transmit the output audio data.
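The pass-through behavior described in the abstract can be illustrated with a minimal Python sketch. It assumes the classification step has already produced one separated sample stream per background type, which the abstract does not specify; the function name `pass_through_mix`, the type labels, and the sample values are all hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Set

Samples = List[float]


def pass_through_mix(voice: Samples,
                     background: Dict[str, Samples],
                     keep_types: Set[str]) -> Samples:
    """Return the voice plus only the background types in keep_types.

    Assumption: each background type is already available as its own
    sample stream, aligned with the voice stream.
    """
    out = list(voice)
    for kind, samples in background.items():
        if kind not in keep_types:
            continue  # this type is suppressed (dropped entirely in this sketch)
        for i, s in enumerate(samples[:len(out)]):
            out[i] += s
    return out


# Hypothetical example: keep the siren, drop the traffic noise.
voice = [0.5, 0.25, 0.0, -0.25]
background = {
    "siren": [0.125, 0.125, 0.125, 0.125],
    "traffic": [0.25, -0.25, 0.25, -0.25],
}
output = pass_through_mix(voice, background, keep_types={"siren"})
print(output)  # [0.625, 0.375, 0.125, -0.125]
```

In a real system the separation and classification would likely be done by a trained model on short audio frames; the sketch only shows the selection and mixing step.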

First claim

Opening claim text (preview).

What is claimed is:

1. An apparatus configured for audio communication, the apparatus comprising: a memory configured to receive audio data from a user of a voice communication; and one or more processors in communication with the memory, the one or more processors configured to: receive the audio data for the voice communication, the audio data including voice data of the user and background audio data; classify the background audio data into a plurality of types of background audio data; determine to suppress a subset of the plurality of types of background audio data based on a noise suppression configuration, wherein the noise suppression configuration includes the indication of a noise suppression aggressiveness; process the audio data to suppress the subset of the plurality of types of background audio data to generate output audio data based on the noise suppression aggressiveness; and transmit the output audio data.

2. The apparatus of claim 1, wherein to determine to suppress the subset of the plurality of types of background audio data, the one or more processors are configured to: receive an input from the user that specifies the noise suppression configuration that indicates the subset of the plurality of types of background audio data.

3. The apparatus of claim 2, further comprising a display, wherein the one or more processors are configured to: generate a graphical user interface on the display, wherein the graphical user interface provides a plurality of selections corresponding to the plurality of types of background audio data; receive one or more second indications, via the graphical user interface, which identifies the subset of the plurality of background audio data to suppress; and determine to suppress a subset of the plurality of types of background audio data based on the one more second indications.

4. The apparatus of claim 3, wherein to generate the graphical user interface, the one or more processors are configured to: adaptively detect the plurality of types of background audio data; and adaptively update the plurality of selections based on the plurality of types of background audio data based detected.

5. The apparatus of claim 1, wherein the subset of the plurality of types of background audio data include background audio data related to security.

6. The apparatus of claim 1, wherein to classify the background audio data into the plurality of types of background audio data, the one or more processors are configured to: process the audio data using an artificial intelligence process to identify the plurality of types of background audio data.

7. The apparatus of claim 6, wherein the artificial intelligence process is one or more of a neural network, an artificial neural network, a deep neural network, a predictive analytics system, supervised learning, unsupervised learning, semi-supervised learning, or transfer learning.

8. The apparatus of claim 1, wherein to process the audio data to suppress, based on the indication of noise aggressiveness, the subset of the plurality of types of background audio data to generate output audio data, the one or more processors are configured to: perform, on a first stream of the audio data, a noise suppression process to suppress the background audio data to generate voice only audio data; extract, on a second stream of the audio data, the subset of the plurality of types of background audio data; and combine the voice only audio data with the subset of the plurality of types of background audio data to generate the output audio data.

9. The apparatus of claim 1, wherein to process the audio data to suppress the subset of the plurality of types of background audio data to generate output audio data, the one or more processors are configured to: perform a noise suppression process on the audio data to suppress types of background audio data not in the subset of the plurality of types of background audio data to generate the output audio data.

10. The apparatus of claim 1, wherein the plurality of types of background audio data may include one or more of environmental noise, human-generated noise, animal-generated noise, mechanical noise, or electronic noise.

11. The apparatus of claim 1, wherein the apparatus is a mobile communications device.

12. The apparatus of claim 1, wherein to transmit the output audio data, the one or more processors are configured to: transmit the output audio data via a wireless communication standard.

13. The apparatus of claim 1, wherein the indication of the noise suppression aggressiveness indicates that a user set a specific level of noise suppression for the subset of the plurality of types of background audio data.

14. The apparatus of claim 1, wherein a user need not reselect what types of background noise, in the subset of the plurality of types of background audio data, are passed through for every voice communication.

15. The apparatus of claim 1, wherein the noise suppression configuration is reset and updated for each voice communication.

16. A method for audio communication, the method comprising: receiving audio data for the voice communication, the audio data including voice data of the user and background audio data; classifying the background audio data into a plurality of types of background audio data; determining to suppress a subset of the plurality of types of background audio data based on a noise suppression configuration, wherein the noise suppression configuration includes an indication of a noise suppression aggressiveness; processing the audio data to suppress the subset of the plurality of types of background audio data to generate output audio data based on a noise suppression configuration, wherein the noise suppression configuration includes the indication of a noise suppression aggressiveness; and transmitting the output audio data.

17. The method of claim 16, wherein determining to not suppress the subset of the plurality of types of background audio data comprises: receiving an input from the user that specifies the noise suppression configuration indicating the subset of the plurality of types of background audio data.

18. The method of claim 17, further comprising: generating a graphical user interface on a display, wherein the graphical user interface provides a plurality of selections corresponding to the plurality of types of background audio data; receiving one or more second indications, via the graphical user interface, which identifies the subset of the plurality of background audio data to suppress; and determining to suppress a subset of the plurality of types of background audio data based on the one more second indications.

19. The method of claim 18, wherein generating the graphical user interface comprises: adaptively detecting the plurality of types of background audio data; and adaptively updating the plurality of selections based on the plurality of types of background audio data based detected.

20. The method of claim 16, wherein the subset of the plurality of types of background audio data include background audio data related to security.

21. The method of claim 16, wherein classifying the background audio data into the plurality of types of background audio data comprises: processing the audio data using an artificial intelligence process to identify the plurality of types of background audio data.
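One possible reading of the noise suppression configuration in claims 1, 2, and 13 is sketched below in Python. The claims do not define a data structure or a scale for aggressiveness, so the `NoiseSuppressionConfig` class, the 0.0 to 1.0 range, the linear gain mapping, and the choice to leave unselected types unchanged are all assumptions made for illustration. As in any such toy example, the background types are assumed to be already separated and labeled.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set

Samples = List[float]


@dataclass
class NoiseSuppressionConfig:
    # Background types the user chose to suppress (hypothetical labels).
    suppress_types: Set[str] = field(default_factory=set)
    # Assumed scale: 0.0 leaves the type untouched, 1.0 removes it fully.
    aggressiveness: float = 1.0


def apply_config(voice: Samples,
                 background: Dict[str, Samples],
                 config: NoiseSuppressionConfig) -> Samples:
    """Mix voice with background, attenuating the selected types."""
    if not 0.0 <= config.aggressiveness <= 1.0:
        raise ValueError("aggressiveness must be between 0.0 and 1.0")
    out = list(voice)
    for kind, samples in background.items():
        # Selected types are scaled down; other types pass unchanged
        # (an assumption, since claim 1 does not say how they are handled).
        gain = 1.0 - config.aggressiveness if kind in config.suppress_types else 1.0
        for i, s in enumerate(samples[:len(out)]):
            out[i] += gain * s
    return out


# Hypothetical example: halve the keyboard noise, keep the doorbell.
voice = [0.5, 0.5]
background = {"keyboard": [0.25, -0.25], "doorbell": [0.125, 0.125]}
config = NoiseSuppressionConfig(suppress_types={"keyboard"}, aggressiveness=0.5)
result = apply_config(voice, background, config)
print(result)  # [0.75, 0.5]
```

Keeping the configuration as a separate object fits either persistence behavior in the dependent claims: it could be stored and reused across calls (claim 14) or rebuilt for each call (claim 15).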

Assignees

Qualcomm Inc

Inventors

Classifications

  • Feedback of the input speech

  • Procedures used during a speech recognition process, e.g. man-machine dialogue

  • Speech classification or search

  • Noise filtering

  • for comparison or discrimination

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12537012B2 cover?
An apparatus for audio communication includes a memory configured to receive audio data from a user of a voice communication, and one or more processors in communication with the memory. The one or more processors are configured to receive the audio data for the voice communication, which includes voice data of the user and background audio data. The processors are further configured to classif…
Who is the assignee on this patent?
Qualcomm Inc
What technology area does this patent fall under?
Primary CPC classification G10L21/0208. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jan 27, 2026 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).