Processing of audio signals from multiple microphones

US2023036986A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2023036986-A1
Application numberUS-202217814660-A
CountryUS
Kind codeA1
Filing dateJul 25, 2022
Priority dateJul 27, 2021
Publication dateFeb 2, 2023
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A first device includes a memory configured to store instructions and one or more processors configured to receive audio signals from multiple microphones. The one or more processors are configured to process the audio signals to generate direction-of-arrival information corresponding to one or more sources of sound represented in one or more of the audio signals. The one or more processors are also configured to and send, to a second device, data based on the direction-of-arrival information and a class or embedding associated with the direction-of-arrival information.

First claim

Opening claim text (preview).

What is claimed is: 1 . A first device comprising: a memory configured to store instructions; and one or more processors configured to: receive audio signals from multiple microphones; process the audio signals to generate direction-of-arrival information corresponding to one or more sources of sound represented in one or more of the audio signals; and send, to a second device, data based on the direction-of-arrival information and a class or embedding associated with the direction-of-arrival information. 2 . The first device of claim 1 , wherein the one or more processors are further configured to process signal data corresponding to the audio signals to determine the class or embedding. 3 . The first device of claim 2 , wherein the one or more processors are further configured to perform a beamforming operation on the audio signals to generate the signal data. 4 . The first device of claim 2 , wherein the one or more processors are further configured to process the signal data at one or more classifiers to determine the class from among multiple classes supported by the one or more classifiers, for a sound represented in one or more of the audio signals and associated with an audio event, and wherein the class is sent to the second device. 5 . The first device of claim 2 , wherein the one or more processors are further configured to process the signal data at one or more encoders to generate the embedding, the embedding corresponding to a sound represented in one or more of the audio signals and associated with an audio event, and wherein the embedding is sent to the second device. 6 . The first device of claim 1 , wherein one or more processors are further configured to process image data at one or more encoders to generate the embedding, the embedding corresponding to an object represented in the image data and associated with an audio event, and wherein the embedding is sent to the second device. 7 . The first device of claim 6 , further comprising one or more cameras configured to generate the image data. 8 . The first device of claim 1 , wherein: the class corresponds to a category for a particular sound represented in the audio signals and associated with a particular audio event; and the embedding includes a signature or information that corresponds to the particular sound or the particular audio event and is configured to enable detection, via processing of other audio signals, of the particular sound or the particular audio event in the other audio signals. 9 . The first device of claim 1 , wherein the one or more processors are further configured to: perform spatial processing on the audio signals based on the direction-of-arrival information to generate one or more beamformed audio signals; and send the one or more beamformed audio signals to the second device. 10 . The first device of claim 1 , wherein the memory and the one or more processors are integrated into a headset device, and wherein the second device corresponds to a mobile phone. 11 . The first device of claim 1 , further comprising a modem, wherein the data is sent to the second device via the modem. 12 . The first device of claim 1 , wherein the one or more processors are further configured to send a representation of the audio signals to the second device. 13 . The first device of claim 12 , wherein the representation of the audio signals corresponds to one or more beamformed audio signals. 14 . The first device of claim 1 , wherein the one or more processors are further configured to generate a user interface output indicative of at least one of an environmental event or an acoustic event. 15 . The first device of claim 1 , wherein the one or more processors are further configured to receive, from the second device, data indicative of an acoustic event. 16 . The first device of claim 1 , wherein the one or more processors are further configured to: receive, from the second device, directional information associated with the audio signals; and perform an audio zoom operation based on the directional information. 17 . The first device of claim 1 , wherein the one or more processors are integrated in a vehicle. 18 . The first device of claim 1 , wherein the data based on the direction-of-arrival information includes a report indicating at least one detected event and a direction of the detected event. 19 . The first device of claim 1 , further comprising the multiple microphones. 20 . The first device of claim 1 , further comprising at least one speaker configured to output a sound associated with at least one of the audio signals. 21 . A method of processing audio, the method comprising: receiving, at one or more processors of a first device, audio signals from multiple microphones; processing the audio signals to generate direction-of-arrival information corresponding to one or more sources of sound represented in one or more of the audio signals; and sending, to a second device, data based on the direction-of-arrival information and a class or embedding associated with the direction-of-arrival information. 22 . The method of claim 21 , further comprising processing signal data corresponding to the audio signals to determine the class or embedding. 23 . The method of claim 22 , further comprising performing a beamforming operation on the audio signals to generate the signal data. 24 . The method of claim 22 , wherein the signal data is processed at one or more classifiers to determine the class from among multiple classes supported by the one or more classifiers, for a sound represented in one or more of the audio signals and associated with an audio event, and wherein the class is sent to the second device. 25 . The method of claim 22 , wherein the signal data is processed at one or more encoders to generate the embedding, the embedding corresponding to a sound represented in one or more of the audio signals and associated with an audio event, and wherein the embedding is sent to the second device. 26 . The method of claim 21 , further comprising sending a representation of the audio signals to the second device. 27 . The method of claim 21 , further comprising: receiving, at one or more processors of the second device, the data based on the direction-of-arrival information and the class; obtaining, at the one or more processors of the second device, audio data representing a sound associated with the direction-of-arrival information and the class; and verifying the class, at the one or more processors of the second device, based on at least the audio data and the direction-of-arrival information. 28 . The method of claim 21 , further comprising: receiving, at one or more processors of the second device, the data based on the direction-of-arrival information and the embedding; and processing, at the one or more processors of the second device, audio data representing a sound scene based on the direction-of-arrival information and the embedding to generate modified audio data corresponding to an updated sound scene. 29 . A non-transitory computer-readable medium comprising instructions that, when executed by one or more processors of a first device, cause the one or more processors to: receive audio signals from multiple microphones; process the audio signals to generate direct

Assignees

Inventors

Classifications

  • Management of the audio stream, e.g. setting of volume, audio stream path · CPC title

  • for comparison or discrimination · CPC title

  • Hearing devices using active noise cancellation · CPC title

  • Microphone arrays; Beamforming · CPC title

  • Direction finding using a sum-delay beam-former · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023036986A1 cover?
A first device includes a memory configured to store instructions and one or more processors configured to receive audio signals from multiple microphones. The one or more processors are configured to process the audio signals to generate direction-of-arrival information corresponding to one or more sources of sound represented in one or more of the audio signals. The one or more processors are…
Who is the assignee on this patent?
Qualcomm Inc
What technology area does this patent fall under?
Primary CPC classification H04R1/1041. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Feb 02 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).