Detecting an audio signal with a microphone to determine presence of a playback device

US11988784B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11988784-B2
Application numberUS-202117460708-A
CountryUS
Kind codeB2
Filing dateAug 30, 2021
Priority dateAug 31, 2020
Publication dateMay 21, 2024
Grant dateMay 21, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An example involves detecting, during a time period by a receiving device, an audio signal comprising a superposition of a plurality of audio signals transmitted by a plurality of respective playback devices. The example further comprises determining a time-frequency representation of the audio signal and obtaining data associating playback devices with respective reference audio signals, each reference audio signal having a predefined time-frequency representation. The example further comprises, based on the time-frequency representation of the audio signal and the data, determining that the audio signal comprises a superposition of a first audio signal transmitted by a first playback device and a second audio signal transmitted by a second playback device.

First claim

Opening claim text (preview).

We claim: 1. A receiving device comprising: a microphone configured to receive an audio signal comprising a superposition of a plurality of audio signals transmitted by a plurality of respective playback devices; one or more processors; and a non-transitory, computer-readable medium storing instructions that, when executed by the one or more processors, instruct the receiving device to: determine a time-frequency representation of the audio signal received by the microphone; obtain data associating playback devices with respective reference audio signals, each reference audio signal having a predefined time-frequency representation; based on the time-frequency representation of the audio signal and the data, determine that the audio signal comprises a superposition of a first audio signal transmitted by a first playback device and a second audio signal transmitted by a second playback device; based on the audio signal, determine a first magnitude associated with the first audio signal and a second magnitude associated with the second audio signal; determine that the first magnitude is greater than the second magnitude; determine that the first playback device is arranged in closer proximity than the second playback device to the receiving device; and transition a playback session between the receiving device and the first playback device. 2. The receiving device of claim 1 , wherein the first playback device is playing back audio content in the playback session, and wherein instructing the receiving device to transition the playback session between the receiving device and the first playback device comprises instructing the receiving device to: transition the playback session from the first playback device to the receiving device, such that the receiving device plays back the audio content. 3. The receiving device of claim 1 , wherein the receiving device is playing back audio content in the playback session, and wherein instructing the receiving device to transition the playback session between the receiving device and the first playback device comprises instructing the receiving device to: transition the playback session from the receiving device to the first playback device, such that the first playback device plays back the audio content. 4. The receiving device of claim 1 , wherein each reference audio signal comprises a plurality of tones, wherein each of the plurality of tones has a predefined frequency and initiation time. 5. The receiving device of claim 4 , wherein at least some of the plurality of tones are spaced apart in time by an interval of time. 6. The receiving device of claim 4 , wherein the predefined frequencies are selected from a predefined set of frequencies and wherein a frequency difference between each frequency within the predefined set of frequencies is greater than about 50 Hz. 7. The receiving device of claim 1 , further comprising a network interface configured to transmit, to the plurality of playback devices, a command to cause the plurality of playback devices to output respective reference audio signals concurrently. 8. A method comprising: detecting, during a time period by a receiving device, an audio signal comprising a superposition of a plurality of audio signals transmitted by a plurality of respective playback devices; determining a time-frequency representation of the audio signal; obtaining data associating playback devices with respective reference audio signals, each reference audio signal having a predefined time-frequency representation; and based on the time-frequency representation of the audio signal and the data, determining that the audio signal comprises a superposition of a first audio signal transmitted by a first playback device and a second audio signal transmitted by a second playback device. 9. The method of claim 8 , further comprising: based on the audio signal, determining a first magnitude associated with the first audio signal and a second magnitude associated with the second audio signal; determining that the first magnitude is greater than the second magnitude; and determining that the first playback device is arranged closer than the second playback device to the receiving device. 10. The method of claim 9 , wherein the first playback device is playing back audio content in a playback session, and the method further comprises transitioning the playback session from the first playback device to the receiving device, such that the receiving device plays back the audio content. 11. The method of claim 8 , wherein determining the time-frequency representation of the audio signal comprises: decomposing a first portion of the audio signal into constituent frequencies, wherein the first portion of the audio signal is received during a first portion of the time period; and decomposing a second portion of the audio signal into constituent frequencies, wherein the second portion of the audio signal is received during a second portion of the time period. 12. The method of claim 8 , wherein each reference audio signal comprises a plurality of tones, wherein each of the plurality of tones has a predefined frequency and initiation time. 13. The method of claim 12 , wherein at least some of the plurality of tones are spaced apart in time by an interval of time. 14. The method of claim 13 , wherein the interval of time is between about 10 ms and about 50 ms. 15. The method of claim 14 , wherein a first interval of time within the reference audio signal differs to at least another interval of time within the reference audio signal. 16. The method of claim 12 , wherein the predefined frequencies are selected from a predefined set of frequencies and wherein a frequency difference between each frequency within the predefined set of frequencies is greater than about 50 Hz. 17. The method of claim 12 , wherein the predefined frequencies are selected from a predefined set of frequencies, and wherein each frequency within the predefined set of frequencies is greater than about 18 kHz. 18. The method of claim 12 , wherein each of the plurality of tones has a duration of between about 10 ms and about 20 ms. 19. The method of claim 8 , wherein each reference audio signal has a duration of between about 400 ms and about 800 ms. 20. The method of claim 8 , further comprising causing the plurality of playback devices to output respective reference audio signals.

Assignees

Inventors

Classifications

  • G01S7/539Primary

    using analysis of echo signal for target characterisation; Target signature; Target cross-section · CPC title

  • Extracting wanted echo signals {(Doppler systems G01S15/50)} · CPC title

  • Management of the audio stream, e.g. setting of volume, audio stream path · CPC title

  • Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Aspects of volume control, not necessarily automatic, in sound systems · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11988784B2 cover?
An example involves detecting, during a time period by a receiving device, an audio signal comprising a superposition of a plurality of audio signals transmitted by a plurality of respective playback devices. The example further comprises determining a time-frequency representation of the audio signal and obtaining data associating playback devices with respective reference audio signals, each …
Who is the assignee on this patent?
Sonos Inc
What technology area does this patent fall under?
Primary CPC classification G01S7/539. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 21 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).