Automatic transfer of audio-related task to a smart speaker
US-2020029167-A1 · Jan 23, 2020 · US
US11988784B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11988784-B2 |
| Application number | US-202117460708-A |
| Country | US |
| Kind code | B2 |
| Filing date | Aug 30, 2021 |
| Priority date | Aug 31, 2020 |
| Publication date | May 21, 2024 |
| Grant date | May 21, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An example involves detecting, during a time period by a receiving device, an audio signal comprising a superposition of a plurality of audio signals transmitted by a plurality of respective playback devices. The example further comprises determining a time-frequency representation of the audio signal and obtaining data associating playback devices with respective reference audio signals, each reference audio signal having a predefined time-frequency representation. The example further comprises, based on the time-frequency representation of the audio signal and the data, determining that the audio signal comprises a superposition of a first audio signal transmitted by a first playback device and a second audio signal transmitted by a second playback device.
Opening claim text (preview).
We claim: 1. A receiving device comprising: a microphone configured to receive an audio signal comprising a superposition of a plurality of audio signals transmitted by a plurality of respective playback devices; one or more processors; and a non-transitory, computer-readable medium storing instructions that, when executed by the one or more processors, instruct the receiving device to: determine a time-frequency representation of the audio signal received by the microphone; obtain data associating playback devices with respective reference audio signals, each reference audio signal having a predefined time-frequency representation; based on the time-frequency representation of the audio signal and the data, determine that the audio signal comprises a superposition of a first audio signal transmitted by a first playback device and a second audio signal transmitted by a second playback device; based on the audio signal, determine a first magnitude associated with the first audio signal and a second magnitude associated with the second audio signal; determine that the first magnitude is greater than the second magnitude; determine that the first playback device is arranged in closer proximity than the second playback device to the receiving device; and transition a playback session between the receiving device and the first playback device. 2. The receiving device of claim 1 , wherein the first playback device is playing back audio content in the playback session, and wherein instructing the receiving device to transition the playback session between the receiving device and the first playback device comprises instructing the receiving device to: transition the playback session from the first playback device to the receiving device, such that the receiving device plays back the audio content. 3. The receiving device of claim 1 , wherein the receiving device is playing back audio content in the playback session, and wherein instructing the receiving device to transition the playback session between the receiving device and the first playback device comprises instructing the receiving device to: transition the playback session from the receiving device to the first playback device, such that the first playback device plays back the audio content. 4. The receiving device of claim 1 , wherein each reference audio signal comprises a plurality of tones, wherein each of the plurality of tones has a predefined frequency and initiation time. 5. The receiving device of claim 4 , wherein at least some of the plurality of tones are spaced apart in time by an interval of time. 6. The receiving device of claim 4 , wherein the predefined frequencies are selected from a predefined set of frequencies and wherein a frequency difference between each frequency within the predefined set of frequencies is greater than about 50 Hz. 7. The receiving device of claim 1 , further comprising a network interface configured to transmit, to the plurality of playback devices, a command to cause the plurality of playback devices to output respective reference audio signals concurrently. 8. A method comprising: detecting, during a time period by a receiving device, an audio signal comprising a superposition of a plurality of audio signals transmitted by a plurality of respective playback devices; determining a time-frequency representation of the audio signal; obtaining data associating playback devices with respective reference audio signals, each reference audio signal having a predefined time-frequency representation; and based on the time-frequency representation of the audio signal and the data, determining that the audio signal comprises a superposition of a first audio signal transmitted by a first playback device and a second audio signal transmitted by a second playback device. 9. The method of claim 8 , further comprising: based on the audio signal, determining a first magnitude associated with the first audio signal and a second magnitude associated with the second audio signal; determining that the first magnitude is greater than the second magnitude; and determining that the first playback device is arranged closer than the second playback device to the receiving device. 10. The method of claim 9 , wherein the first playback device is playing back audio content in a playback session, and the method further comprises transitioning the playback session from the first playback device to the receiving device, such that the receiving device plays back the audio content. 11. The method of claim 8 , wherein determining the time-frequency representation of the audio signal comprises: decomposing a first portion of the audio signal into constituent frequencies, wherein the first portion of the audio signal is received during a first portion of the time period; and decomposing a second portion of the audio signal into constituent frequencies, wherein the second portion of the audio signal is received during a second portion of the time period. 12. The method of claim 8 , wherein each reference audio signal comprises a plurality of tones, wherein each of the plurality of tones has a predefined frequency and initiation time. 13. The method of claim 12 , wherein at least some of the plurality of tones are spaced apart in time by an interval of time. 14. The method of claim 13 , wherein the interval of time is between about 10 ms and about 50 ms. 15. The method of claim 14 , wherein a first interval of time within the reference audio signal differs to at least another interval of time within the reference audio signal. 16. The method of claim 12 , wherein the predefined frequencies are selected from a predefined set of frequencies and wherein a frequency difference between each frequency within the predefined set of frequencies is greater than about 50 Hz. 17. The method of claim 12 , wherein the predefined frequencies are selected from a predefined set of frequencies, and wherein each frequency within the predefined set of frequencies is greater than about 18 kHz. 18. The method of claim 12 , wherein each of the plurality of tones has a duration of between about 10 ms and about 20 ms. 19. The method of claim 8 , wherein each reference audio signal has a duration of between about 400 ms and about 800 ms. 20. The method of claim 8 , further comprising causing the plurality of playback devices to output respective reference audio signals.
using analysis of echo signal for target characterisation; Target signature; Target cross-section · CPC title
Extracting wanted echo signals {(Doppler systems G01S15/50)} · CPC title
Management of the audio stream, e.g. setting of volume, audio stream path · CPC title
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Aspects of volume control, not necessarily automatic, in sound systems · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.