Focus Session at a Voice Interface Device
US-2018122378-A1 · May 3, 2018 · US
US11094319B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11094319-B2 |
| Application number | US-201916557734-A |
| Country | US |
| Kind code | B2 |
| Filing date | Aug 30, 2019 |
| Priority date | Aug 30, 2019 |
| Publication date | Aug 17, 2021 |
| Grant date | Aug 17, 2021 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
While a media content item is emitted by a second electronic device that is remote from the first electronic device, the first electronic device receives data that includes: timing information, offset information that indicates a difference between an initial position of the media content item and a current playback position of the media content item, and an audio stream that corresponds to the media content item. The first electronic device detects ambient sound that includes sound corresponding to the media content item emitted by the second electronic device. The first electronic device generates a cleaned version of the ambient sound by using the timing information and the offset information to align the audio stream with the ambient sound and performing a subtraction operation to substantially subtract the audio stream from the ambient sound.
Opening claim text (preview).
What is claimed is: 1. A method performed by a first electronic device, the method comprising: while a media content item is emitted by a second electronic device that is remote from the first electronic device: receiving data that includes: timing information, offset information that indicates a difference between an initial position of the media content item and a current playback position of the media content item, and an audio stream that corresponds to the media content item; detecting ambient sound that includes sound corresponding to the media content item emitted by the second electronic device; and generating a cleaned version of the ambient sound, including: using the timing information and the offset information to align the audio stream with the ambient sound; and performing a subtraction operation to substantially subtract the audio stream from the ambient sound. 2. The method of claim 1 , wherein the timing information includes a clock signal. 3. The method of claim 1 , wherein the timing information includes an indication of latency between the second electronic device and the first electronic device. 4. The method of claim 1 , wherein the offset information includes an indication of a time duration between the initial position of the media content item and the current playback position of the media content item. 5. The method of claim 1 , wherein the offset information includes an indication of a data amount that corresponds to the difference between the initial position of the media content item and the current playback position of the media content item. 6. The method of claim 1 , wherein the timing information is received from the second electronic device. 7. The method of claim 1 , wherein the timing information is received from a first server. 8. The method of claim 1 , wherein the offset information is received from the second electronic device. 9. The method of claim 1 , wherein the offset information is received from a first server. 10. The method of claim 1 , wherein the audio stream has a lower data rate than the media content item that is provided to the second electronic device. 11. The method of claim 1 , wherein the audio stream is received from the second electronic device. 12. The method of claim 1 , wherein the audio stream is received from a first server. 13. The method of claim 1 , wherein the audio stream is received from a second server distinct from a first server. 14. The method of claim 1 , wherein the timing information is embedded in the audio stream. 15. The method of claim 1 , including analyzing the cleaned version of the ambient sound to determine whether a command is present in them ambient sound. 16. The method of claim 1 , wherein the first electronic device is not playing the media content item. 17. The method of claim 1 , wherein the first electronic device is playing the media content item. 18. A first electronic device comprising: one or more processors; and memory storing instructions for execution by the one or more processors, the instructions including instructions for: while a media content item is emitted by a second electronic device that is remote from the first electronic device: receiving data that includes: timing information, offset information that indicates a difference between an initial position of the media content item and a current playback position of the media content item, and an audio stream that corresponds to the media content item; detecting ambient sound that includes sound corresponding to the media content item emitted by the second electronic device; and generating a cleaned version of the ambient sound, including: using the timing information and the offset information to align the audio stream with the ambient sound; and performing a subtraction operation to substantially subtract the audio stream from the ambient sound. 19. A non-transitory computer-readable storage medium storing instructions for execution by a first electronic device having one or more processors, the instructions including instructions for: while a media content item is emitted by a second electronic device that is remote from a first electronic device: receiving data that includes: timing information, offset information that indicates a difference between an initial position of the media content item and a current playback position of the media content item, and an audio stream that corresponds to the media content item; detecting ambient sound that includes sound corresponding to the media content item emitted by the second electronic device; and generating a cleaned version of the ambient sound, including: using the timing information and the offset information to align the audio stream with the ambient sound; and performing a subtraction operation to substantially subtract the audio stream from the ambient sound.
Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title
Execution procedure of a spoken command · CPC title
for discriminating voice from noise · CPC title
Management of the audio stream, e.g. setting of volume, audio stream path · CPC title
Number of inputs available containing the signal or the noise to be suppressed · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.