Controller for audio device and associated operation method
US-2015063580-A1 · Mar 5, 2015 · US
US12360734B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12360734-B2 |
| Application number | US-202318484198-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 10, 2023 |
| Priority date | May 10, 2018 |
| Publication date | Jul 15, 2025 |
| Grant date | Jul 15, 2025 |
Official abstract text for this publication.
Systems and methods for media playback via a media playback system include (i) capturing a voice input comprising a request for media content, (ii) receiving information derived at least from the request for media content, (iii) requesting and receiving information from at least one remote computing device associated with a first media content service and at least one remote computing device associated with a second media content service, wherein (a) the information identifies first media content available via the first media content service for playback and identifies second media content available via the second media content service for playback, and (b) the first and second media content are related to the requested media content, and (iv) after receiving at least one of the first information and the second information, (a) selecting the first media content instead of the second media content, and (b) playing back the first media content.
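The selection step in the abstract can be illustrated with a minimal sketch. The abstract does not say how the first media content is chosen over the second, so the ordered preference list below is an assumption made for illustration only. `MediaItem`, `select_media`, and the service names are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class MediaItem:
    """Hypothetical record for media content reported by a service."""
    service: str
    title: str
    uri: str


# A service is modeled as a callable that maps a query to an optional item.
ServiceLookup = Callable[[str], Optional[MediaItem]]


def select_media(
    query: str,
    services: Dict[str, ServiceLookup],
    preference: List[str],
) -> Optional[MediaItem]:
    """Ask every service for content related to the query, then pick one.

    Assumption: selection follows a fixed preference order. The abstract
    only states that one item is selected instead of the other.
    """
    results: Dict[str, MediaItem] = {}
    for name, lookup in services.items():
        item = lookup(query)
        if item is not None:
            results[name] = item

    # Return the item from the most preferred service that answered.
    for name in preference:
        if name in results:
            return results[name]
    return None
```

A caller would pass one lookup per media content service and hand the returned item's `uri` to the playback device.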
Opening claim text (preview).
The invention claimed is:

1. A media playback system comprising: one or more processors; at least one network microphone device (NMD); and tangible, non-transitory, computer-readable media storing instructions executable by one or more processors to cause the media playback system to perform operations comprising: capturing a first voice input via one or more microphones of the NMD, wherein the first voice input comprises a user request; transmitting the first voice input to one or more first remote computing devices associated with a voice assistant service for deriving intent information regarding the request based at least on the first voice input; receiving a first response from the one or more first remote computing devices, the first response comprising first information associated with audio content; outputting an audio response via one or more audio transducers of the NMD based on the first response; capturing a second voice input via the one or more microphones of the NMD, wherein the second voice input comprises a request for media content; transmitting the second voice input to the one or more first remote computing devices for deriving intent information regarding the request for media content based at least on the second voice input; receiving a second response from the one or more first remote computing devices, wherein the second response comprises the derived intent information and an identified media content service; based at least in part on the derived intent information, requesting, independent of the voice assistant service, media content information directly from one or more second remote computing devices hosting the identified media content service; receiving, independent of the voice assistant service, second information from the one or more second remote computing devices, wherein the second information identifies media content available via the media content service for playback; and independent of the voice assistant service, playing back the media content via the NMD.

2. The media playback system of claim 1, wherein the first information associated with the audio content comprises at least one of: a storage address, a link, a URL, or a file.

3. The media playback system of claim 1, wherein the first information associated with the audio content comprises a voice response from the voice assistant service.

4. The media playback system of claim 1, further comprising one or more third remote computing devices, wherein the receiving, independent of the voice assistant service, the second information from the one or more third remote computing devices comprises receiving the second information via the one or more third remote computing devices.

5. The media playback system of claim 4, wherein the operations further comprise, after receiving the second information, (i) transmitting a uniform resource identifier (URI) or uniform resource locator (URL) associated with the media content from the one or more third remote computing devices of the media playback system to the NMD, and (ii) requesting, via the NMD, the media content, via the URI or URL, from the one or more third remote computing devices of the media content service for playback.

6. The media playback system of claim 4, wherein the requesting, via the media playback system and independent of the voice assistant service, media content information from one or more second remote computing devices hosting the identified media content service comprises transmitting a request from the one or more third remote computing devices of the media playback system to the one or more second remote computing devices hosting the identified media content service.

7. The media playback system of claim 1, wherein the derived intent information comprises a predefined data structure including one or more media content attributes, and wherein requesting media content information from the media content service comprises querying the media content service for media corresponding to the media content attributes.

8. A method performed by a media playback system comprising a network microphone device (NMD), the method comprising: capturing a first voice input via one or more microphones of the NMD, wherein the first voice input comprises a user request; transmitting the first voice input to one or more first remote computing devices associated with a voice assistant service for deriving intent information regarding the request based at least on the first voice input; receiving a first response from the one or more first remote computing devices, the first response comprising first information associated with audio content; outputting an audio response via one or more audio transducers of the NMD based on the first response; capturing a second voice input via the NMD, wherein the second voice input comprises a request for media content; transmitting the second voice input to the one or more first remote computing devices for deriving intent information regarding the request for media content based at least on the second voice input; receiving a second response from the one or more first remote computing devices, wherein the second response comprises the derived intent information and an identified media content service; based at least in part on the derived intent information, requesting, independent of the voice assistant service, media content information directly from one or more second remote computing devices hosting the identified media content service; receiving, independent of the voice assistant service, second information from the one or more second remote computing devices, wherein the second information identifies media content available via the media content service for playback; and independent of the voice assistant service, playing back the media content via the NMD.

9. The method of claim 8, wherein the first information associated with the audio content comprises at least one of: a storage address, a link, a URL, or a file.

10. The method of claim 8, wherein the first information associated with the audio content comprises a voice response from the voice assistant service.

11. The method of claim 8, wherein the media playback system further comprises one or more third remote computing devices, and wherein the receiving, independent of the voice assistant service, the second information from the one or more third remote computing devices comprises receiving the second information via the one or more third remote computing devices.

12. The method of claim 11, further comprising, after receiving the second information, (i) transmitting a uniform resource identifier (URI) or uniform resource locator (URL) associated with the media content from the one or more third remote computing devices of the media playback system to the NMD, and (ii) requesting, via the NMD, the media content, via the URI or URL, from the one or more third remote computing devices of the media content service for playback.

13. The method of claim 11, wherein the requesting, via the media playback system and independent of the voice assistant service, media content information from one or more second remote computing devices hosting the identified media content service comprises transmitting a request from the one or more third remote computing devices of the media playback system to the one or more second remote computing devices hosting the identified media content service.

14. The method of claim 8, wherein the derived intent information comprises a predefined data structure including one or more media content attributes, and wherein requesting media content informatio
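
As a rough illustration of the claimed flow, the sketch below models the "predefined data structure" of claims 7 and 14 and the direct query to the identified media content service. The claims do not specify field names, transport, or matching rules, so `DerivedIntent`, `InMemoryMediaService`, `resolve_media`, and the exact-match lookup are all hypothetical assumptions, and the in-memory catalog merely stands in for a remote service.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class DerivedIntent:
    """Hypothetical shape for the intent returned by the voice assistant service."""
    service_id: str             # the identified media content service
    attributes: Dict[str, str]  # media content attributes, e.g. {"artist": "..."}


class InMemoryMediaService:
    """Stand-in for a remote media content service (assumption for this sketch)."""

    def __init__(self, catalog: Dict[str, Dict[str, str]]) -> None:
        # Maps a playable URI to its metadata attributes.
        self._catalog = catalog

    def search(self, attributes: Dict[str, str]) -> List[str]:
        """Return URIs whose metadata matches every requested attribute exactly."""
        return [
            uri
            for uri, meta in self._catalog.items()
            if all(meta.get(key) == value for key, value in attributes.items())
        ]


def resolve_media(
    intent: DerivedIntent,
    services: Dict[str, InMemoryMediaService],
) -> Optional[str]:
    """Query the identified service directly, without the voice assistant.

    Returns the first matching URI, or None if the service is unknown
    or nothing matches.
    """
    service = services.get(intent.service_id)
    if service is None:
        return None
    matches = service.search(intent.attributes)
    return matches[0] if matches else None
```

In this sketch the voice assistant's role ends once it has produced the `DerivedIntent`. The lookup and the resulting URI stay between the playback system and the media content service, which mirrors the "independent of the voice assistant service" language in claims 1 and 8.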
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Announcement of recognition results · CPC title
Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title
Execution procedure of a spoken command · CPC title
Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title