Voice controlled assistant with coaxial speaker and microphone arrangement

US9390724B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9390724-B2
Application numberUS-201514738669-A
CountryUS
Kind codeB2
Filing dateJun 12, 2015
Priority dateJun 1, 2012
Publication dateJul 12, 2016
Grant dateJul 12, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A voice controlled assistant has a housing to hold one or more microphones, one or more speakers, and various computing components. The housing has an elongated cylindrical body extending along a center axis between a base end and a top end. The microphone(s) are mounted in the top end and the speaker(s) are mounted proximal to the base end. The microphone(s) and speaker(s) are coaxially aligned along the center axis. The speaker(s) are oriented to output sound directionally toward the base end and opposite to the microphone(s) in the top end. The sound may then be redirected in a radial outward direction from the center axis at the base end so that the sound is output symmetric to, and equidistance from, the microphone(s).

First claim

Opening claim text (preview).

What is claimed is: 1. A device comprising: a housing having a base to support the housing on a surface and a distal top end; at least one microphone mounted proximal to the top end to receive audio; a processor mounted within the housing to process a signal representation of the audio; a memory comprising one or more of a speech recognition module or an acoustic echo cancellation module, wherein the speech recognition module is executable by the processor to recognize speech in the signal representation, and the acoustic echo cancellation module is executable by the processor to reduce acoustic echoes detected in the signal representation; at least one speaker mounted in the housing and oriented to output sound in a downward direction toward the base and away from the top end; and a sound distribution cone mounted in the housing to distribute the sound emitted from the at least one speaker, wherein the sound distribution cone and the at least one speaker are coaxially aligned. 2. The device of claim 1 , wherein the at least one speaker comprises first and second speakers coaxially aligned with one another and with the sound distribution cone. 3. The device of claim 1 , wherein the sound distribution cone directs the sound in a radial outward direction substantially perpendicular to the downward direction. 4. A method comprising: receiving audio, via one or more microphones, in a device having the one or more microphones positioned at a top end; processing a signal representation of the audio, the processing comprising one or more of: recognizing speech in the signal representation using a speech recognition module, or cancelling acoustic echoes detected in the signal representation using an acoustic echo cancellation module; and outputting sound via one or more speakers arranged in a base end of the device, wherein the sound is output from the one or more speakers in a downward direction toward the base end directionally opposite to the one or more microphones at the top end. 5. The method of claim 4 , wherein recognizing the speech comprises processing the signal representation to parse the speech. 6. The method of claim 4 , further comprising processing the signal representation to reduce double talk detected in the signal representation, in conjunction with cancelling the acoustic echoes detected in the signal representation. 7. The method of claim 4 , wherein processing the signal representation comprises substantially cancelling acoustic echoes detected in the signal representation, and then parsing the speech in the signal representation. 8. A device comprising: a housing having a base to support the housing on a surface and a distal end; at least one microphone mounted at the distal end of the housing to receive audio; a processor mounted within the housing to process a signal representation of the audio; a memory comprising one or more of a speech recognition module or an acoustic echo cancellation module, wherein the speech recognition module is executable by the processor to recognize speech in the signal representation, and the acoustic echo cancellation module is executable by the processor to substantially cancel acoustic echoes detected in the signal representation; and at least one speaker arranged inside the housing and oriented to output sound in a direction away from the at least one microphone. 9. The device of claim 8 , wherein the speech comprises specific commands. 10. The device of claim 8 , wherein the housing has one or more openings near the base to pass sound waves from the at least one speaker. 11. The device of claim 8 , wherein the at least one speaker comprises a plurality of speakers that are coaxially aligned. 12. The device of claim 8 , wherein the at least one speaker and the at least one microphone are coaxially aligned. 13. The device of claim 8 , further comprising a sound distribution cone arranged inside of the housing to distribute the sound emitted from the at least one speaker. 14. The device of claim 13 , wherein the at least one speaker and the sound distribution cone are coaxially aligned. 15. The device of claim 13 , wherein the sound distribution cone directs the sound at least partially in a radial outward direction. 16. The device of claim 13 , wherein the sound distribution cone directs the sound outward from the housing proximal to the base.

Assignees

Inventors

Classifications

  • Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title

  • Noise filtering · CPC title

  • G10L17/22Primary

    Interactive procedures; Man-machine interfaces · CPC title

  • for loudspeakers (H04R1/34 and H04R1/40 take precedence) · CPC title

  • for distributing signals to two or more loudspeakers {(specially adapted for hearing aids H04R25/407)} · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9390724B2 cover?
A voice controlled assistant has a housing to hold one or more microphones, one or more speakers, and various computing components. The housing has an elongated cylindrical body extending along a center axis between a base end and a top end. The microphone(s) are mounted in the top end and the speaker(s) are mounted proximal to the base end. The microphone(s) and speaker(s) are coaxially aligne…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L21/0208. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 12 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).