Sound source localization using phase spectrum

US9817100B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9817100-B2
Application numberUS-201615241332-A
CountryUS
Kind codeB2
Filing dateAug 19, 2016
Priority dateJul 14, 2011
Publication dateNov 14, 2017
Grant dateNov 14, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An array of microphones placed on a mobile robot provides multiple channels of audio signals. A received set of audio signals is called an audio segment, which is divided into multiple frames. A phase analysis is performed on a frame of the signals from each pair of microphones. If both microphones are in an active state during the frame, a candidate angle is generated for each such pair of microphones. The result is a list of candidate angles for the frame. This list is processed to select a final candidate angle for the frame. The list of candidate angles is tracked over time to assist in the process of selecting the final candidate angle for an audio segment.

First claim

Opening claim text (preview).

What is claimed is: 1. A process for sound source localization with a plurality of pairs of microphones with known spatial relationship, comprising: receiving signals from the plurality of pairs of microphones into a memory as a plurality of frames; processing each frame of the signals from the plurality of pairs of microphones to identify when the received signals are active in the frame; computing frequency spectrum data for each frame of the received signals; for each pair of active signals in a frame, determining a candidate angle of sound arrival on the plurality of pairs of microphones using the frequency spectrum data; and selecting, for a current frame, an angle of sound arrival on the plurality of pairs of microphone from among the candidate angles determined for the current frame, by: tracking a history of candidate angles determined for multiple frames; updating the history based on the candidate angles determined for the current frame; and selecting, as the angle for the current frame, an angle from the history having a phase distortion less than or equal to a minimum phase distortion of the candidate angles and similar to a highest ranked candidate angle determined for the current frame. 2. The computer-implemented process of claim 1 , wherein processing, computing, determining and selecting are performed on a per frame basis. 3. The computer-implemented process of claim 1 , wherein selecting the angle from the history is further based on the selected candidate angle having a presence score greater than or equal to a maximum presence score of candidate angles in the history. 4. The process of claim 1 , wherein the history comprises, for each candidate angle, a phase distortion, a presence score and a presence counter. 5. The process of claim 4 , wherein updating the history comprises: for candidate angles in the history other than the selected candidate angle for the current frame, decrementing the presence counter for the candidate angles. 6. The process of claim 5 , wherein updating the history comprises: for the selected candidate angle for the current frame, incrementing a presence counter for the selected candidate angle. 7. The process of claim 6 , further comprising, in response to a determination that a presence counter for a selected candidate angle for the current frame exceeds a threshold, reporting the selected candidate angle for the current frame as a detected angle of sound arrival on the plurality of pairs of microphone. 8. The process of claim 4 wherein updating the history comprises: for a target candidate angle in the history having a lowest phase distortion, updating the presence score for the target candidate angle based on a candidate angle for the current frame having an angle similar to the target candidate angle. 9. The process of claim 4 wherein updating the history comprises: for the target candidate angle in the history having a lowest phase distortion, updating the phase distortion for the target candidate angle based on a candidate angle for the current frame having an angle similar to the target candidate angle. 10. The process of claim 4 wherein updating the history comprises: for the target candidate angle in the history having a lowest phase distortion, updating the angle of the target candidate angle based on a candidate angle for the current frame having an angle similar to the target candidate angle. 11. A computing machine comprising: a memory; an input for receiving signals from a plurality of pairs of microphones into the memory as a plurality of frames; a processing unit configured to process each frame of the received signals from the plurality of pairs of microphones to identify when the received signals are active in the frame and to compute frequency spectrum data for each frame of the received signals; wherein the processing unit is further configured to, for each pair of active signals in a frame, determine a candidate angle of sound arrival on the plurality of pairs of microphones using the frequency spectrum data, and to select, for a current frame, an angle of sound arrival on the plurality of pairs of microphones from among the candidate angles determined for the current frame, by: tracking a history of candidate angles determined for multiple frames; updating the history based on the candidate angles determined for the current frame; and selecting, as the angle for the current frame, an angle from the history having a phase distortion less than or equal to a minimum phase distortion of the candidate angles and similar to a highest ranked candidate angle determined for the current frame. 12. The computing machine of claim 11 , wherein the processing unit is configured to process the signals on a per frame basis. 13. The computing machine of claim 11 , wherein to select the angle from the history, the processing unit is further configured to select the candidate angle having a presence score greater than or equal to a maximum presence score of candidate angles in the history. 14. The computing machine of claim 11 , wherein the history comprises, for each candidate angle, a phase distortion, a presence score and a presence counter. 15. The computing machine of claim 14 , wherein updating the history comprises: for candidate angles in the history other than the selected candidate angle for the current frame, decrementing the presence counter for the candidate angles. 16. The computing machine of claim 15 , wherein updating the history comprises: for the selected candidate angle for the current frame, incrementing a presence counter for the selected candidate angle. 17. The computing machine of claim 16 , further comprising, in response to a determination that a presence counter for a selected candidate angle for the current frame exceeds a threshold, reporting the selected candidate angle for the current frame as a detected angle of sound arrival on the plurality of pairs of microphone. 18. The computing machine of claim 14 wherein updating the history comprises: for a target candidate angle in the history having a lowest phase distortion, updating the presence score for the target candidate angle based on a candidate angle for the current frame having an angle similar to the target candidate angle. 19. The computing machine of claim 14 wherein updating the history comprises: for the target candidate angle in the history having a lowest phase distortion, updating the phase distortion for the target candidate angle based on a candidate angle for the current frame having an angle similar to the target candidate angle. 20. The computing machine of claim 14 wherein updating the history comprises: for the target candidate angle in the history having a lowest phase distortion, updating the angle of the target candidate angle based on a candidate angle for the current frame having an angle similar to the target candidate angle.

Assignees

Inventors

Classifications

  • G01S3/8083Primary

    determining direction of source · CPC title

  • G01S3/8006Primary

    Multi-channel systems specially adapted for direction-finding, i.e. having a single aerial system capable of giving simultaneous indications of the directions of different signals · CPC title

  • with means for adjusting phase or compensating for time-lag errors · CPC title

  • for combining the signals of two or more microphones (specially adapted for hearing aids H04R25/407) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9817100B2 cover?
An array of microphones placed on a mobile robot provides multiple channels of audio signals. A received set of audio signals is called an audio segment, which is divided into multiple frames. A phase analysis is performed on a frame of the signals from each pair of microphones. If both microphones are in an active state during the frame, a candidate angle is generated for each such pair of mic…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G01S3/8083. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 14 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).