Sound source localization using phase spectrum

US9435873B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9435873-B2
Application numberUS-201113182449-A
CountryUS
Kind codeB2
Filing dateJul 14, 2011
Priority dateJul 14, 2011
Publication dateSep 6, 2016
Grant dateSep 6, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An array of microphones placed on a mobile robot provides multiple channels of audio signals. A received set of audio signals is called an audio segment, which is divided into multiple frames. A phase analysis is performed on a frame of the signals from each pair of microphones. If both microphones are in an active state during the frame, a candidate angle is generated for each such pair of microphones. The result is a list of candidate angles for the frame. This list is processed to select a final candidate angle for the frame. The list of candidate angles is tracked over time to assist in the process of selecting the final candidate angle for an audio segment.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented process comprising: receiving signals from a plurality of pairs of microphones into a memory; processing the signals from the microphones to identify when the signals are active; computing frequency spectrum data for the signals; for each pair of active signals: determining a phase spectrum of a cross power spectrum using the frequency spectrum data of the pair of active signals, determining a phase distortion for the pair from the phase spectrum, and deriving a candidate angle from the determined phase distortion; and selecting an angle from among the candidate angles derived for the plurality of pairs of microphones. 2. The computer-implemented process of claim 1 , wherein receiving signals includes receiving each signal as a plurality of frames, and wherein processing, computing, determining and selecting are performed on a per frame basis. 3. The computer-implemented process of claim 1 , wherein selecting the angle from among the candidate angles comprises selecting the angle using information about prior candidate angles. 4. The computer-implemented process of claim 3 , wherein selecting further comprises: tracking a history of candidate angles over multiple frames; and updating the history with the candidate angles from the current frame. 5. The computer-implemented process of claim 4 , further comprising selecting the angle from the history which has a phase distortion less than or equal to the minimum phase distortion of all entries. 6. The computer-implemented process of claim 4 , further comprising selecting the angle from the history having an angle similar to a highest ranked candidate angle for the frame. 7. The computer-implemented process of claim 4 , further comprising selecting the angle from the history having a presence score greater than or equal to a maximum presence score of angles in the history. 8. An article of manufacture comprising: a computer storage medium; computer program instructions stored on the computer storage medium which, when processed by a processing device, instruct the processing device to perform a process comprising: receiving signals from a plurality of pairs of microphones into a memory; processing the signals from the microphones to identify when the signals are active; computing frequency spectrum data for the signals; for each pair of active signals: determining a phase spectrum of a cross power spectrum using the frequency spectrum data of the pair of active signals, determining a phase distortion for the pair from the phase spectrum, and deriving a candidate angle from the determined phase distortion; and selecting an angle from among the candidate angles derived for the plurality of pairs of microphones. 9. The article of manufacture of claim 8 , wherein receiving signals includes receiving each signal as a plurality of frames, and wherein processing, computing, determining and selecting are performed on a per frame basis. 10. The article of manufacture of claim 8 , wherein selecting the angle from among the candidate angles comprises selecting the angle using information about prior candidate angles. 11. The article of manufacture of claim 10 , wherein selecting further comprises: tracking a history of candidate angles over multiple frames; and updating the history with the candidate angles from the current frame. 12. The article of manufacture of claim 11 , further comprising selecting the angle from the history which has a phase distortion less than or equal to the minimum phase distortion of all entries. 13. The article of manufacture of claim 11 , further comprising selecting the angle from the history having an angle similar to a highest ranked candidate angle for the frame. 14. The article of manufacture of claim 11 , further comprising selecting the angle from the history having a presence score greater than or equal to a maximum presence score of angles in the history. 15. A computing machine comprising: a memory; an input for receiving signals from a plurality of pairs of microphones into the memory; a processing unit configured to process the signals from the microphones to identify when the signals are active and compute frequency spectrum data for the signals; wherein the processing unit is further configured to, for each pair of active signals, determine a phase spectrum of a cross power spectrum using the frequency spectrum data of the pair of active signals, determine a phase distortion for the pair using the phase spectrum, and derive a candidate angle from the phase distortion, and select an angle from among the candidate angles derived for the plurality of pairs of microphones. 16. The computing machine of claim 15 , wherein the input receives and stores each signal as a plurality of frames, and wherein the processing unit is configured to process the signals on a per frame basis. 17. The computing machine of claim 15 , wherein the memory further stores a history of candidate angles over multiple frames, and the processing unit is configured to update the history with the candidate angles from the current frame. 18. The computing machine of claim 17 , wherein the angle selected from the history has a phase distortion less than or equal to the minimum phase distortion of all entries. 19. The computing machine of claim 17 , wherein the angle selected from the history has an angle similar to a highest ranked candidate angle for the frame. 20. The computing machine of claim 17 , wherein the angle selected from the history has a presence score greater than or equal to a maximum presence score of angles in the history.

Assignees

Inventors

Classifications

  • with means for adjusting phase or compensating for time-lag errors · CPC title

  • G01S3/8006Primary

    Multi-channel systems specially adapted for direction-finding, i.e. having a single aerial system capable of giving simultaneous indications of the directions of different signals · CPC title

  • G01S3/8083Primary

    determining direction of source · CPC title

  • for combining the signals of two or more microphones (specially adapted for hearing aids H04R25/407) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9435873B2 cover?
An array of microphones placed on a mobile robot provides multiple channels of audio signals. A received set of audio signals is called an audio segment, which is divided into multiple frames. A phase analysis is performed on a frame of the signals from each pair of microphones. If both microphones are in an active state during the frame, a candidate angle is generated for each such pair of mic…
Who is the assignee on this patent?
Regunathan Shankar, Koishida Kazuhito, Kikkeri Harshavardhana Narayana, and 1 more
What technology area does this patent fall under?
Primary CPC classification G01S3/8006. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 06 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).