Device specific multi-channel data compression

US9875747B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9875747-B1
Application numberUS-201615211417-A
CountryUS
Kind codeB1
Filing dateJul 15, 2016
Priority dateJul 15, 2016
Publication dateJan 23, 2018
Grant dateJan 23, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A sensor device may include a computing device in communication with multiple microphones. A neural network executing on the computing device may receive audio signals from each microphone. One microphone signal may serve as a reference signal. The neural network may extract differences in signal characteristics of the other microphone signals as compared to the reference signal. The neural network may combine these signal differences into a lossy compressed signal. The sensor device may transmit the lossy compressed signal and the lossless reference signal to a remote neural network executing in a cloud computing environment for decompression and sound recognition analysis.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method comprising: determining, by at least a first neural network layer of a neural network of a first device, a first signal difference between a signal characteristic of a first audio signal and a signal characteristic of a second audio signal, wherein the first signal difference includes a difference in a frequency response; compressing, by at least a second neural network layer of the neural network and based on the first signal difference, the first audio signal and the second audio signal into a third audio signal; and providing, by the first device to a second device, the first audio signal and the third audio signal. 2. The method of claim 1 , further comprising: determining, by at least the first neural network layer, a plurality of signal differences between one or more signal characteristics of the first audio signal and one or more signal characteristics of the second audio signal; and selecting, by the neural network of the first device, the first signal difference from among the plurality of signal differences. 3. The method of claim 1 , further comprising: receiving, by the first device from a first audio signal source, the first audio signal; and receiving, by the first device from a second audio signal source, the second audio signal, wherein the first audio signal source comprises a first microphone and the second audio signal source comprises a second microphone distinct from the first microphone. 4. The method of claim 1 , further comprising: receiving, by the first device from a first audio signal source, the first audio signal; and receiving, by the first device from a second audio signal source, the second audio signal, wherein the first audio signal source comprises a first microphone, the second audio signal source comprises a second microphone distinct from the first microphone, and the first microphone and the second microphone are disposed at distinct locations on the first device. 5. The method of claim 1 , further comprising: receiving, by the first device from a first audio signal source, the first audio signal; and receiving, by the first device, a plurality of audio signals from a plurality of audio signal sources other than the first audio signal source. 6. The method of claim 1 , further comprising: receiving, by the first device from a first audio signal source, the first audio signal; receiving, by the first device, a plurality of audio signals from a plurality of audio signal sources other than the first audio signal source; and determining, by at least the first neural network layer, a plurality of signal differences between one or more signal characteristics of the first audio signal and one or more signal characteristics of the plurality of audio signals. 7. The method of claim 1 , further comprising: receiving, by the first device from a first audio signal source, the first audio signal; receiving, by the first device, a plurality of audio signals from a plurality of audio signal sources other than the first audio signal source; determining, by at least the first neural network layer, a plurality of signal differences between one or more signal characteristics of the first audio signal and one or more signal characteristics of the plurality of audio signals; and generating, by at least the second neural network layer based on the plurality of signal differences, the third audio signal. 8. The method of claim 1 , wherein the first audio signal comprises a lossless signal and the third audio signal comprises an audio signal generated by lossy compression. 9. The method of claim 1 , further comprising: losslessly compressing, by the first device, the first audio signal. 10. The method of claim 1 , wherein a bit rate of the first audio signal is greater than a bit rate of the third audio signal. 11. The method of claim 1 , wherein the first neural network layer and the second neural network layer are distinct neural network layers of the neural network of the first device. 12. The method of claim 1 , wherein the neural network of the first device comprises at least one selected from the group consisting of a deep neural network, convolutional neural network, long short-term memory neural network, and a convolutional, long short-term memory, fully connected deep neural network. 13. The method of claim 1 , wherein the first signal difference comprises at least one selected from the group consisting of: a difference in phase, a difference in magnitude, and a difference in gain. 14. The method of claim 1 , wherein the first signal difference comprises at least one selected from the group consisting of: a transfer function of the first audio signal source and a transfer function of the second audio signal source. 15. The method of claim 1 , wherein the first neural network layer comprises a plurality of nodes. 16. The method of claim 1 , wherein a total number of nodes of the first neural network layer is greater than a total number of nodes of the second neural network layer. 17. The method of claim 1 , wherein the second neural network layer comprises exactly one node. 18. The method of claim 1 , wherein the neural network defines one or more cell states. 19. The method of claim 1 , wherein the neural network comprises three or more layers and there is no layer between the second neural network layer and the output of the neural network. 20. The method of claim 1 , wherein the second device is distinct and remote from the first device. 21. A non-transitory, computer-readable medium storing instructions that, when executed by a processor, cause the processor to perform operations comprising: determining, by at least a first neural network layer of a neural network of a first device, a first signal difference between a signal characteristic of a first audio signal and a signal characteristic of a second audio signal, wherein the first signal difference includes a difference in a frequency response; compressing, by at least a second neural network layer of the neural network and based on the first signal difference, the first audio signal and the second audio signal into a third audio signal; and providing, by the first device to a second device, the first audio signal and the third audio signal. 22. A first device comprising: a processor; and a non-transitory, computer-readable medium in communication with the processor and storing instructions that, when executed by the processor, cause the processor to perform operations comprising: determining, by at least a first neural network layer of a neural network of a first device, a first signal difference between a signal characteristic of a first audio signal and a signal characteristic of a second audio signal, wherein the first signal difference includes a difference in a frequency response; compressing, by at least a second neural network layer of the neural network and based on the first signal difference, the first audio signal and the second audio signal into a third audio signal; and providing, to a second device, the first audio signal and the third audio signal. 23. A method comprising: generating, by a first device and based on a first audio signal and a second audio signal, a third audio signal; determining, by at least a first neural network layer of a neural network of the first device, a first signal difference between a signal characteristic of the first audio signal and a signal charac

Assignees

Inventors

Classifications

  • G10L19/008Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error (G10L19/24 takes precedence) · CPC title

  • using neural networks · CPC title

  • for transmitting results of analysis · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9875747B1 cover?
A sensor device may include a computing device in communication with multiple microphones. A neural network executing on the computing device may receive audio signals from each microphone. One microphone signal may serve as a reference signal. The neural network may extract differences in signal characteristics of the other microphone signals as compared to the reference signal. The neural net…
Who is the assignee on this patent?
Google Inc, Google Llc
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 23 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).