Progressive encoding of audio

US8965545B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-8965545-B2
Application numberUS-89525810-A
CountryUS
Kind codeB2
Filing dateSep 30, 2010
Priority dateSep 30, 2010
Publication dateFeb 24, 2015
Grant dateFeb 24, 2015

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure includes processing a signal to generate a first sub-set of data, transmitting the first sub-set of data for generation of a reconstructed audio signal, the reconstructed audio signal having a fidelity relative to the signal, processing the signal to generate a second sub-set of data and a third sub-set of data, the second sub-set of data defining a second portion of the signal and comprising data that is different than data of the first sub-set of data, and the third sub-set of data defining a third portion of the signal and comprising data that is different than data of the first and second sub-sets of data, comparing a priority of the second sub-set of data to a priority of the third sub-set of data, and transmitting one of the second sub-set of data and the third sub-set of data over the network for improving the fidelity.

First claim

Opening claim text (preview).

What is claimed is: 1. A system, comprising: one or more computers; and a computer-readable medium coupled to the one or more computers and having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising: processing a digital audio signal to generate a first sub-set of data, the first sub-set of data defining an entire portion of the digital audio signal; transmitting the first sub-set of data for generation of a reconstructed audio signal, the reconstructed audio signal being i) a reconstruction of the entire digital audio signal and ii) having a first fidelity relative to a fidelity of the digital audio signal; receiving a first signal indicating that the first fidelity of the reconstructed audio signal is less than a threshold fidelity; in response to receiving the first signal, processing the digital audio signal to generate a second sub-set of data, the second sub-set of data comprising data that is different than data of the first sub-set of data; transmitting the second sub-set of data, wherein the second sub-set of data is useable to obtain an updated reconstructed audio signal, the updated reconstructed audio signal being i) a reconstruction of the entire digital audio signal and ii) having a second fidelity relative to the fidelity of the digital audio signal; receiving a second signal indicating that the second fidelity of the updated reconstructed audio signal is greater than the threshold fidelity; and in response to receiving the second signal, ceasing generation of subsequent sub-sets of data based on the digital audio signal. 2. The system of claim 1 , wherein the second sub-set of data includes more data than the first sub-set of data. 3. The system of claim 1 , the operations further comprising: in response to receiving the first signal, processing the digital audio signal to generate a third sub-set of data, the third sub-set of data comprising data that is different than data of the first and second sub-sets of data; and comparing a priority of the second sub-set of data to a priority of the third sub-set of data; wherein transmitting further comprises transmitting, based on the comparing, only one of the second sub-set of data and the third sub-set of data, wherein the operations further comprise subsequently transmitting the other of the second sub-set of data and the third sub-set of data to obtain a further updated reconstructed audio signal having a third fidelity relative to the fidelity of the digital audio signal, the third fidelity of the third version of the reconstructed audio signal greater than the second fidelity. 4. The system of claim 3 , wherein the third sub-set of data includes more data than each of the second sub-set of data and the first sub-set of data. 5. The system of claim 3 , wherein processing the digital audio signal to generate a first sub-set of data comprises: determining an original sampling rate of the digital audio signal; and down-sampling data of the digital audio signal at a first sampling rate that is less than the original sampling rate to provide the first sub-set of data. 6. The system of claim 5 , wherein processing the digital audio signal to generate a second sub-set of data comprises: up-sampling data of the first sub-set of data at the original sampling rate to provide first up-sampled data; subtracting the first up-sampled data from data of the digital audio signal to provide first residual data; and down-sampling the first residual data at a second sampling rate that is greater than the first sampling rate and that is less than the original sampling rate to provide the second sub-set of data. 7. The system of claim 6 , wherein processing the digital audio signal to generate a third sub-set of data comprises: up-sampling data of the second sub-set of data at the original sampling rate to provide second up-sampled data; and subtracting the second up-sampled data from the first residual data to provide second residual data, the second residual data defining the third sub-set of data. 8. The system of claim 1 , wherein processing the digital audio signal to generate a first sub-set of data comprises: determining a bit-depth of data of the digital audio signal; and extracting a first bit of each sample of the data of the digital audio signal to provide first extracted data, the first extracted data defining the first sub-set of data and the first bit being determined based on the bit-depth. 9. The system of claim 8 , wherein processing the digital audio signal to generate a second sub-set of data comprises extracting a second bit of each sample of the data of the data set to provide second extracted data, the second extracted data defining the second sub-set of data and the second bit being determined based on the bit-depth. 10. The system of claim 3 , wherein the operations further comprise compressing the first sub-set of data and the one of the second sub-set of data and the third sub-set of data. 11. A method, comprising: processing a digital audio signal to generate a first sub-set of data, the first sub-set of data defining an entire portion of the digital audio signal; transmitting the first sub-set of data for generation of a reconstructed audio signal, the reconstructed audio signal being i) a reconstruction of the entire digital audio signal and ii) having a first fidelity relative to a fidelity of the digital audio signal; receiving a first signal indicating that the first fidelity of the reconstructed audio signal is less than a threshold fidelity; in response to receiving the first signal, processing the digital audio signal to generate a second sub-set of data, the second sub-set of data comprising data that is different than data of the first sub-set of data; transmitting the second sub-set of data, wherein the second sub-set of data is useable to obtain an updated reconstructed audio signal, the updated reconstructed audio signal being i) a reconstruction of the entire digital audio signal and ii) having a second fidelity relative to fidelity of the digital audio signal; receiving a second signal indicating that the second fidelity of the updated reconstructed audio signal is greater than the threshold fidelity; and in response to receiving the second signal, ceasing generation of subsequent sub-sets of data based on the digital audio signal. 12. A non-transitory computer-readable medium coupled to one or more computers and having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising: processing a digital audio signal to generate a first sub-set of data, the first sub-set of data defining an entire portion of the digital audio signal; transmitting the first sub-set of data for generation of a reconstructed audio signal, reconstructed audio signal being i) a reconstruction of the entire digital audio signal and ii) having a first fidelity relative to a fidelity of the digital audio signal; receiving a first signal indicating that the first fidelity of the reconstructed audio signal is less than a threshold fidelity; in response to receiving the first signal, processing the digital audio signal to generate a second sub-set of data, the second sub-set of data comprising data that is different than data of the first sub-set of data; transmitting the second sub-set of data, wherein the second sub-set of data and the third sub-set of data is useable to obtain an updated reconstructed audio signal, the updated reconstructed audio signal being i) a reconstruction of the entire digital audio sig

Assignees

Inventors

Classifications

  • using natural language modelling · CPC title

  • G10L15/20Primary

    Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title

  • Execution procedure of a spoken command · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US8965545B2 cover?
The present disclosure includes processing a signal to generate a first sub-set of data, transmitting the first sub-set of data for generation of a reconstructed audio signal, the reconstructed audio signal having a fidelity relative to the signal, processing the signal to generate a second sub-set of data and a third sub-set of data, the second sub-set of data defining a second portion of the …
Who is the assignee on this patent?
Lloyd Matthew I, Jansche Martin, Google Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 24 2015 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).