Methods and system for cue detection from audio input, low-power data processing and related arrangements

US2020133625A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2020133625-A1
Application numberUS-201916665906-A
CountryUS
Kind codeA1
Filing dateOct 28, 2019
Priority dateDec 24, 2013
Publication dateApr 30, 2020
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and arrangements involving electronic devices, such as smartphones, tablet computers, wearable devices, etc., are disclosed. One arrangement involves a low-power processing technique for discerning cues from audio input. Another involves a technique for detecting audio activity based on the Kullback-Liebler divergence (KLD) (or a modified version thereof) of the audio input. Still other arrangements concern techniques for managing the manner in which policies are embodied on an electronic device. Others relate to distributed computing techniques. A great variety of other features are also detailed.

First claim

Opening claim text (preview).

1 . A method, comprising: obtaining audio input; at a first processor, processing the audio input to discern a characteristic of the audio input; wherein processing the audio input to discern the characteristic of the audio input comprises processing the audio input to discern auxiliary data conveyed by a digital audio watermark signal present within the audio input, the processing of the audio input to discern the auxiliary data comprising: buffering frames of the audio input, transforming the frames into spectral magnitude frames, accumulating spectral magnitude frames into an accumulation buffer, extracting spectral magnitude values corresponding to selected bits of the digital audio watermark signal, and correlating the extracted spectral magnitude values with a predetermined signal to produce a correlation metric; generating an output based upon the processing to discern the characteristic; and controlling an operation of a second processor distinct from the first processor based on the generated output. 2 . The method of claim 1 , wherein the first and second processors are components of an electronic device, the method further comprising generating an audio signal corresponding to sound propagating within an aural environment surrounding the electronic device, wherein the obtained audio input comprises a plurality of samples of the audio signal. 3 . The method of claim 2 , wherein the second processor is a CPU. 4 . The method of claim 3 , wherein the first processor is a digital signal processor. 5 . The method of claim 3 , further comprising processing the audio input while the second processor is in an idle or sleep state. 6 . The method of claim 5 , wherein controlling an operation of the second processor comprises causing the second processor to enter into a higher power state than the idle or sleep state. 7 . The method of claim 1 , wherein processing the audio input to discern the characteristic of the audio input comprises processing the audio input to determine the presence of audio activity within the audio input. 8 . The method of claim 7 , wherein processing the audio input to determine the presence of audio activity comprises determining zero-crossing or short-term energy metrics from the audio input, determining co-occurrence statistics of the zero-crossing or short term energy metrics, and classifying the audio input based on the co-occurrence statistics. 9 . (canceled) 10 . The method of claim 1 , wherein processing the audio input to discern the characteristic of the audio input comprises processing the audio input to discern auxiliary data conveyed by a digital audio watermark signal present within the audio input. 11 . (canceled) 12 . The method of claim 1 , wherein accumulating comprises: accumulating spectral magnitude frames into a first accumulation buffer, the spectral magnitude frames corresponding to shift groups; and accumulating spectral magnitude frames from the first accumulation buffer according to shift group in a second accumulation buffer. 13 . The method of claim 1 comprising: scaling the spectral magnitude frames in the second accumulation buffer according to plural noise profiles to produce candidate spectral magnitude profiles for each of the noise profiles; and extracting spectral magnitude values from the candidate spectral magnitude profiles corresponding to selected bits of the digital audio watermark signal. 14 . The method of claim 1 , comprising: correlating the extracted spectral magnitude values with predetermined signals to produce correlation metrics for the predetermined signals; determining a reference spectral magnitude sequence for a predetermined signal detected based on the correlation metrics; generating a structural strength metric for the reference spectral magnitude sequence; selecting spectral magnitude sequences from which to decode auxiliary data by identifying spectral magnitude sequences with a structural strength metric that exceeds a threshold decode candidate value. 15 . The method of claim 14 , comprising: identifying similar spectral magnitude code sequences based on similarity of time shift or noise profile of the spectral magnitude code sequences to produce sub-sets of similar spectral magnitude code sequences; and selecting spectral magnitude sequences from which to decode auxiliary data by selecting within a sub-set based on the structural strength metric. 16 . The method of claim 1 , comprising: detecting presence of the digital watermark signal from a first sub-band spanning a first frequency range; and decoding auxiliary data from the digital watermark signal from second sub-bands spanning a frequency range greater than the first frequency range. 17 . The method of claim 16 , comprising: transforming the frame with a sparse FFT in a process of detecting the presence of the digital watermark signal from the first sub-band. 18 . The method of claim 16 , comprising: transforming a frame with a first FFT for audio input sampled at a first sample rate in a process of detecting the presence of the digital watermark signal from the first sub-band; and transforming a frame with a second FFT for audio input sampled at a second sample rate higher than the first sample rate, in a process of decoding auxiliary data from the digital watermark signal from the second sub-bands. 19 - 21 . (Canceled) 22 . The method of claim 1 , comprising: obtaining a plurality of samples of audio input; processing the audio input samples to determine a relative-entropy of the audio input; estimating the presence of audio activity based on the determined relative-entropy of the audio input. 23 . A method, comprising: receiving a captured audio signal corresponding to sound captured by a microphone of an electronic device comprising a CPU; and processing the captured audio signal to perform at least one of the following: estimate a likelihood that the captured sound has encoded therein a watermark signal; detect a watermark signal encoded within the captured sound; and decode a watermark signal to extract auxiliary data from the captured sound. 24 . The method of claim 23 , further comprising processing the captured audio when the CPU is in a sleep or idle state. 25 . The method of claim 23 , further comprising processing the captured audio when the device is in a global sleep state.

Assignees

Inventors

Classifications

  • Audio watermarking, i.e. embedding inaudible data in the audio signal · CPC title

  • the extracted parameters being zero crossing rates · CPC title

  • using orthogonal transformation · CPC title

  • the extracted parameters being power information · CPC title

  • G06F3/165Primary

    Management of the audio stream, e.g. setting of volume, audio stream path · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2020133625A1 cover?
Methods and arrangements involving electronic devices, such as smartphones, tablet computers, wearable devices, etc., are disclosed. One arrangement involves a low-power processing technique for discerning cues from audio input. Another involves a technique for detecting audio activity based on the Kullback-Liebler divergence (KLD) (or a modified version thereof) of the audio input. Still other…
Who is the assignee on this patent?
Digimarc Corp
What technology area does this patent fall under?
Primary CPC classification G06F3/165. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Apr 30 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).