Multi-mode audio recognition and auxiliary data encoding and decoding

US2016293172A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016293172-A1
Application numberUS-201615090279-A
CountryUS
Kind codeA1
Filing dateApr 4, 2016
Priority dateOct 15, 2012
Publication dateOct 6, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated into audio watermark embedding to optimize audio quality relative the original signal, and to optimize robustness or data capacity. These methods are applied to audio segments in audio embedder and detector configurations to support real time operation. Feature extraction and matching are also used to adapt audio watermark embedding and detecting.

First claim

Opening claim text (preview).

We claim: 1 . A method of embedding a watermark in an electronic audio signal, the method comprising: analyzing the audio signal to identify an embedding location that does not have sufficient signal in which to embed a watermark signal element; boosting the audio signal at the embedding location; and embedding the watermark signal element at the embedding location, using the boosting to mask audibility of a change in the audio signal made to embed the watermark signal. 2 . The method of claim 1 wherein the analyzing comprises analyzing a spectral domain of a segment of the audio signal, and wherein boosting comprises boosting the audio signal at frequency locations where the audio signal has sparse spectral components. 3 . The method of claim 2 wherein in boosting comprises applying an equalizer function to the segment. 4 . The method of claim 3 including controlling the equalizer function based on a measure of correlation of equalized audio segment relative to an original audio segment. 5 . The method of claim 4 including varying the equalizer function over time segments, and keeping change due to applying the equalizer from segment to segment within a constraint. 6 . A method of embedding a watermark in an electronic audio signal, the method comprising: determining whether an audio segment of the audio signal is stationary or non-stationary; adapting resolution of a perceptual model based on whether the audio segment is stationary or non-stationary; and inserting a watermark into the audio segment using the adapted perceptual model. 7 . A method of detecting a watermark in an electronic audio signal, the method comprising: estimating rake receiver parameters using known attributes of a watermark signal in the electronic audio signal; forming a rake receiver using the estimated rake receiver parameters, wherein the rake receiver detects reflections of a watermark signal due to multipath; and combining the reflections of the watermark signal to improve watermark signal to noise ratio. 8 . A method of embedding a watermark in an electronic audio signal, the method comprising: generating a watermark signal for insertion into the electronic audio signal; evaluating perceptual audio quality of the electronic audio signal relative to changes of that electronic audio signal corresponding to the watermark signal through automated application of a perceptual audio quality measure that computes audio quality parameters based on a human auditory model, including parameters for estimating quality based on a difference between the audio signal and a watermarked version of the audio signal; updating a watermark embedding parameter based on the evaluating; and embedding the watermark signal into the electronic audio signal using the updated watermark embedding parameter. 9 . The method of claim 8 including: evaluating robustness of a watermarked audio signal using bit error rate or detection rate metrics for the generated watermark signal in the watermarked audio signal; and based on the robustness, updating the watermark embedding parameter. 10 . The method of claim 8 , the method comprising: analyzing the audio signal for a harmonic; for embedding locations corresponding to the harmonic, structuring the watermark signal to be masked by the harmonic. 11 . The method of claim 10 including: detecting a complex tone including harmonics; generating a watermark signal that exploits a harmonic relationship in the complex tone, including increasing a first harmonic and decreasing a second harmonic in the harmonic relationship. 12 . The method of 1 wherein generating a watermark signal comprises generating a frequency domain signal with plural elements mapped to corresponding plural frequency locations in an audio frame, with the plural elements being structured having at least partially offsetting values in the first and second harmonics. 13 . A method of embedding a watermark in an electronic audio signal, the method comprising: generating a watermark signal using orthogonal frequency division multiplexing in which auxiliary data is modulated onto OFDM carrier signals; computing a frequency magnitude envelope for embedding locations in a frequency domain transform of the audio signal; and inserting the watermark signal by replacing audio signal frequency components with modulated OFDM carrier signals at the embedding locations while maintaining the frequency magnitude envelope at the embedding locations. 14 . The method of claim 13 comprising: generating a high frequency watermark signal by modulating a carrier signal using a set of frequency shaping patterns at a frequency range of 10 to 22 kHz; and inserting the watermark signal into carrier signal. 15 . The method of claim 13 , wherein the high frequency watermark signal is a time-varying signal. 16 . The method of claim 13 , wherein the high frequency watermark signal is a periodic signal. 17 . The method of claim 13 , wherein the high frequency watermark signal is a non-periodic signal. 18 . The method of claim 13 comprising weighting the audio signal in a frequency range from 16 to at least 19 Khz, the weighting being selected to counter a drop in frequency response of audio equipment over the frequency range from 16 to at least 19 Khz.

Assignees

Inventors

Classifications

  • G10L19/018Primary

    Audio watermarking, i.e. embedding inaudible data in the audio signal · CPC title

  • using spectral analysis, e.g. transform vocoders or subband vocoders · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016293172A1 cover?
Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated i…
Who is the assignee on this patent?
Digimarc Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/018. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Oct 06 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).