Multi-mode audio recognition and auxiliary data encoding and decoding

US11990143B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11990143-B2
Application numberUS-202117532884-A
CountryUS
Kind codeB2
Filing dateNov 22, 2021
Priority dateOct 15, 2012
Publication dateMay 21, 2024
Grant dateMay 21, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated into audio watermark embedding to optimize audio quality relative the original signal, and to optimize robustness or data capacity. These methods are applied to audio segments in audio embedder and detector configurations to support real time operation. Feature extraction and matching are also used to adapt audio watermark embedding and detecting.

First claim

Opening claim text (preview).

We claim: 1. An apparatus for embedding a digital watermark in an audio signal, the apparatus comprising: memory for storing an audio signal; means for identifying an embedding location within the audio signal that does not have sufficient signal in which to embed a digital watermark signal element; means for boosting the audio signal at the embedding location; and means for embedding the digital watermark signal element at the embedding location, using boosting by said means for boosting to mask audibility of a change in the audio signal made to embed the digital watermark signal. 2. The apparatus of claim 1 wherein said means for identifying investigates a spectral domain of a segment of the audio signal, and wherein said means for boosting boosts the audio signal at frequency locations where the audio signal has sparse spectral components. 3. The apparatus of claim 2 wherein said means for boosting applies an equalizer function to the segment. 4. The apparatus of claim 3 including means for controlling the equalizer function based on a measure of correlation of equalized audio segment relative to an original audio segment. 5. The apparatus of claim 4 in which said means for controlling varies the equalizer function over time segments, and keeping change due to applying the equalizer from segment to segment within a constraint. 6. An apparatus for embedding a digital watermark in an audio signal, the apparatus comprising: memory for storing an audio signal; means for evaluating whether an audio segment of the audio signal is stationary or non-stationary; means for adapting resolution of a perceptual model based on whether the audio segment is stationary or non-stationary, said means for adapting generating an adapted perceptual model; and a digital watermark embedder for a digital watermark into the audio segment using the adapted perceptual model. 7. An apparatus for detecting a digital watermark in an audio signal, the method comprising: an input to receive an audio signal, the audio signal comprising a digital watermark encoded therein; means for estimating rake receiver parameters using known attributes of a digital watermark signal in the audio signal; a rake receiver using the estimated rake receiver parameters, wherein the rake receiver detects reflections of a digital watermark signal due to multipath; and means for combining the reflections of the digital watermark signal to improve digital watermark signal to noise ratio. 8. An apparatus for embedding a digital watermark in an audio signal, the apparatus comprising: memory for storing an audio signal; a digital watermark embedder configured to produce a digital watermark signal for insertion into the electronic audio signal; means for evaluating perceptual audio quality of the electronic audio signal relative to changes of that audio signal corresponding to the digital watermark signal through application of a perceptual audio quality measure that computes audio quality parameters based on a human auditory model, including parameters for estimating quality based on a difference between the audio signal and a digital watermarked version of the audio signal, said means for evaluating generating an evaluation; means for changing a digital watermark embedding parameter based on the evaluation, said means for changing generating a changed watermark embedding parameter; and wherein the digital watermark embedder is configured to embed the digital watermark signal into the audio signal using the changed watermark embedding parameter. 9. The apparatus of claim 8 including: means for evaluating robustness of a watermarked audio signal using bit error rate or detection rate metrics for the generated digital watermark signal in the watermarked audio signal; and based on the robustness, prompting said means for changing to change the digital watermark embedding parameter. 10. The apparatus of claim 8 , the apparatus comprising: means for analyzing the audio signal for a harmonic; in which said digital watermark embedder is configured to structure, the digital watermark signal to be masked by the harmonic for embedding locations corresponding to the harmonic. 11. The apparatus of claim 10 including: means for detecting a complex tone including harmonics; in which said digital watermark embedder is configured to generate a digital watermark signal that exploits a harmonic relationship in the complex tone, including increasing a first harmonic and decreasing a second harmonic in the harmonic relationship. 12. The apparatus of 11 wherein said digital watermark embedder is configured to generate a digital watermark signal comprising a frequency domain signal with plural elements mapped to corresponding plural frequency locations in an audio frame, with the plural elements being structured having at least partially offsetting values in the first and second harmonics. 13. An apparatus for embedding a digital watermark in an audio signal, the method comprising: an input to receive an audio signal; means for generating a digital watermark signal using orthogonal frequency division multiplexing in which auxiliary data is modulated onto OFDM carrier signals; means for computing a frequency magnitude envelope for embedding locations in a frequency domain transform of the audio signal; and means for inserting the digital watermark signal by replacing audio signal frequency components with modulated OFDM carrier signals at the embedding locations while maintaining the frequency magnitude envelope at the embedding locations. 14. The apparatus of claim 13 comprising: means for generating a high frequency digital watermark signal by modulating a carrier signal using a set of frequency shaping patterns at a frequency range of 10 to 22 kHz; and means for inserting the digital watermark signal into carrier signal. 15. The apparatus of claim 13 , wherein the high frequency digital watermark signal is a time-varying signal. 16. The apparatus of claim 13 , wherein the high frequency digital watermark signal is a periodic signal. 17. The apparatus of claim 13 , wherein the high frequency digital watermark signal is a non-periodic signal. 18. The apparatus of claim 13 comprising means for weighting the audio signal with a weighting in a frequency range from 16 to at least 19 Khz, the weighting being selected to counter a drop in frequency response of audio equipment over the frequency range from 16 to at least 19 Khz.

Assignees

Inventors

Classifications

  • G10L19/018Primary

    Audio watermarking, i.e. embedding inaudible data in the audio signal · CPC title

  • using spectral analysis, e.g. transform vocoders or subband vocoders · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11990143B2 cover?
Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated i…
Who is the assignee on this patent?
Digimarc Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/018. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 21 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).