Audio data hiding based on perceptual masking and detection based on code multiplexing

US2016111102A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016111102-A1
Application numberUS-201514985047-A
CountryUS
Kind codeA1
Filing dateDec 30, 2015
Priority dateNov 2, 2012
Publication dateApr 21, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A spread spectrum data hiding for audio signals is described. A set of pseudo-random noise sequences is added to an audio signal according to a data to be embedded. A masking curve is used to shape the added noise. A transient detection step can be used to control whether a shaped noise sequence is to be added or not. Embedded information is detected by first performing a whitening step and then performing a phase-only correlation with a same set of pseudo-random noise sequences. A detection method that is based on correlation of multiplexed noise sequences with a noise sequence embedded in the audio is also described.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method to embed data in an audio signal, comprising: selecting a pseudo-random sequence according to desired data bits to be embedded in an audio frame; shaping a frequency spectrum of the pseudo-random sequence, thus obtaining a shaped frequency spectrum of the pseudo-random noise sequence; detecting, for audio signal frames, presence or absence of transients; and adding the shaped frequency spectrum of the pseudo-random noise sequence to a frequency spectrum of the audio signal, the adding occurring on an audio signal frame by audio signal frame basis, wherein, for audio signal frames for which presence of a transient is detected, the shaped frequency spectrum of the pseudo-random noise sequence is not added to the frequency spectrum of the audio signal. 2 . The method of claim 1 , wherein selecting the pseudo-random sequence comprises selecting the pseudo-random sequence from a plurality of concatenated pseudo-random sequences according to the data bits to be embedded. 3 . The method of claim 2 , wherein the number of concatenated pseudo-random sequences (L) is a function of the number of bits (B) representing the data to be embedded in the audio signal. 4 . The method of claim 3 , wherein B=log 2 L. 5 . A non-transitory computer-readable storage medium having stored thereon computer-executable instructions executable by a processor to detect embedded data in an audio signal, comprising: performing a phase-only correlation between a frequency spectrum of the audio signal with embedded data and a noise sequence; and performing a detection decision based on a result of the phase-only correlation, wherein the data embedded in the audio signal is embedded according to a method comprising: selecting a pseudo-random sequence according to desired data bits to be embedded in an audio frame; shaping a frequency spectrum of the pseudo-random sequence, thus obtaining a shaped frequency spectrum of the pseudo-random noise sequence; detecting, for audio signal frames, presence or absence of transients; and adding the shaped frequency spectrum of the pseudo-random noise sequence to a frequency spectrum of the audio signal, the adding occurring on an audio signal frame by audio signal frame basis, wherein, for audio signal frames for which presence of a transient is detected, the shaped frequency spectrum of the pseudo-random noise sequence is not added to the frequency spectrum of the audio signal. 6 . The non-transitory computer-readable storage medium according to claim 5 , wherein the embedded data has been embedded based on one or more pseudo-random noise sequences of a plurality of a set of unmultiplexed pseudo-random noise sequences; and performing the phase-only correlation comprises performing the phase-only correlation a plurality of times against a set of multiplexed pseudo-random noise sequences. 7 . The non-transitory computer-readable storage medium of claim 6 , wherein the set of multiplexed pseudo-random noise sequences comprises a smaller number of pseudo-noise sequences than the number of pseudo-noise sequences in the set of unmultiplexed pseudo-random noise sequences. 8 . The non-transitory computer-readable storage medium according to claim 7 , wherein the multiplexed noise sequences are derived from a subset of the set of unmultiplexed pseudo-noise sequences by circularly shifting each pseudo-noise sequence in the subset by a unique amount and accumulating. 9 . The non-transitory computer-readable storage medium according to claim 7 , wherein phase-only correlation between the frequency spectrum of the audio signal with embedded data and the frequency spectrum of the pseudo-random noise sequence is performed a number of times in relation to the number of multiplexed pseudo-random noise sequences. 10 . The non-transitory computer-readable storage medium according to claim 9 , wherein the number of times phase-only correlation is performed is one. 11 . The non-transitory computer-readable storage medium according to claim 7 , wherein performing phase-only correlation comprises: computing a correlation between the noise sequences embedded in the audio signal and the set of multiplexed noise pseudo-random sequences; and identifying a location of a peak in a correlation value that relates to the data embedded in the audio signal. 12 . The non-transitory computer-readable storage medium according to claim 5 , further comprising performing whitening of the audio signal with the embedded data before performing phase-only correlation, wherein the whitening of the audio signal is performed by dividing the complex number in each frequency bin (a+ib) by its absolute value (sqrt(a 2 +b 2 )). 13 . The non-transitory computer-readable storage medium according to claim 5 , wherein selecting the pseudo-random sequence comprises selecting the pseudo-random sequence from a plurality of concatenated pseudo-random sequences according to the data bits to be embedded. 14 . The non-transitory computer-readable storage medium according to claim 13 , wherein the number of concatenated pseudo-random sequences (L) is a function of the number of bits (B) representing the data to be embedded in the audio signal. 15 . The non-transitory computer-readable storage medium according to claim 14 , wherein B=log 2 L. 16 . A system to embed data in an audio signal, the system comprising: a processor configured to: select a pseudo-random sequence according to desired data bits to be embedded in an audio frame; shape a frequency spectrum of the pseudo-random sequence, thus obtaining a shaped frequency spectrum of the pseudo-random noise sequence; detect, for audio signal frames, presence or absence of transients; and add the shaped frequency spectrum of the pseudo-random noise sequence to a frequency spectrum of the audio signal, the adding occurring on an audio signal frame by audio signal frame basis, wherein, for audio signal frames for which presence of a transient is detected, the shaped frequency spectrum of the pseudo-random noise sequence is not added to the frequency spectrum of the audio signal. 17 . The system according to claim 16 , further comprising: a memory for storing computer-executable instructions accessible by said processor for embedding the data in the audio signal; and an input/output device configured to, at least, receive the audio signal and provide the audio signal to the processor. 18 . The system according to claim 16 , wherein the processor is further configured to select the pseudo-random sequence from a plurality of concatenated pseudo-random sequences according to the data bits to be embedded. 19 . The system according to claim 18 , wherein the number of concatenated pseudo-random sequences (L) is a function of the number of bits (B) representing the data to be embedded in the audio signal. 20 . The system according to claim 19 , wherein B=log 2 L.

Assignees

Inventors

Classifications

  • G10L19/018Primary

    Audio watermarking, i.e. embedding inaudible data in the audio signal · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016111102A1 cover?
A spread spectrum data hiding for audio signals is described. A set of pseudo-random noise sequences is added to an audio signal according to a data to be embedded. A masking curve is used to shape the added noise. A transient detection step can be used to control whether a shaped noise sequence is to be added or not. Embedded information is detected by first performing a whitening step and the…
Who is the assignee on this patent?
Dolby Lab Licensing Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/018. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Apr 21 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).