Audio data hiding based on perceptual masking and detection based on code multiplexing
US-9269363-B2 · Feb 23, 2016 · US
US9564139B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9564139-B2 |
| Application number | US-201514985047-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 30, 2015 |
| Priority date | Nov 2, 2012 |
| Publication date | Feb 7, 2017 |
| Grant date | Feb 7, 2017 |
Official abstract text for this publication.
A spread spectrum data hiding for audio signals is described. A set of pseudo-random noise sequences is added to an audio signal according to a data to be embedded. A masking curve is used to shape the added noise. A transient detection step can be used to control whether a shaped noise sequence is to be added or not. Embedded information is detected by first performing a whitening step and then performing a phase-only correlation with a same set of pseudo-random noise sequences. A detection method that is based on correlation of multiplexed noise sequences with a noise sequence embedded in the audio is also described.
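The embed-and-detect loop summarized in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the "masking curve" is replaced by a smoothed copy of the audio magnitude spectrum (a real system would use a psychoacoustic model), the transient test is a simple frame-energy ratio, and every name and parameter value (`make_pn_bank`, `has_transient`, `embed_frame`, `detect_symbol`, `strength`, `ratio`) is hypothetical.

```python
import numpy as np

def make_pn_bank(num_sequences, length, seed=0):
    """Hypothetical bank of +/-1 pseudo-random noise sequences, one per symbol."""
    rng = np.random.default_rng(seed)
    return rng.choice([-1.0, 1.0], size=(num_sequences, length))

def has_transient(frame, prev_frame, ratio=8.0):
    """Assumed transient test: flag a large energy jump between frames."""
    prev_energy = float(np.sum(prev_frame ** 2))
    energy = float(np.sum(frame ** 2))
    return energy > ratio * max(prev_energy, 1e-12)

def embed_frame(frame, prev_frame, pn, strength=0.3):
    """Add a spectrally shaped PN sequence unless the frame holds a transient."""
    if has_transient(frame, prev_frame):
        return frame.copy()  # transient frames are left unmarked
    audio_spec = np.fft.rfft(frame)
    noise_spec = np.fft.rfft(pn)
    # Stand-in for a masking curve: smoothed audio magnitude spectrum.
    mask = np.convolve(np.abs(audio_spec), np.ones(9) / 9, mode="same")
    # Keep the noise phase, impose the mask as its magnitude.
    unit_noise = noise_spec / np.maximum(np.abs(noise_spec), 1e-12)
    return np.fft.irfft(audio_spec + strength * mask * unit_noise, n=len(frame))

def whiten(spectrum):
    """Divide each bin a+ib by sqrt(a^2 + b^2), keeping only the phase."""
    return spectrum / np.maximum(np.abs(spectrum), 1e-12)

def phase_only_correlation(frame, pn):
    """Circular cross-correlation of the whitened frame and whitened PN."""
    product = whiten(np.fft.rfft(frame)) * np.conj(whiten(np.fft.rfft(pn)))
    return np.fft.irfft(product, n=len(frame))

def detect_symbol(frame, bank):
    """Pick the PN sequence with the largest zero-lag correlation."""
    peaks = [phase_only_correlation(frame, pn)[0] for pn in bank]
    return int(np.argmax(peaks))

if __name__ == "__main__":
    bank = make_pn_bank(4, 4096)
    host = np.random.default_rng(1).standard_normal(4096)
    marked = embed_frame(host, host, bank[2])
    print(detect_symbol(marked, bank))
```

The demo uses white noise as the host signal and an arbitrary `strength`; whether a watermark at that level would be inaudible is not something this sketch tries to establish.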
Opening claim text (preview).
What is claimed is:
1. A computer-implemented method to embed data in an audio signal, comprising: selecting a pseudo-random sequence according to desired data bits to be embedded in an audio frame; shaping a frequency spectrum of the pseudo-random sequence, thus obtaining a shaped frequency spectrum of the pseudo-random noise sequence; detecting, for audio signal frames, presence or absence of transients; and adding the shaped frequency spectrum of the pseudo-random noise sequence to a frequency spectrum of the audio signal, the adding occurring on an audio signal frame by audio signal frame basis, wherein, for audio signal frames for which presence of a transient is detected, the shaped frequency spectrum of the pseudo-random noise sequence is not added to the frequency spectrum of the audio signal.
2. The method of claim 1, wherein selecting the pseudo-random sequence comprises selecting the pseudo-random sequence from a plurality of concatenated pseudo-random sequences according to the data bits to be embedded.
3. The method of claim 2, wherein the number of concatenated pseudo-random sequences (L) is a function of the number of bits (B) representing the data to be embedded in the audio signal.
4. The method of claim 3, wherein $B = \log_2 L$.
5. A non-transitory computer-readable storage medium having stored thereon computer-executable instructions executable by a processor to detect embedded data in an audio signal, comprising: performing a phase-only correlation between a frequency spectrum of the audio signal with embedded data and a noise sequence; and performing a detection decision based on a result of the phase-only correlation, wherein the data embedded in the audio signal is embedded according to a method comprising: selecting a pseudo-random sequence according to desired data bits to be embedded in an audio frame; shaping a frequency spectrum of the pseudo-random sequence, thus obtaining a shaped frequency spectrum of the pseudo-random noise sequence; detecting, for audio signal frames, presence or absence of transients; and adding the shaped frequency spectrum of the pseudo-random noise sequence to a frequency spectrum of the audio signal, the adding occurring on an audio signal frame by audio signal frame basis, wherein, for audio signal frames for which presence of a transient is detected, the shaped frequency spectrum of the pseudo-random noise sequence is not added to the frequency spectrum of the audio signal.
6. The non-transitory computer-readable storage medium according to claim 5, wherein the embedded data has been embedded based on one or more pseudo-random noise sequences of a plurality of a set of unmultiplexed pseudo-random noise sequences; and performing the phase-only correlation comprises performing the phase-only correlation a plurality of times against a set of multiplexed pseudo-random noise sequences.
7. The non-transitory computer-readable storage medium of claim 6, wherein the set of multiplexed pseudo-random noise sequences comprises a smaller number of pseudo-noise sequences than the number of pseudo-noise sequences in the set of unmultiplexed pseudo-random noise sequences.
8. The non-transitory computer-readable storage medium according to claim 7, wherein the multiplexed noise sequences are derived from a subset of the set of unmultiplexed pseudo-noise sequences by circularly shifting each pseudo-noise sequence in the subset by a unique amount and accumulating.
9. The non-transitory computer-readable storage medium according to claim 7, wherein phase-only correlation between the frequency spectrum of the audio signal with embedded data and the frequency spectrum of the pseudo-random noise sequence is performed a number of times in relation to the number of multiplexed pseudo-random noise sequences.
10. The non-transitory computer-readable storage medium according to claim 9, wherein the number of times phase-only correlation is performed is one.
11. The non-transitory computer-readable storage medium according to claim 7, wherein performing phase-only correlation comprises: computing a correlation between the noise sequences embedded in the audio signal and the set of multiplexed noise pseudo-random sequences; and identifying a location of a peak in a correlation value that relates to the data embedded in the audio signal.
12. The non-transitory computer-readable storage medium according to claim 5, further comprising performing whitening of the audio signal with the embedded data before performing phase-only correlation, wherein the whitening of the audio signal is performed by dividing the complex number in each frequency bin ($a + ib$) by its absolute value ($\sqrt{a^2 + b^2}$).
13. The non-transitory computer-readable storage medium according to claim 5, wherein selecting the pseudo-random sequence comprises selecting the pseudo-random sequence from a plurality of concatenated pseudo-random sequences according to the data bits to be embedded.
14. The non-transitory computer-readable storage medium according to claim 13, wherein the number of concatenated pseudo-random sequences (L) is a function of the number of bits (B) representing the data to be embedded in the audio signal.
15. The non-transitory computer-readable storage medium according to claim 14, wherein $B = \log_2 L$.
16. A system to embed data in an audio signal, the system comprising: a processor configured to: select a pseudo-random sequence according to desired data bits to be embedded in an audio frame; shape a frequency spectrum of the pseudo-random sequence, thus obtaining a shaped frequency spectrum of the pseudo-random noise sequence; detect, for audio signal frames, presence or absence of transients; and add the shaped frequency spectrum of the pseudo-random noise sequence to a frequency spectrum of the audio signal, the adding occurring on an audio signal frame by audio signal frame basis, wherein, for audio signal frames for which presence of a transient is detected, the shaped frequency spectrum of the pseudo-random noise sequence is not added to the frequency spectrum of the audio signal.
17. The system according to claim 16, further comprising: a memory for storing computer-executable instructions accessible by said processor for embedding the data in the audio signal; and an input/output device configured to, at least, receive the audio signal and provide the audio signal to the processor.
18. The system according to claim 16, wherein the processor is further configured to select the pseudo-random sequence from a plurality of concatenated pseudo-random sequences according to the data bits to be embedded.
19. The system according to claim 18, wherein the number of concatenated pseudo-random sequences (L) is a function of the number of bits (B) representing the data to be embedded in the audio signal.
20. The system according to claim 19, wherein $B = \log_2 L$.
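The multiplexed detection recited in the claims (circularly shift each pseudo-noise sequence by a unique amount, accumulate, correlate once, and read the data from the peak location) might look like the sketch below. It is an illustration only: the evenly spaced shifts, the nearest-shift decoding rule, and all function names are assumptions, and the demo embeds the sequence at an unrealistically high level, with no perceptual shaping, so that the peak is easy to see.

```python
import numpy as np

def make_pn_bank(num_sequences, length, seed=0):
    """Hypothetical bank of L unmultiplexed +/-1 pseudo-noise sequences."""
    rng = np.random.default_rng(seed)
    return rng.choice([-1.0, 1.0], size=(num_sequences, length))

def bits_per_frame(num_sequences):
    """B = log2(L): bits carried by choosing one of L sequences."""
    return int(np.log2(num_sequences))

def multiplex(bank):
    """Circularly shift each sequence by a unique amount and accumulate."""
    num, length = bank.shape
    shifts = np.arange(num) * (length // num)  # assumed: evenly spaced shifts
    mux = np.sum([np.roll(pn, s) for pn, s in zip(bank, shifts)], axis=0)
    return mux, shifts

def whiten(spectrum):
    """Divide each bin by its absolute value, keeping only the phase."""
    return spectrum / np.maximum(np.abs(spectrum), 1e-12)

def detect_multiplexed(frame, mux, shifts):
    """One phase-only correlation; the peak location identifies the symbol."""
    n = len(frame)
    product = whiten(np.fft.rfft(frame)) * np.conj(whiten(np.fft.rfft(mux)))
    poc = np.fft.irfft(product, n=n)
    peak = int(np.argmax(poc))
    # A sequence delayed by s inside mux produces a peak at lag (-s) mod n.
    shift_est = (-peak) % n
    dist = np.abs(shifts - shift_est)
    dist = np.minimum(dist, n - dist)  # circular distance to each known shift
    return int(np.argmin(dist))

if __name__ == "__main__":
    bank = make_pn_bank(4, 4096)
    mux, shifts = multiplex(bank)
    host = 0.5 * np.random.default_rng(1).standard_normal(4096)
    for symbol in range(len(bank)):
        received = host + bank[symbol]  # strong, unshaped watermark for clarity
        print(symbol, detect_multiplexed(received, mux, shifts))
```

With a single multiplexed sequence the detector runs one correlation instead of one per candidate sequence. The trade-off is that the other accumulated sequences act as extra noise in that correlation, which lowers the peak relative to correlating against the embedded sequence alone.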
Audio watermarking, i.e. embedding inaudible data in the audio signal · CPC title