Multi-mode audio recognition and auxiliary data encoding and decoding
US-2017133022-A1 · May 11, 2017 · US
US9922658B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9922658-B2 |
| Application number | US-201615191855-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 24, 2016 |
| Priority date | Jun 26, 2015 |
| Publication date | Mar 20, 2018 |
| Grant date | Mar 20, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A challenge of audio watermarking systems in which an acoustic path is involved is the robustness against microphone pickup in case of surrounding noise. The strength of phase-based watermarking is increased by determining a masking threshold for a current frequency bin in a frequency/phase representation changing the phase based on that masking threshold and an allowed phase change value, calculating an allowed magnitude change value for the current frequency bin and calculating from an audio quality level value a magnitude change scaling factor for the magnitude change value, and increasing its magnitude accordingly.
Opening claim text (preview).
What is claimed is: 1. A method for increasing the strength of phase-based watermarking of an audio signal, which watermarked audio signal is suitable for acoustic reception and watermark detection in the presence of surrounding noise, said method including: receiving said audio signal; determining a masking threshold for a phase change based watermarking of a current frequency bin in a frequency/phase representation of said audio signal, wherein said masking threshold determination is controlled by a received audio quality level value representing the audio quality following said audio signal watermarking; determining an allowed phase change value for the phase of said current frequency bin, according to a reference angle to be embedded in that current frequency bin, which reference angle is derived from a watermark pattern; changing the phase of said current frequency bin according to said allowed phase change value; based on said masking threshold and said allowed phase change value, calculating an allowed magnitude change value for said current frequency bin, and calculating from the audio quality level value a magnitude change scaling factor; calculating a scaled allowed magnitude change values from said allowed magnitude change value and said scaling factor; increasing the magnitude of said current frequency bin by said scaled allowed magnitude change values; embedding said watermark into said current frequency bin with said changed phase and said increased magnitude; and providing the correspondingly watermarked current frequency bin suitable for acoustic reception and watermark detection in the presence of surrounding noise. 2. The method according to claim 1 , wherein no phase changes are carried out for frequency bins representing a frequency smaller than a first frequency threshold value and for frequency bins representing a frequency greater than a second frequency threshold value that is greater than said first frequency threshold value. 3. The method according to claim 1 , wherein a magnitude change value for said current frequency bin is denoted δX[i] and δX[i]=√{square root over (LT g [i] 2 −X[i] 2 +(X[i] cos(δφ[i])) 2 )}−X[i]+X[i] cos(δφ[i]), where LT g [i] is said current masking threshold, X[i] is the original magnitude of said current frequency bin, and δφ[i] is said current phase change value. 4. The method according to claim 1 , wherein said magnitude change scaling factor is denoted ƒ and f=10 −maskingCurveOffset/20 , where maskingCurveOffset = 100 - level 100 × 30 [ dB ] and level has a value between ‘0’ and ‘100’ and is said audio quality level value, with level=100 for the best audio quality. 5. An apparatus for increasing the strength of phase-based watermarking of an audio signal, which watermarked audio signal is suitable for acoustic reception and watermark detection in the presence of surrounding noise, said apparatus including means adapted to: receiving said audio signal; determining a masking threshold for a phase change based watermarking of a current frequency bin in a frequency/phase representation of said audio signal, wherein said masking threshold determination is controlled by a received audio quality level value representing the audio quality following said audio signal watermarking; determining an allowed phase change value for the phase of said current frequency bin, according to a reference angle to be embedded in that current frequency bin, which reference angle is derived from a watermark pattern; changing the phase of said current frequency bin according to said allowed phase change value; based on said masking threshold and said allowed phase change value, calculating an allowed magnitude change value for said current frequency bin, and calculating from the audio quality level value a magnitude change scaling factor; calculating a scaled allowed magnitude change values from said allowed magnitude change value and said scaling factor; increasing the magnitude of said current frequency bin by said scaled allowed magnitude change values; embedding said watermark into said current frequency bin with said changed phase and said increased magnitude; and providing the correspondingly watermarked current frequency bin suitable for acoustic reception and watermark detection in the presence of surrounding noise. 6. The apparatus according to claim 5 , wherein no phase changes are carried out for frequency bins representing a frequency smaller than a first frequency threshold value and for frequency bins representing a frequency greater than a second frequency threshold value that is greater than said first frequency threshold value. 7. The apparatus according to claim 5 , wherein a magnitude change value for said current frequency bin is denoted δX[i] and δ X [ i ]=√{square root over (LT g [ i ] 2 −X [ i ] 2 +( X [ i ] cos (δφ[ i ])) 2 )}− X [ i ]+ X [ i ] cos(δφ[ i ]), where LT g [i] is said current masking threshold, X[i] is the original magnitude of said current frequency bin, and δφ[i] is said current phase change value. 8. The apparatus according to claim 5 , wherein said magnitude change scaling factor is denoted ƒ and f=10 −maskingCurveOffset/20 , where maskingCurveOffset = 100 - level 100 × 30 [ dB ] and level has a value between ‘0’ and ‘100’ and is said audio quality level value, with level=100 for the best audio quality. 9. A non-transitory processor readable storage medium that contains or stores, or has recorded on it, a digital audio bitstream, said digital audio bitstream including a watermark embedded therein according to: determining a masking threshold for a phase change based watermarking of a current frequency bin in a frequency/phase representation of said audio signal, wherein said masking threshold determination is controlled by a received audio quality level value representing the audio quality following said audio signal watermarking; determining an allowed phase change value for the phase of said current frequency bin, according to a reference angle to be embedded in that current frequency bin, which reference angle is derived from a watermark pattern; changing the phase of said current frequency bin according to said allowed phase change value; based on said masking threshold and said allowed phase change value, calculating an allowed magnitude change value for said current frequency bin, and calculating from the audio quality level value a magnitude change scaling factor; calculating a scaled allowed magnitude change values from said allowed magnitude change value and said scaling factor; increasing the magnitude of said current frequency bin by said scaled allowed magnitude change va
Audio watermarking, i.e. embedding inaudible data in the audio signal · CPC title
using subband decomposition · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.