Who is the assignee on this patent?

Yang Qi, Feng Jie, Vixs Systems Inc

What technology area does this patent fall under?

Primary CPC classification G10L19/018. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Apr 11 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Watermark insertion in frequency domain for audio encoding/decoding/transcoding

US9620133B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9620133-B2
Application number	US-201314096959-A
Country	US
Kind code	B2
Filing date	Dec 4, 2013
Priority date	Dec 4, 2013
Publication date	Apr 11, 2017
Grant date	Apr 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An audio processing device includes an initial processing module to generate a stream of frequency coefficients based on input audio data, a watermarking module to embed a digital watermark into the stream of frequency coefficients to generate a modified stream of frequency coefficients, and a final processing module to process the modified stream of frequency coefficients to generate output audio data. In some implementations, the input audio data comprises unencoded audio data, the initial processing module comprises a frequency domain transform module to perform a time-to-frequency domain transform to generate the unencoded audio data, and the output audio data is encoded audio data. In other instances, the input audio data comprises encoded audio data, the initial processing module comprises an initial decoding module to partially decode the encoded audio data to generate the stream of frequency coefficients, and the output audio data is decoded audio data.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: generating, at a processor of an audio processing device, a stream of frequency coefficients based on input audio data; embedding, at the processor, a digital watermark comprising a set of bit values into the stream of frequency coefficients to generate a modified stream of frequency coefficients, wherein embedding the digital watermark comprises: for each bit value of the set of bit values, modifying a corresponding set of frequency coefficients of the stream of frequency coefficients based on the bit value to generate a corresponding set of the modified stream of frequency coefficients, wherein modifying the corresponding set of frequency coefficients based on the bit value comprises: for each frequency coefficient of the set: performing a linear add of the frequency coefficient and a non-zero first value to generate a corresponding modified frequency coefficient responsive to the bit value comprising a one (1); and performing a linear add of the frequency coefficient and a non-zero second value to generate a corresponding modified frequency coefficient responsive to the bit value comprising a zero (0); wherein the non-zero first value differs from the non-zero second value; and processing, at the processor, the modified stream of frequency coefficients to generate output audio data. 2. The method of claim 1 , wherein: the input audio data comprises unencoded audio data; generating the stream of frequency coefficients comprises performing a time-to-frequency domain transform to the unencoded audio data; and the output audio data is encoded audio data. 3. The method of claim 2 , wherein the time-to-frequency domain transform comprises at least one of: a Modified Discrete Cosine Transform (MDCT); and a Fast Fourier Transform (FFT). 4. The method of claim 1 , wherein: the input audio data comprises encoded audio data; generating the stream of frequency coefficients comprises partially decoding the encoded audio data to generate the stream of frequency coefficients; and the output audio data is decoded audio data. 5. The method of claim 1 , further comprising: detecting a presence of the digital watermark in the output audio data. 6. The method of claim 5 , wherein detecting the presence of the digital watermark comprises detecting the presence of the digital watermark using at least one of: an average detector; and a most-likelihood detector. 7. An audio processing device comprising: a non-transitory storage medium to store a set of executable instructions; and at least one processor coupled to the storage medium, the at least one processor to execute the set of executable instructions, wherein the set of executable instructions, when executed by the at least one processor, manipulate the at least one processor to generate a stream of frequency coefficients based on input audio data; embed a digital watermark comprising a set of bit values into the stream of frequency coefficients to generate a modified stream of frequency coefficients; process the modified stream of frequency coefficients to generate output audio data; wherein the set of executable instructions are to manipulate the at least one processor to embed the digital watermark by manipulating the processor to modify, for each bit value of the set of bit values, a corresponding set of frequency coefficients of the stream of frequency coefficients based on the bit value to generate a corresponding set of the modified stream of frequency coefficients; and wherein the set of executable instructions are to manipulate the at least one processor to modify the corresponding set of frequency coefficients based on the bit value by manipulating the at least one processor to: for each frequency coefficient of the set of frequency coefficients: perform a linear add of the frequency coefficient and a non-zero first value to generate a corresponding modified frequency coefficient responsive to the bit value comprising a one (1); and perform a linear add of the frequency coefficient and a non-zero second value to generate a corresponding modified frequency coefficient responsive to the bit value comprising a zero (0); and wherein the non-zero first value differs from the non-zero second value. 8. The audio processing device of claim 7 , wherein: the input audio data comprises unencoded audio data; the set of executable instructions are to manipulate the at least one processor to perform a time-to-frequency domain transform to generate the unencoded audio data; and the output audio data is encoded audio data. 9. The audio processing device of claim 8 , wherein the time-to-frequency domain transform comprises at least one of: a Modified Discrete Cosine Transform (MDCT); and a Fast Fourier Transform (FFT). 10. The audio processing device of claim 7 , wherein: the input audio data comprises encoded audio data; the set of executable instructions are to manipulate the at least one processor to partially decode the encoded audio data to generate the stream of frequency coefficients; and the output audio data is decoded audio data. 11. A non-transitory computer readable storage medium storing a set of instructions, the set of instructions to manipulate at least one processor to: generate a stream of frequency coefficients based on input audio data; embed a digital watermark comprising a set of bit values into the stream of frequency coefficients to generate a modified stream of frequency coefficients, wherein the instructions to manipulate the at least one processor to embed the digital watermark comprise instructions to manipulate the at least one processor to: for each bit value of the set of bit values, modify a corresponding set of frequency coefficients of the stream of frequency coefficients based on the bit value to generate a corresponding set of the modified stream of frequency coefficients, wherein the instructions to manipulate the at least one processor to modify the corresponding set of frequency coefficients based on the bit value comprise instructions to manipulate the at least one processor to: for each frequency coefficient of the set: perform a linear add of the frequency coefficient and a non-zero first value to generate a corresponding modified frequency coefficient responsive to the bit value comprising a one (1); and perform a linear add of the frequency coefficient and a non-zero second value to generate a corresponding modified frequency coefficient responsive to the bit value comprising a zero (0); wherein the non-zero first value differs from the non-zero second value; and process the modified stream of frequency coefficients to generate output audio data. 12. The non-transitory computer readable storage medium of claim 11 , wherein: the input audio data comprises unencoded audio data; the set of instructions to manipulate at least one processor to generate the stream of frequency coefficients comprises a set of instructions to manipulate at least one processor to perform a time-to-frequency domain transform to the unencoded audio data; and the output audio data is encoded audio data. 13. The non-transitory computer readable storage medium of claim 12 , wherein the time-to-frequency domain transform comprises at least one of: a Modified Discrete Cosine Transform (MDCT); and a Fast Fourier Transform (FFT). 14. The non-transitory computer readable storage medium of claim 11 , wherein: the input audio data comprises encoded audio data; the set of instructions to manipulate at least one processor to generate the stream of frequency coefficients comprises a set of

Assignees

Inventors

Classifications

G10L19/018Primary
Audio watermarking, i.e. embedding inaudible data in the audio signal · CPC title
G10L19/02
using spectral analysis, e.g. transform vocoders or subband vocoders · CPC title

Patent family

Related publications grouped by family.

View patent family 51951576

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9620133B2 cover?: An audio processing device includes an initial processing module to generate a stream of frequency coefficients based on input audio data, a watermarking module to embed a digital watermark into the stream of frequency coefficients to generate a modified stream of frequency coefficients, and a final processing module to process the modified stream of frequency coefficients to generate output au…
Who is the assignee on this patent?: Yang Qi, Feng Jie, Vixs Systems Inc
What technology area does this patent fall under?: Primary CPC classification G10L19/018. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Apr 11 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).