What technology area does this patent fall under?

Primary CPC classification G10L19/002. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue May 29 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Post-encoding bitrate reduction of multiple object audio

US9984692B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9984692-B2
Application number	US-201514970320-A
Country	US
Kind code	B2
Filing date	Dec 15, 2015
Priority date	Mar 6, 2014
Publication date	May 29, 2018
Grant date	May 29, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A post-encoding bitrate reduction system and method for generating one more scaled compressed bitstreams from a single encoded plenary file. The plenary file contains multiple audio object files that were encoded separately using a scalable encoding process having fine-grained scalability. Activity in the data frames of the encoded audio object files at a time period are compared with each other to obtain a data frame activity comparison. Bits from an available bitpool are assigned to all of the data frames based on the data frame activity comparison and corresponding hierarchical metadata. The plenary file is scaled down by truncating bits in the data frames to conform to the bit allocation. In some embodiments frame activity is compared to a silence threshold and the data frame contains silence if the frame activity is less than or equal to the threshold and minimal bits are used to represent the silent frame.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for obtaining multiple scaled compressed bitstreams from a single plenary file, comprising: separately encoding a plurality of audio object files to obtain a plurality of encoded audio object files at a plenary bitrate using a scalable bitstream encoder having fine-grained scalability that ranks bits in each data frame of the encoded audio object files in an order of psychoacoustic importance to human hearing; generating the plenary file at the plenary bitrate by combining the plurality of independently encoded audio object files and corresponding hierarchical metadata that includes a user-defined priority for each of the independently encoded audio object files to rank the files in order of importance to such that the user defines one of the audio object files as more important to the user than another one of the audio object files; constructing a first scaled compressed bitstream at a first target bitrate from the plenary file and assigning each and every one of the independently encoded audio object files at least some bits based on the priority such that higher priority files are assigned more bits as compared to lower priority files; and constructing a second scaled compressed bitstream at a second target bitrate from the plenary file such that multiple scaled bitstreams at different target bitrates are obtained from the single plenary file without any re-encoding of the plurality of encoded audio object files for playback on an embedded device having a processor; wherein the first target bitrate and the second target bitrate are different from each other and are both less than the plenary bitrate. 2. The method of claim 1 , wherein the first target bitrate is a maximum bitrate at which the first scaled compressed bitstream will be transmitted. 3. The method of claim 2 , wherein each of the plurality of encoded audio object files is persistent and present for an entire duration of the plenary file. 4. The method of claim 3 , further comprising: comparing a data frame activity of data frames of each of the plurality of encoded audio files at a selected time period to each other to obtain a data frame activity comparison; assigning bits to each of the data frames of the encoded audio object files at the selected time period based on the data frame activity comparison and the first target bitrate to obtain a bit allocation for the selected time period; scaling down the plenary file by truncating bits of the data frames in accordance with the bit allocation to achieve the first target bitrate and obtain bit-reduced encoded audio object files; and multiplexing together and packing the bit-reduced audio object files into the first scaled compressed bitstream at the first target bitrate. 5. The method of claim 4 , further comprising: transmitting the first scaled compressed bitstream to a receiving device at the first target bitrate; and decoding the first scaled compressed bitstream to obtain decoded audio objects. 6. The method of claim 5 , further comprising mixing the decoded audio objects to create an audio objects mix for playback on an embedded device having a processor. 7. The method of claim 1 , wherein the hierarchical metadata further comprises cross-object masking such that one of the audio object files that is not masked has priority over one of the audio object files that is masked. 8. A post-encoding bitrate reduction system, comprising: one or more scalable bitstream encoders for separately encoding a plurality of audio object files to obtain a plurality of encoded audio object files at a plenary bitrate, the one or more scalable bitstream encoders having fine-grained scalability that ranks bits in each data frame of the encoded audio object files in an order of psychoacoustic importance to human hearing; a plenary file formed by combining the plurality of separately encoded audio object files at the plenary bitrate together with corresponding hierarchical metadata that includes at least one of the following: (a) a user-defined priority for each of the separately encoded audio object files to rank the files in order of importance to a user such that the user defines one of the audio object files as more important to the user than another one of the audio object files; and (b) cross-object masking such that one of the audio object files that is not masked has priority over one of the audio object files that is masked; and a bit reduction module for constructing a first scaled compressed bitstream for playback on an embedded device having a processor from the plenary file at a first target bitrate and assigning each and every one of the separately encoded audio object files at least some bits based on the priority such that higher priority files are assigned more bits as compared to lower priority files; and constructing a second scaled compressed bitstream from the plenary file at a second target bitrate different from the first target bitrate, wherein both the first target bitrate and the second target bitrate are less than the plenary bitrate, and are obtained from the plenary file without re-encoding the plurality of audio object files. 9. The system of claim 8 , wherein the first target bitrate is a maximum bitrate at which the first scaled compressed bitstream will be transmitted. 10. The system of claim 8 , wherein each of the plurality of encoded audio object files is persistent and present for an entire duration of the plenary file. 11. The system of claim 8 , wherein the bit reduction module further comprises: a bit-allocation module for comparing a data frame activity of data frames of each of the plurality of encoded audio files at a selected time period to each other to obtain a data frame activity comparison; assigning bits to each of the data frames of the encoded audio object files at the selected time period based on the data frame activity comparison and the first target bitrate to obtain a bit allocation for the selected time period; and scaling down the plenary file by truncating bits of the data frames in accordance with the bit allocation to achieve the first target bitrate and obtain bit-reduced encoded audio object files; and a multiplexer and bitstream-packer for multiplexing together the bit-reduced audio object files and then packing the bit-reduced audio object files into the first scaled compressed bitstream at the first target bitrate. 12. The system of claim 11 , further comprising: a communication link for transmitting the first scaled compressed bitstream to a receiving device at the first target bitrate; and a scalable bitstream-decoder at the receiving device for decoding the first scaled compressed bitstream to obtain decoded audio objects. 13. The system of claim 12 , further comprising a mixing device at the receiving device for mixing the decoded audio objects and generating an audio objects mix for playback on an embedded device having a processor.

Assignees

Dts Inc

Inventors

Fejzo Zoran

Classifications

G10L19/002Primary
Dynamic bit allocation (for perceptual audio coders G10L19/032) · CPC title
H04N21/2662
Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities · CPC title
H04N19/132
Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking · CPC title
G10L19/008
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
G10L19/24
Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding · CPC title

Patent family

Related publications grouped by family.

View patent family 54017971

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9984692B2 cover?: A post-encoding bitrate reduction system and method for generating one more scaled compressed bitstreams from a single encoded plenary file. The plenary file contains multiple audio object files that were encoded separately using a scalable encoding process having fine-grained scalability. Activity in the data frames of the encoded audio object files at a time period are compared with each othe…
Who is the assignee on this patent?: Dts Inc
What technology area does this patent fall under?: Primary CPC classification G10L19/002. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue May 29 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).