Bit rate modification based on an ambient interference level
US-2015379997-A1 · Dec 31, 2015 · US
US9984692B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9984692-B2 |
| Application number | US-201514970320-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 15, 2015 |
| Priority date | Mar 6, 2014 |
| Publication date | May 29, 2018 |
| Grant date | May 29, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A post-encoding bitrate reduction system and method for generating one more scaled compressed bitstreams from a single encoded plenary file. The plenary file contains multiple audio object files that were encoded separately using a scalable encoding process having fine-grained scalability. Activity in the data frames of the encoded audio object files at a time period are compared with each other to obtain a data frame activity comparison. Bits from an available bitpool are assigned to all of the data frames based on the data frame activity comparison and corresponding hierarchical metadata. The plenary file is scaled down by truncating bits in the data frames to conform to the bit allocation. In some embodiments frame activity is compared to a silence threshold and the data frame contains silence if the frame activity is less than or equal to the threshold and minimal bits are used to represent the silent frame.
Opening claim text (preview).
What is claimed is: 1. A method for obtaining multiple scaled compressed bitstreams from a single plenary file, comprising: separately encoding a plurality of audio object files to obtain a plurality of encoded audio object files at a plenary bitrate using a scalable bitstream encoder having fine-grained scalability that ranks bits in each data frame of the encoded audio object files in an order of psychoacoustic importance to human hearing; generating the plenary file at the plenary bitrate by combining the plurality of independently encoded audio object files and corresponding hierarchical metadata that includes a user-defined priority for each of the independently encoded audio object files to rank the files in order of importance to such that the user defines one of the audio object files as more important to the user than another one of the audio object files; constructing a first scaled compressed bitstream at a first target bitrate from the plenary file and assigning each and every one of the independently encoded audio object files at least some bits based on the priority such that higher priority files are assigned more bits as compared to lower priority files; and constructing a second scaled compressed bitstream at a second target bitrate from the plenary file such that multiple scaled bitstreams at different target bitrates are obtained from the single plenary file without any re-encoding of the plurality of encoded audio object files for playback on an embedded device having a processor; wherein the first target bitrate and the second target bitrate are different from each other and are both less than the plenary bitrate. 2. The method of claim 1 , wherein the first target bitrate is a maximum bitrate at which the first scaled compressed bitstream will be transmitted. 3. The method of claim 2 , wherein each of the plurality of encoded audio object files is persistent and present for an entire duration of the plenary file. 4. The method of claim 3 , further comprising: comparing a data frame activity of data frames of each of the plurality of encoded audio files at a selected time period to each other to obtain a data frame activity comparison; assigning bits to each of the data frames of the encoded audio object files at the selected time period based on the data frame activity comparison and the first target bitrate to obtain a bit allocation for the selected time period; scaling down the plenary file by truncating bits of the data frames in accordance with the bit allocation to achieve the first target bitrate and obtain bit-reduced encoded audio object files; and multiplexing together and packing the bit-reduced audio object files into the first scaled compressed bitstream at the first target bitrate. 5. The method of claim 4 , further comprising: transmitting the first scaled compressed bitstream to a receiving device at the first target bitrate; and decoding the first scaled compressed bitstream to obtain decoded audio objects. 6. The method of claim 5 , further comprising mixing the decoded audio objects to create an audio objects mix for playback on an embedded device having a processor. 7. The method of claim 1 , wherein the hierarchical metadata further comprises cross-object masking such that one of the audio object files that is not masked has priority over one of the audio object files that is masked. 8. A post-encoding bitrate reduction system, comprising: one or more scalable bitstream encoders for separately encoding a plurality of audio object files to obtain a plurality of encoded audio object files at a plenary bitrate, the one or more scalable bitstream encoders having fine-grained scalability that ranks bits in each data frame of the encoded audio object files in an order of psychoacoustic importance to human hearing; a plenary file formed by combining the plurality of separately encoded audio object files at the plenary bitrate together with corresponding hierarchical metadata that includes at least one of the following: (a) a user-defined priority for each of the separately encoded audio object files to rank the files in order of importance to a user such that the user defines one of the audio object files as more important to the user than another one of the audio object files; and (b) cross-object masking such that one of the audio object files that is not masked has priority over one of the audio object files that is masked; and a bit reduction module for constructing a first scaled compressed bitstream for playback on an embedded device having a processor from the plenary file at a first target bitrate and assigning each and every one of the separately encoded audio object files at least some bits based on the priority such that higher priority files are assigned more bits as compared to lower priority files; and constructing a second scaled compressed bitstream from the plenary file at a second target bitrate different from the first target bitrate, wherein both the first target bitrate and the second target bitrate are less than the plenary bitrate, and are obtained from the plenary file without re-encoding the plurality of audio object files. 9. The system of claim 8 , wherein the first target bitrate is a maximum bitrate at which the first scaled compressed bitstream will be transmitted. 10. The system of claim 8 , wherein each of the plurality of encoded audio object files is persistent and present for an entire duration of the plenary file. 11. The system of claim 8 , wherein the bit reduction module further comprises: a bit-allocation module for comparing a data frame activity of data frames of each of the plurality of encoded audio files at a selected time period to each other to obtain a data frame activity comparison; assigning bits to each of the data frames of the encoded audio object files at the selected time period based on the data frame activity comparison and the first target bitrate to obtain a bit allocation for the selected time period; and scaling down the plenary file by truncating bits of the data frames in accordance with the bit allocation to achieve the first target bitrate and obtain bit-reduced encoded audio object files; and a multiplexer and bitstream-packer for multiplexing together the bit-reduced audio object files and then packing the bit-reduced audio object files into the first scaled compressed bitstream at the first target bitrate. 12. The system of claim 11 , further comprising: a communication link for transmitting the first scaled compressed bitstream to a receiving device at the first target bitrate; and a scalable bitstream-decoder at the receiving device for decoding the first scaled compressed bitstream to obtain decoded audio objects. 13. The system of claim 12 , further comprising a mixing device at the receiving device for mixing the decoded audio objects and generating an audio objects mix for playback on an embedded device having a processor.
Dynamic bit allocation (for perceptual audio coders G10L19/032) · CPC title
Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities · CPC title
Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking · CPC title
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.