Decoding device, decoding method, encoding device, encoding method, and program

US9542952B2 · US · B2

Patent metadata
Field: Value
Publication number: US-9542952-B2
Application number: US-201314238265-A
Country: US
Kind code: B2
Filing date: Jun 24, 2013
Priority date: Jul 2, 2012
Publication date: Jan 10, 2017
Grant date: Jan 10, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present technique relates to a decoding device, a decoding method, an encoding device, an encoding method, and a program which can obtain a high-quality realistic sound. The encoding device stores speaker arrangement information in a comment region in a PCE of an encoded bit stream and stores a synchronous word and identification information in the comment region such that other public comments and the speaker arrangement information stored in the comment region can be distinguished from each other. When an encoded bit stream is decoded, it is determined whether the speaker arrangement information is stored on the basis of the synchronous word and the identification information stored in the comment region. Audio data included in the encoded bit stream is output according to the arrangement of the speakers corresponding to the determination result. The present technique can be applied to an encoding device.
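
The storage scheme in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the two-byte `SYNC_WORD` value, the `HEIGHT_CODES` mapping, the one-byte count field, the record layout, and the use of the low byte of `zlib.crc32` as the identification information are all hypothetical and are not taken from the patent. The abstract only says that a synchronous word and identification information let a decoder tell speaker arrangement information apart from other comments in the same region.

```python
import zlib

# Hypothetical values chosen for illustration; the patent text shown here
# does not specify the synchronous word or the height encoding.
SYNC_WORD = b"\xA5\x5A"
HEIGHT_CODES = {"normal": 0, "top": 1, "bottom": 2}


def make_record(heights):
    """Build a record: sync word, count, one code per speaker, check byte."""
    payload = bytes(HEIGHT_CODES[h] for h in heights)
    check = zlib.crc32(payload) & 0xFF  # stand-in for the identification information
    return SYNC_WORD + bytes([len(payload)]) + payload + bytes([check])


def find_record(comment):
    """Return the payload of the first valid record in the comment bytes, or None."""
    start = comment.find(SYNC_WORD)
    while start != -1:
        header_end = start + len(SYNC_WORD)
        if header_end < len(comment):
            count = comment[header_end]
            payload = comment[header_end + 1:header_end + 1 + count]
            check_pos = header_end + 1 + count
            # Accept the record only if the stored check byte matches the payload.
            if check_pos < len(comment) and comment[check_pos] == zlib.crc32(payload) & 0xFF:
                return payload
        start = comment.find(SYNC_WORD, start + 1)
    return None


# An ordinary text comment followed by the speaker arrangement record.
comment = b"encoded by example" + make_record(["normal", "top", "bottom"])
```

With this layout, `find_record(comment)` recovers the three height codes, while a comment region holding only ordinary text yields `None`, so a decoder in this sketch would fall back to its default speaker arrangement.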

First claim

Opening claim text (preview).

The invention claimed is:

1. A decoding device comprising: processing circuitry including: a decoding unit configured to decode audio data included in an encoded bit stream; a reading unit configured to read sound source position information about a height of a sound source of the audio data from a region which can store arbitrary data of the encoded bit stream; and an output unit configured to output the decoded audio data on the basis of the sound source position information, wherein the sound source position information is information indicating that the height of the sound source is substantially equal to a height of a user, is greater than the height of the user, or is less than the height of the user, wherein identification information for identifying whether the sound source position information is present is stored in the region which can store the arbitrary data, and the reading unit reads the sound source position information on the basis of the identification information, wherein first predetermined identification information and second identification information which is calculated on the basis of the sound source position information are stored as the identification information in the region which can store the arbitrary data, and wherein the reading unit determines that the sound source position information is valid when the first identification information included in the region which can store the arbitrary data is predetermined specific information and the second identification information read from the region which can store the arbitrary data is identical to the second identification information which is calculated on the basis of the read sound source position information.

2. The decoding device according to claim 1, wherein the second identification information is calculated on the basis of information obtained by performing byte alignment for information including the sound source position information.

3. A decoding method comprising: decoding, by processing circuitry, audio data included in an encoded bit stream; reading, by the processing circuitry, sound source position information about a height of a sound source of the audio data from a region which can store arbitrary data of the encoded bit stream; and outputting, by the processing circuitry, the decoded audio data on the basis of the sound source position information, wherein the sound source position information is information indicating that the height of the sound source is substantially equal to a height of a user, is greater than the height of the user, or is less than the height of the user, wherein identification information for identifying whether the sound source position information is present is stored in the region which can store the arbitrary data, and the sound source position information is read on the basis of the identification information, wherein first predetermined identification information and second identification information which is calculated on the basis of the sound source position information are stored as the identification information in the region which can store the arbitrary data, and wherein the sound source position information is determined to be valid when the first identification information included in the region which can store the arbitrary data is predetermined specific information and the second identification information read from the region which can store the arbitrary data is identical to the second identification information which is calculated on the basis of the read sound source position information.

4. A computer-readable storage device encoded with computer-executable instructions that, when executed by processing circuitry, perform a process comprising: decoding audio data included in an encoded bit stream; reading sound source position information about a height of a sound source of the audio data from a region which can store arbitrary data of the encoded bit stream; and outputting the decoded audio data on the basis of the sound source position information, wherein the sound source position information is information indicating that the height of the sound source is substantially equal to a height of a user, is greater than the height of the user, or is less than the height of the user, wherein identification information for identifying whether the sound source position information is present is stored in the region which can store the arbitrary data, and the sound source position information is read on the basis of the identification information, wherein first predetermined identification information and second identification information which is calculated on the basis of the sound source position information are stored as the identification information in the region which can store the arbitrary data, and wherein the sound source position information is determined to be valid when the first identification information included in the region which can store the arbitrary data is predetermined specific information and the second identification information read from the region which can store the arbitrary data is identical to the second identification information which is calculated on the basis of the read sound source position information.
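
The validity test recited in the claims (a fixed first identification value, plus a second value recomputed from byte-aligned position information) can be sketched as follows. Everything concrete here is an assumption made for illustration: the one-byte `SYNC_WORD`, the 2-bit height codes, the count byte, and the low byte of `zlib.crc32` as the calculated identification information are hypothetical, and `build_region` exists only to produce test input. The claims do not specify the calculation or the field widths.

```python
import zlib

SYNC_WORD = 0xA5  # hypothetical "first identification information"
# Hypothetical 2-bit codes for the three height relations named in the claims.
HEIGHT_NAMES = {0: "level with user", 1: "above user", 2: "below user"}


def pack_heights(codes):
    """Pack 2-bit codes MSB first and zero-pad to a byte boundary (byte alignment)."""
    bits = 0
    for code in codes:
        bits = (bits << 2) | code
    pad = (-2 * len(codes)) % 8
    return (bits << pad).to_bytes((2 * len(codes) + pad) // 8, "big")


def build_region(codes):
    """Hypothetical encoder side: sync word, count, aligned payload, check byte."""
    payload = pack_heights(codes)
    return bytes([SYNC_WORD, len(codes)]) + payload + bytes([zlib.crc32(payload) & 0xFF])


def read_heights(region):
    """Return height labels if both identification checks pass, else None."""
    if len(region) < 3 or region[0] != SYNC_WORD:
        return None  # first identification information is not the expected value
    count = region[1]
    nbytes = (2 * count + 7) // 8
    if len(region) < 2 + nbytes + 1:
        return None  # region too short to hold the declared payload
    payload = region[2:2 + nbytes]
    stored_check = region[2 + nbytes]
    if stored_check != zlib.crc32(payload) & 0xFF:
        return None  # stored and recomputed second identification information differ
    codes = [(payload[i // 4] >> (6 - 2 * (i % 4))) & 0b11 for i in range(count)]
    if any(code not in HEIGHT_NAMES for code in codes):
        return None
    return [HEIGHT_NAMES[code] for code in codes]


region = build_region([0, 1, 2])
heights = read_heights(region)
```

In this sketch the check byte is computed over the padded payload rather than over the raw bit string, which mirrors the byte alignment step of claim 2. A region that fails either check returns `None`, leaving the caller to treat the data as an unrelated comment.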

Assignees

  • Sony Corp

Inventors

Classifications

  • in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

  • Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title

  • Positioning of individual sound objects, e.g. moving airplane, within a sound field (H04S2420/13 takes precedence) · CPC title

  • G10L19/167 · Primary

    Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

  • G10L19/008 · Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9542952B2 cover?
The present technique relates to a decoding device, a decoding method, an encoding device, an encoding method, and a program which can obtain a high-quality realistic sound. The encoding device stores speaker arrangement information in a comment region in a PCE of an encoded bit stream and stores a synchronous word and identification information in the comment region such that other publi…
Who is the assignee on this patent?
Sony Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/167. Mapped technology areas include Physics.
When was this patent published?
Publication date Jan 10, 2017 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).