Apparatus and methods for compressing video content using adaptive projection selection

US10313686B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10313686-B2
Application numberUS-201715466559-A
CountryUS
Kind codeB2
Filing dateMar 22, 2017
Priority dateSep 20, 2016
Publication dateJun 4, 2019
Grant dateJun 4, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Apparatus and methods for encoding panoramic content, such as by a wide field of view and large image size. In one implementation, a panoramic image may be mapped to a cube, equirectangular or any other projection e.g., icosahedron or octahedron. Projection may be selected adaptively based on evaluation of the panoramic content. Content evaluation may include obtaining rate distortion cost metric for a given projection configuration including projection type, projection arrangement, and projection orientation. Projection configuration with the lowest cost may be selected as target projection for encoding content. As content composition changes (e.g., object motion, texture presence and/or location) projection may be adaptively selected to match changes in the content. Adaptive content selection methodology may provide for a lower encoded bitrate for a given encoded quality and/or higher quality for a given bitrate.

First claim

Opening claim text (preview).

What is claimed is: 1. A system configured to encode images using adaptive projection selection, the system comprising: a network interface; one or more physical processors; and a storage apparatus in data communication with the one or more physical processors, the storage apparatus comprising a plurality of machine-readable instructions, the plurality of machine-readable instructions configured to, when executed by the one or more physical processors, cause the system to: obtain a content block of data in a first projection; obtain a first rate distortion parameter associated with encoding the content block of data in the first projection; obtain a second rate distortion parameter associated with encoding the content block of data in a second projection, the second projection differing from the first projection; responsive to a determination that the second rate distortion parameter is lower than the first rate distortion parameter, select the second projection for the content block of data; obtain an encoded content block by encoding the content block of data in the second projection; and provide the encoded content block and a second projection configuration parameter to a target destination via the network interface. 2. The system of claim 1 , wherein the first rate distortion parameter and the second rate distortion parameter each include a rate distortion cost parameter. 3. The system of claim 2 , wherein the rate distortion cost parameter comprises a function of a number of bits used to encode an image and a measure of distortion. 4. The system of claim 3 , wherein the measure of distortion corresponds to an image quality metric, the image quality metric configured based on a difference between the encoded content block and the content block of data in the first projection. 5. The system of claim 2 , wherein the rate distortion cost parameter comprises a function of a peak signal-to-noise ratio, the function of the peak signal-to-noise ratio including a measure of mean square error. 6. The system of claim 1 , wherein the first projection and the second projection each comprise a differing projection type. 7. The system of claim 6 , wherein the differing projection types are selected from the group consisting of a cube projection, a hexahedron projection, an icosahedron projection, and an equirectangular projection. 8. The system of claim 1 , wherein the first projection and the second projection each comprise a differing projection facet arrangement. 9. The system of claim 8 , wherein the differing projection facet arrangements each comprise a same projection type. 10. An encoder apparatus configured to encode content, the encoder apparatus comprising: a network interface in data communication with one or more physical processors, the one or more physical processors comprising logic configured to: obtain content in a given projection; encode the content with the given projection along a default orientation; compute a rate distortion cost of encoding the content with the given projection along the default orientation; re-orient the content in a different orientation; encode the content with the given projection along the different orientation; compute a rate distortion cost of encoding the content with the given projection along the different orientation; compare the rate distortion cost of encoding the content with the given projection along the default orientation with the rate distortion cost of encoding the content with the given projection along the different orientation to find an orientation that takes a lowest rate distortion cost to encode the content; and provide the encoded content with the lowest rate distortion cost to the network interface. 11. The encoder apparatus of claim 10 , wherein the one or more physical processors further comprises logic configured to return the lowest rate distortion cost and provide the lowest rate distortion cost to a display device. 12. The encoder apparatus of claim 10 , wherein the computed rate distortion cost comprises a function of a number of bits used to encode an image and a measure of distortion. 13. The encoder apparatus of claim 10 , wherein the computed rate distortion cost comprises a function of a peak signal-to-noise ratio, the function of the peak signal-to-noise ratio including a measure of mean square error. 14. The encoder apparatus of claim 10 , wherein the given projection is selected from the group consisting of an equirectangular projection, a cube map projection, an equal area projection, an icosahedron projection, an octahedron projection, and a segmented sphere projection. 15. The encoder apparatus of claim 10 , further comprising logic configured to: re-orient the content in a third orientation; encode the content with the given projection along the third orientation; compute a rate distortion cost of encoding the content with the given projection along the third orientation; and compare the rate distortion cost of encoding the content with the given projection along the third orientation with the rate distortion cost of encoding the content with the given projection along the default orientation and the rate distortion cost of encoding the content with the given projection along the different orientation in order to find the orientation that takes the lowest rate distortion cost to encode the content. 16. A method of encoding images using adaptive projection selection, the method comprising: obtaining a first rate distortion parameter for encoding an image block in a first image representation; selecting a second image representation, the second image representation differing from the first image representation; obtaining a second rate distortion parameter for encoding the image block in the second image representation; and obtaining a target rate distortion parameter by comparing the first rate distortion parameter with the second rate distortion parameter; wherein the second image representation differs from the first image representation by altering one or more of a projection type, a projection arrangement, and a projection orientation. 17. The method of claim 16 , further comprising determining that the second rate distortion parameter is less than the first rate distortion parameter and assigning the target rate distortion parameter to the second rate distortion parameter. 18. The method of claim 17 , further comprising encoding the image block using the second image representation and providing the second image representation to a network interface. 19. The method of claim 16 , further comprising determining that the first rate distortion parameter is less than the second rate distortion parameter and assigning the target rate distortion parameter to the first rate distortion parameter. 20. The method of claim 16 , wherein the obtaining of the first rate distortion parameter and the obtaining of the second rate distortion parameter comprises computing a rate distortion cost as a function of a number of bits used to encode an image and a measure of distortion.

Assignees

Inventors

Classifications

  • using pre-processing or post-processing specially adapted for video compression · CPC title

  • according to rate distortion criteria (rate-distortion as a criterion for motion estimation H04N19/567) · CPC title

  • H04N19/176Primary

    the region being a block, e.g. a macroblock · CPC title

  • Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction · CPC title

  • specially adapted for multi-view video sequence encoding · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10313686B2 cover?
Apparatus and methods for encoding panoramic content, such as by a wide field of view and large image size. In one implementation, a panoramic image may be mapped to a cube, equirectangular or any other projection e.g., icosahedron or octahedron. Projection may be selected adaptively based on evaluation of the panoramic content. Content evaluation may include obtaining rate distortion cost metr…
Who is the assignee on this patent?
Gopro Inc
What technology area does this patent fall under?
Primary CPC classification H04N19/176. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Jun 04 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).