Method and apparatus for region of interest video coding using tiles and tile groups

US9554133B2 · US · B2

Patent metadata
Field               Value
Publication number  US-9554133-B2
Application number  US-201314030204-A
Country             US
Kind code           B2
Filing date         Sep 18, 2013
Priority date       Sep 18, 2012
Publication date    Jan 24, 2017
Grant date          Jan 24, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, methods, and instrumentalities are disclosed relating to region of interest (ROI) video coding using tiles and tile groups. An encoded video sequence including a plurality of tiles may be received. The plurality of tiles may be divided into one or more tile groups. Signaling indicating parameters of the one or more tile groups may be received. A tile group of the one or more tile groups may be decoded and a picture relating to the decoded tile group may be displayed. The decoded tile group may overlap the ROI. The ROI may correspond to the displayed picture and the displayed picture may be a portion of the encoded video sequence. The tile groups that do not overlap the ROI may not be decoded.
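
To illustrate the selective decoding described in the abstract, here is a minimal Python sketch that picks the tile groups overlapping an ROI, so that the remaining groups can be skipped. It is not taken from the patent: the uniform tile grid, the `Rect` type, and the function names `tile_rects` and `groups_to_decode` are assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle in pixels (hypothetical helper type)."""
    x: int
    y: int
    w: int
    h: int

    def overlaps(self, other: "Rect") -> bool:
        # Two rectangles overlap when they intersect on both axes.
        return (self.x < other.x + other.w and other.x < self.x + self.w
                and self.y < other.y + other.h and other.y < self.y + self.h)


def tile_rects(pic_w: int, pic_h: int, cols: int, rows: int) -> List[Rect]:
    """Assumed uniform tile grid; tiles are listed in raster scan order."""
    rects = []
    for r in range(rows):
        y0, y1 = r * pic_h // rows, (r + 1) * pic_h // rows
        for c in range(cols):
            x0, x1 = c * pic_w // cols, (c + 1) * pic_w // cols
            rects.append(Rect(x0, y0, x1 - x0, y1 - y0))
    return rects


def groups_to_decode(tile_groups: List[List[int]],
                     tiles: List[Rect], roi: Rect) -> List[int]:
    """Indexes of tile groups with at least one tile overlapping the ROI."""
    return [g for g, idxs in enumerate(tile_groups)
            if any(tiles[i].overlaps(roi) for i in idxs)]


# Example: a 1920x1080 picture split into a 4x3 grid and three tile groups.
tiles = tile_rects(1920, 1080, cols=4, rows=3)
tile_groups = [[0, 1, 4, 5], [2, 3, 6, 7], [8, 9, 10, 11]]
roi = Rect(100, 100, 600, 300)
print(groups_to_decode(tile_groups, tiles, roi))
```

In this example only the first tile group overlaps the ROI, so a receiver following this approach would decode that group and leave the other two undecoded.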

First claim

Opening claim text (preview).

The invention claimed is:
1. An encoder for encoding a video sequence, comprising: a processor configured to: partition a picture in the video sequence into tiles; group at least a tile of the tiles into a tile group; send, via a supplemental enhancement information (SEI) message, parameters associated with the tile group, wherein the parameters indicate a number of tile groups, a number of tiles in each of the tile groups, and one or more tile indexes based on a raster scan order; and constrain temporal motion compensated prediction in the tile group, such that the temporal motion compensated prediction of pixels in the tile group in the first picture is limited to using reference pixels that are within the tile group in one or more temporal reference pictures in the video sequence.
2. The encoder of claim 1, wherein the tile group is not spatially contiguous.
3. The encoder of claim 1, wherein the tile belongs to at least two tile groups.
4. The encoder of claim 1, wherein the tile is a first tile, and wherein the processor is further configured to encode a second tile without using a tile group-based temporal motion compensated prediction constraint when the second tile does not belong to any tile group.
5. The encoder of claim 1, wherein the one or more tile indexes comprises an index of each tile in each respective tile group.
6. The encoder of claim 1, wherein the one or more tile indexes indicates a location of the tile among the tiles in the tile group.
7. The encoder of claim 1, wherein the encoder signals the parameters in a picture parameter set (PPS) or a video usability information (VUI) message.
8. The encoder of claim 1, wherein the tile is a first tile, and wherein the processor is further configured to encode a second tile without using a tile group-based temporal motion compensated prediction constraint when the second tile does not belong to the tile group.
9. A device for receiving an encoded video sequence, comprising: a processor configured to: determine a region of interest (ROI) in the encoded video sequence, receive the encoded video sequence that comprises a tile group that overlaps the ROI, receive parameters associated with the tile group, wherein the parameters indicate a number of tile groups, a number of tiles in each of the number of tile groups, and one or more tile indexes based on a raster scan order, determine to decode the tile group based on the received parameters, decode the tile group based on a constraint that temporal motion compensated prediction in the tile group that overlaps the ROI uses reference pixels within the tile group, wherein the temporal motion compensated prediction is performed for a first picture in the encoded video sequence using a second picture in the encoded video sequence, and display a picture based on the decoded tile group.
10. The device of claim 9, wherein the tile group is not spatially contiguous.
11. The device of claim 9, wherein the one or more tile indexes indicates a location of the tile among the number of tiles in the tile group.
12. The device of claim 9, wherein the processor is further configured to skip the decoding of one or more tile groups that do not overlap the ROI.
13. The device of claim 9, wherein the second picture is a reference picture that comprises the reference pixels.
14. A method comprising: partitioning a picture within a video sequence into tiles; grouping at least a tile of the tiles into a tile group; sending, via a supplemental enhancement information (SEI) message, parameters associated with the tile group, wherein the parameters indicate a number of tile groups, a number of tiles in each of the number of tile groups, and one or more tile indexes based on a raster scan order; and constraining temporal motion compensated prediction in the tile group, such that the temporal motion compensated prediction of pixels in the tile group in the first picture is limited to using reference pixels that are within the tile group in one or more temporal reference pictures within the video sequence.
15. The method of claim 14, wherein the tile group is not spatially contiguous.
16. The method of claim 14, wherein the tile belongs to at least two tile groups.
17. The method of claim 14, wherein the one or more tile indexes comprises an index of each tile in each respective tile group, and wherein the one or more tile indexes indicates a location of each tile among the number of tiles in each respective tile group in raster scan order.
18. The method of claim 14, wherein the tile is a first tile, and wherein the processor is further configured to encode a second tile without using a tile group-based temporal motion compensated prediction constraint when the second tile does not belong to any tile group.
19. The method of claim 14, wherein the parameters are signaled in a picture parameter set (PPS) or a video usability information (VUI) message.
20. The method of claim 14, wherein the tile is a first tile, and wherein the processor is further configured to encode a second tile without using a tile group-based temporal motion compensated prediction constraint when the second tile does not belong to the tile group.
21. The device of claim 9, wherein the ROI is determined based on input from a user.
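
The claims combine two ideas: signaled tile group parameters (number of groups, tiles per group, raster-scan tile indexes) and a restriction that temporal prediction only references pixels inside the same tile group. The Python sketch below illustrates both under simplifying assumptions. `TileGroupParams` is a hypothetical container, not the actual SEI syntax, and `reference_block_allowed` assumes a uniform tile grid, integer-pel motion vectors, and no interpolation-filter margin; none of these names come from the patent.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class TileGroupParams:
    """Hypothetical container for the signaled parameters (not real SEI syntax)."""
    groups: List[List[int]]  # per group: tile indexes in raster scan order

    @property
    def num_tile_groups(self) -> int:
        return len(self.groups)

    def num_tiles_in_group(self, g: int) -> int:
        return len(self.groups[g])


def reference_block_allowed(block: Tuple[int, int, int, int],
                            mv: Tuple[int, int],
                            group_tiles: List[int],
                            tile_w: int, tile_h: int,
                            cols: int, rows: int) -> bool:
    """True if the motion-shifted block lies entirely inside the group's tiles."""
    x, y, w, h = block
    rx, ry = x + mv[0], y + mv[1]
    # Simplification for this sketch: references outside the picture are rejected.
    if rx < 0 or ry < 0 or rx + w > cols * tile_w or ry + h > rows * tile_h:
        return False
    allowed = set(group_tiles)
    # Visit every tile touched by the reference block.
    for row in range(ry // tile_h, (ry + h - 1) // tile_h + 1):
        for col in range(rx // tile_w, (rx + w - 1) // tile_w + 1):
            if row * cols + col not in allowed:
                return False
    return True


# Tile 0 belongs to both groups; the second group is not spatially contiguous.
params = TileGroupParams(groups=[[0, 1, 4, 5], [0, 2]])
grid = dict(tile_w=480, tile_h=360, cols=4, rows=3)
block = (400, 300, 16, 16)  # x, y, width, height; located in tile 0

print(params.num_tile_groups,
      [params.num_tiles_in_group(g) for g in range(params.num_tile_groups)])
print(reference_block_allowed(block, (100, 50), params.groups[0], **grid))
print(reference_block_allowed(block, (560, 0), params.groups[0], **grid))
print(reference_block_allowed(block, (70, 0), params.groups[1], **grid))
```

An encoder following this idea would reject or clip candidate motion vectors for which the check fails, so that a decoder holding only the tiles of one group still has every reference pixel it needs.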

Assignees

  • Vid Scale Inc

Inventors

Classifications

  • the unit being an image region, e.g. an object · CPC title

  • Motion estimation with spatial constraints, e.g. at image or region borders · CPC title

  • the region being a slice, e.g. a line of blocks or a group of blocks · CPC title

  • H04N19/105 · Primary

    Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction · CPC title

  • User input · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9554133B2 cover?
Systems, methods, and instrumentalities are disclosed relating to region of interest (ROI) video coding using tiles and tile groups. An encoded video sequence including a plurality of tiles may be received. The plurality of tiles may be divided into one or more tile groups. Signaling indicating parameters of the one or more tile groups may be received. A tile group of the one or more tiles grou…
Who is the assignee on this patent?
Vid Scale Inc
What technology area does this patent fall under?
Primary CPC classification H04N19/105. Mapped technology areas include Electricity.
When was this patent published?
Publication date Jan 24, 2017 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).